Models & Pricing
Models on Sight AI
Compare cost structures, context windows, and capability tiers across Anthropic, Google, OpenAI, Deepseek, Bigmodel, and Qwen to assemble the perfect inference stack.
Model pricing matrix
Unified snapshot of per-million token pricing, modality coverage, and context limits straight
Snapshot refreshed Sept 29, 2025
Multi-provider pricing grid
Provider | Model | Type | Unit | Input ($ / 1M) | Output ($ / 1M) | Context window |
---|---|---|---|---|---|---|
Anthropic | Claude Opus 4.1text tokens | Text (Latest) | text tokens | $16 | $75 | 200K |
Anthropic | Claude Sonnet 4text tokens | Text (Latest) | text tokens | $3 ≤ 200K tokens · $6 > 200K tokens | $15 ≤ 200K tokens · $22.5 > 200K tokens | 200K |
Anthropic | Claude Haiku 3.5text tokens | Text (Latest) | text tokens | $0.8 | $4 | 200K |
Anthropic | Claude Sonnet 3.7text tokens | Text (Legacy) | text tokens | $3 | $15 | 200K |
Anthropic | Claude Haiku 3text tokens | Text (Legacy) | text tokens | $0.25 | $1.25 | 200K |
Deepseek | DeepSeek-V3text tokens | Text (Latest) | text tokens | $0.07 | $1.68 | 128K |
Deepseek | DeepSeek-R1text tokens | Text (Latest) | text tokens | $0.07 | $1.68 | 128K |
Deepseek | deepseek-codertext tokens | Text (Latest) | text tokens | $0.07 | $0.56 | 16K |
Gemini-2.0-flashtext tokens | Text (Latest) | text tokens | $0.075 | $0.3 | 1,048K | |
Gemini-2.5-flashtext tokens | Text (Latest) | text tokens | $2.5 | $10 ≤ 200K tokens · $15 > 200K tokens | 1,000K | |
Gemini-2.5-flash-litetext tokens | Text (Latest) | text tokens | $0.1 | $0.4 | 1,048K | |
Gemini-2.5 Protext tokens | Text (Latest) | text tokens | $1.25 ≤ 200K tokens · $2.5 > 200K tokens | $10 ≤ 200K tokens · $15 > 200K tokens | 1,048K | |
Gemini-pro-visionVision (Standard) | Image | Vision (Standard) | $50 | $200 | — | |
OpenAI | gpt-3.5-turbotext tokens | Legacy text (Standard) | text tokens | $0.5 | $1.5 | 16K |
OpenAI | gpt-4text tokens | Legacy text (Standard) | text tokens | $30 | $60 | 32K |
OpenAI | gpt-4.1text tokens | Text (Standard) | text tokens | $2 | $8 | 1,000K |
OpenAI | gpt-4.1-minitext tokens | Text (Standard) | text tokens | $0.4 | $1.6 | 1,000K |
OpenAI | gpt-4.1-nanotext tokens | Text (Standard) | text tokens | $0.1 | $0.4 | 1,000K |
OpenAI | gpt-4otext tokens | Text (Standard) | text tokens | $2.5 | $10 | 128K |
OpenAI | gpt-4-turbotext tokens | Legacy text (Standard) | text tokens | $10 | $30 | 128K |
OpenAI | gpt-5-nanotext tokens | Text (Standard) | text tokens | $0.05 | $0.4 | 400K |
OpenAI | gpt-5text tokens | Text (Standard) | text tokens | $1.25 | $10 | 400K |
OpenAI | gpt-5-minitext tokens | Text (Standard) | text tokens | $0.25 | $2 | 400K |
OpenAI | o3text tokens | Text (Standard, reasoning) | text tokens | $2 | $8 | 200K |
OpenAI | o4-minitext tokens | Text (Standard, reasoning) | text tokens | $1.1 | $4.4 | 200K |
Bigmodel | GLM-4.5text tokens | Text (Standard, reasoning) | text tokens | $0.57 | $2.25 | 128K |
Alibaba Cloud | Qwen3-coder-plustext tokens | Text (Standard, reasoning) | text tokens | $1.8 | $9 | 1,000K |
Providers included: Anthropic, Deepseek, Google, OpenAI, Bigmodel, Alibaba CloudRates assume standard throughput; regional premiums may apply.