AI Model Pricing Index
Compare API pricing across the major AI providers, with a sortable table, historical trends, and an interactive cost calculator for estimating your monthly spend.
Models tracked: 353
Providers: 54
Cheapest input: $0.01 per 1M tokens
Price range: 15,000x
Full Pricing Table
| Model / use case | Provider | Input / 1M tokens | Output / 1M tokens | Blended / 1M tokens | Quality | Value | Context (tokens) |
|---|---|---|---|---|---|---|---|
Speed & cost | inclusionai | $0.01 | $0.03 | $0.02 | 50 | 2500.0 | 262K |
Open-source | Mistral AI | $0.02 | $0.03 | $0.03 | 72 | 2880.0 | 131K |
Open-source | Meta | $0.02 | $0.05 | $0.04 | 65 | 1857.1 | 16K |
Open-source | Meta | $0.04 | $0.04 | $0.04 | 65 | 1625.0 | 8K |
Hard reasoning | sao10k | $0.04 | $0.05 | $0.04 | 50 | 1111.1 | 8K |
Open-source | | $0.04 | $0.08 | $0.06 | 65 | 1083.3 | 131K |
Open-source | | $0.03 | $0.09 | $0.06 | 65 | 1083.3 | 8K |
Speed & cost | gryphe | $0.06 | $0.06 | $0.06 | 58 | 966.7 | 4K |
Code generation | Alibaba Cloud | $0.03 | $0.09 | $0.06 | 50 | 833.3 | 33K |
Speed & cost | ibm-granite | $0.02 | $0.11 | $0.06 | 62 | 961.2 | 131K |
Open-source | Mistral AI | $0.05 | $0.08 | $0.07 | 72 | 1107.7 | 33K |
Open-source | Alibaba Cloud | $0.04 | $0.10 | $0.07 | 50 | 714.3 | 33K |
Speed & cost | liquid | $0.03 | $0.12 | $0.07 | 50 | 666.7 | 33K |
Open-source | ibm-granite | $0.05 | $0.10 | $0.08 | 50 | 666.7 | 131K |
Open-source | Alibaba Cloud | $0.03 | $0.13 | $0.08 | 50 | 615.4 | 131K |
Open-source | | $0.04 | $0.13 | $0.09 | 74 | 870.6 | 131K |
Speed & cost | OpenAI | $0.03 | $0.14 | $0.09 | 50 | 588.2 | 131K |
Open-source | Alibaba Cloud | $0.07 | $0.10 | $0.09 | 82 | 959.1 | 262K |
Speed & cost | Amazon | $0.04 | $0.14 | $0.09 | 62 | 708.6 | 128K |
Open-source | | $0.06 | $0.12 | $0.09 | 50 | 555.6 | 33K |
Open-source | Cohere | $0.04 | $0.15 | $0.09 | 50 | 533.3 | 128K |
Open-source | Alibaba Cloud | $0.04 | $0.15 | $0.10 | 82 | 863.2 | 256K |
Speed & cost | arcee | $0.04 | $0.15 | $0.10 | 50 | 512.8 | 131K |
Speed & cost | nvidia | $0.04 | $0.16 | $0.10 | 62 | 620.0 | 131K |
Speed & cost | rekaai | $0.10 | $0.10 | $0.10 | 58 | 580.0 | 16K |
Speed & cost | Mistral AI | $0.10 | $0.10 | $0.10 | 58 | 580.0 | 131K |
Z.ai: GLM 4 32B · OSS · Open-source | z-ai | $0.10 | $0.10 | $0.10 | 58 | 580.0 | 128K |
Speed & cost | microsoft | $0.07 | $0.14 | $0.10 | 65 | 634.1 | 16K |
Speed & cost | OpenAI | $0.04 | $0.18 | $0.11 | 50 | 456.6 | 131K |
Open-source | Meta | $0.03 | $0.20 | $0.11 | 50 | 438.6 | 60K |
Open-source | | $0.08 | $0.16 | $0.12 | 74 | 616.7 | 131K |
Speed & cost | nvidia | $0.05 | $0.20 | $0.13 | 62 | 496.0 | 262K |
Open-source | allenai | $0.05 | $0.20 | $0.13 | 50 | 400.0 | 128K |
Open-source | Mistral AI | $0.07 | $0.20 | $0.14 | 72 | 523.6 | 128K |
Search + citations | nousresearch | $0.14 | $0.14 | $0.14 | 65 | 464.3 | 8K |
Speed & cost | essentialai | $0.15 | $0.15 | $0.15 | 58 | 386.7 | 33K |
Speed & cost | Mistral AI | $0.15 | $0.15 | $0.15 | 58 | 386.7 | 262K |
Speed & cost | Amazon | $0.06 | $0.24 | $0.15 | 58 | 386.7 | 300K |
Open-source | Mistral AI | $0.11 | $0.19 | $0.15 | 58 | 386.7 | 3K |
Cheap-and-fast cascade tier | DeepSeek | $0.10 | $0.20 | $0.15 | 80 | 533.3 | 1M |
Speed & cost | bytedance | $0.10 | $0.20 | $0.15 | 58 | 386.7 | 128K |
Speed & cost | rekaai | $0.10 | $0.20 | $0.15 | 58 | 386.7 | 66K |
Speed & cost | Alibaba Cloud | $0.07 | $0.26 | $0.16 | 82 | 504.6 | 1M |
Speed & cost | tencent | $0.07 | $0.26 | $0.16 | 58 | 355.8 | 262K |
Open-source | Alibaba Cloud | $0.10 | $0.24 | $0.17 | 82 | 482.4 | 41K |
Code generation | Alibaba Cloud | $0.07 | $0.27 | $0.17 | 82 | 482.4 | 160K |
Hard reasoning | baidu | $0.07 | $0.28 | $0.18 | 58 | 331.4 | 131K |
Speed & cost | baidu | $0.07 | $0.28 | $0.18 | 58 | 331.4 | 120K |
Speed & cost | arcee | $0.18 | $0.18 | $0.18 | 58 | 322.2 | 131K |
Open-source | Meta | $0.18 | $0.18 | $0.18 | 58 | 322.2 | 164K |
Open-source | Alibaba Cloud | $0.08 | $0.28 | $0.18 | 82 | 455.6 | 41K |
Speed & cost | | $0.07 | $0.30 | $0.19 | 73 | 389.3 | 1M |
Speed & cost | bytedance | $0.07 | $0.30 | $0.19 | 58 | 309.3 | 262K |
Speed & cost | OpenAI | $0.07 | $0.30 | $0.19 | 58 | 309.3 | 131K |
Open-source | Meta | $0.05 | $0.34 | $0.19 | 58 | 300.6 | 80K |
Open-source | Alibaba Cloud | $0.09 | $0.30 | $0.20 | 82 | 420.5 | 262K |
Open-source | | $0.06 | $0.33 | $0.20 | 76 | 389.7 | 262K |
Speed & cost | stepfun | $0.09 | $0.30 | $0.20 | 58 | 297.4 | 262K |
Open-source | Mistral AI | $0.10 | $0.30 | $0.20 | 72 | 360.0 | 33K |
Speed & cost | xiaomi | $0.10 | $0.30 | $0.20 | 58 | 290.0 | 262K |
Speed & cost | Mistral AI | $0.20 | $0.20 | $0.20 | 58 | 290.0 | 262K |
Open-source | Mistral AI | $0.10 | $0.30 | $0.20 | 58 | 290.0 | 32K |
Open-source | Mistral AI | $0.10 | $0.30 | $0.20 | 58 | 290.0 | 131K |
Open-source | Meta | $0.10 | $0.32 | $0.21 | 74 | 352.4 | 131K |
Speed & cost | microsoft | $0.08 | $0.35 | $0.21 | 58 | 269.8 | 131K |
Open-source | Alibaba Cloud | $0.05 | $0.40 | $0.23 | 82 | 364.4 | 41K |
Speed & cost | OpenAI | $0.05 | $0.40 | $0.23 | 72 | 320.0 | 400K |
Speed & cost | z-ai | $0.06 | $0.40 | $0.23 | 58 | 252.2 | 203K |
Hard reasoning | Alibaba Cloud | $0.08 | $0.40 | $0.24 | 82 | 341.7 | 131K |
Open-source | | $0.12 | $0.37 | $0.24 | 76 | 310.2 | 262K |
Open-source | Meta | $0.24 | $0.24 | $0.24 | 50 | 204.1 | 131K |
Speed & cost | | $0.10 | $0.40 | $0.25 | 80 | 320.0 | 1M |
Speed & cost | | $0.10 | $0.40 | $0.25 | 80 | 320.0 | 1M |
Open-source | nvidia | $0.10 | $0.40 | $0.25 | 74 | 296.0 | 131K |
Fastest + cheapest | | $0.10 | $0.40 | $0.25 | 74 | 296.0 | 1M |
Speed & cost | OpenAI | $0.10 | $0.40 | $0.25 | 72 | 288.0 | 1M |
Speed & cost | bytedance | $0.10 | $0.40 | $0.25 | 58 | 232.0 | 262K |
Open-source | Meta | $0.48 | $0.03 | $0.26 | 50 | 194.6 | 131K |
Open-source | Alibaba Cloud | $0.10 | $0.42 | $0.26 | 82 | 315.4 | 131K |
Search + citations | nousresearch | $0.13 | $0.40 | $0.27 | 58 | 218.9 | 131K |
Open-source | Alibaba Cloud | $0.09 | $0.45 | $0.27 | 82 | 303.7 | 41K |
Speed & cost | nvidia | $0.09 | $0.45 | $0.27 | 58 | 214.8 | 262K |
Search + citations | Alibaba Cloud | $0.09 | $0.45 | $0.27 | 58 | 214.8 | 131K |
Open-source | Alibaba Cloud | $0.14 | $0.41 | $0.27 | 58 | 212.5 | 131K |
Longest context | Meta | $0.15 | $0.40 | $0.28 | 71 | 258.2 | 10M |
Hard reasoning | DeepSeek | $0.29 | $0.29 | $0.29 | 91 | 313.8 | 33K |
Open-source | Alibaba Cloud | $0.08 | $0.50 | $0.29 | 82 | 282.8 | 131K |
Search + citations | nousresearch | $0.30 | $0.30 | $0.30 | 74 | 246.7 | 131K |
Open-source coding | Alibaba Cloud | $0.15 | $0.45 | $0.30 | 74 | 246.7 | 131K |
Speed & cost | thedrummer | $0.17 | $0.43 | $0.30 | 58 | 193.3 | 33K |
Open-source | DeepSeek | $0.25 | $0.38 | $0.32 | 87 | 276.2 | 164K |
Open-source | nex-agi | $0.14 | $0.50 | $0.32 | 86 | 270.9 | 131K |
Open-source | Alibaba Cloud | $0.13 | $0.52 | $0.33 | 82 | 252.3 | 131K |
Hard reasoning | allenai | $0.15 | $0.50 | $0.33 | 58 | 178.5 | 66K |
Open-source | DeepSeek | $0.27 | $0.41 | $0.34 | 86 | 252.9 | 164K |
Speed & cost | xAI | $0.20 | $0.50 | $0.35 | 58 | 165.7 | 2M |
Speed & cost | xAI | $0.20 | $0.50 | $0.35 | 58 | 165.7 | 2M |
Open-source | inclusionai | $0.07 | $0.63 | $0.35 | 58 | 165.7 | 262K |
Hard reasoning | inclusionai | $0.07 | $0.63 | $0.35 | 58 | 165.7 | 262K |
Speed & cost | baidu | $0.14 | $0.56 | $0.35 | 58 | 165.7 | 30K |
Speed & cost | tencent | $0.14 | $0.57 | $0.35 | 58 | 163.4 | 131K |
Open-source | DeepSeek | $0.29 | $0.43 | $0.36 | 86 | 239.6 | 164K |
Hard reasoning | Alibaba Cloud | $0.15 | $0.58 | $0.36 | 58 | 158.9 | 131K |
Search + citations | OpenAI | $0.15 | $0.60 | $0.38 | 80 | 213.3 | 128K |
Speed & cost | OpenAI | $0.15 | $0.60 | $0.38 | 80 | 213.3 | 128K |
High throughput | OpenAI | $0.15 | $0.60 | $0.38 | 72 | 192.0 | 128K |
Open-source | Mistral AI | $0.15 | $0.60 | $0.38 | 72 | 192.0 | 262K |
Speed & cost | upstage | $0.15 | $0.60 | $0.38 | 58 | 154.7 | 128K |
Open-source | Cohere | $0.15 | $0.60 | $0.38 | 58 | 154.7 | 128K |
Speed & cost | xAI | $0.30 | $0.50 | $0.40 | 82 | 205.0 | 131K |
Open-source value | Meta | $0.20 | $0.60 | $0.40 | 80 | 200.0 | 1M |
Budget reasoning | xAI | $0.30 | $0.50 | $0.40 | 78 | 195.0 | 131K |
Open-source | Meta | $0.40 | $0.40 | $0.40 | 74 | 185.0 | 131K |
Speed & cost | thedrummer | $0.40 | $0.40 | $0.40 | 66 | 165.0 | 33K |
Speed & cost | nvidia | $0.20 | $0.60 | $0.40 | 62 | 155.0 | 131K |
Open-source | allenai | $0.20 | $0.60 | $0.40 | 58 | 145.0 | 66K |
Speed & cost | thedrummer | $0.30 | $0.50 | $0.40 | 58 | 145.0 | 131K |
Open-source | Alibaba Cloud | $0.20 | $0.60 | $0.40 | 58 | 145.0 | 128K |
Open-source | Mistral AI | $0.20 | $0.60 | $0.40 | 58 | 145.0 | 33K |
Hard reasoning | Alibaba Cloud | $0.10 | $0.78 | $0.44 | 82 | 186.9 | 131K |
Open-source | Mistral AI | $0.35 | $0.56 | $0.45 | 72 | 158.9 | 131K |
Code generation | Alibaba Cloud | $0.11 | $0.80 | $0.46 | 82 | 180.2 | 262K |
Open-source | DeepSeek | $0.20 | $0.77 | $0.48 | 86 | 177.3 | 164K |
Open-source | z-ai | $0.13 | $0.85 | $0.49 | 58 | 118.4 | 131K |
Open-source | DeepSeek | $0.21 | $0.79 | $0.50 | 86 | 172.0 | 33K |
Open-source | Alibaba Cloud | $0.25 | $0.75 | $0.50 | 66 | 132.0 | 33K |
Speed & cost | inception | $0.25 | $0.75 | $0.50 | 58 | 116.0 | 128K |
Speed & cost | meituan | $0.20 | $0.80 | $0.50 | 58 | 116.0 | 131K |
Speed & cost | inception | $0.25 | $0.75 | $0.50 | 58 | 116.0 | 128K |
Code generation | inception | $0.25 | $0.75 | $0.50 | 58 | 116.0 | 128K |
Hard reasoning | Alibaba Cloud | $0.26 | $0.78 | $0.52 | 58 | 111.5 | 1M |
Open-source | Alibaba Cloud | $0.26 | $0.78 | $0.52 | 58 | 111.5 | 1M |
Open-source | Alibaba Cloud | $0.26 | $0.78 | $0.52 | 58 | 111.5 | 1M |
Hard reasoning | arcee | $0.22 | $0.85 | $0.54 | 58 | 108.4 | 262K |
Open-source | Alibaba Cloud | $0.20 | $0.88 | $0.54 | 82 | 151.9 | 262K |
Open-source | Mistral AI | $0.54 | $0.54 | $0.54 | 72 | 133.3 | 33K |
Speed & cost | undi95 | $0.45 | $0.65 | $0.55 | 66 | 120.0 | 6K |
Open-source | Alibaba Cloud | $0.14 | $1.00 | $0.57 | 82 | 144.0 | 262K |
Open-source | Alibaba Cloud | $0.15 | $1.00 | $0.57 | 80 | 139.1 | 262K |
Code generation | Alibaba Cloud | $0.20 | $0.97 | $0.58 | 82 | 140.2 | 1M |
Open-source | Alibaba Cloud | $0.09 | $1.10 | $0.60 | 82 | 137.8 | 262K |
Open-source flagship | Alibaba Cloud | $0.30 | $0.90 | $0.60 | 80 | 133.3 | 131K |
Code generation | Mistral AI | $0.30 | $0.90 | $0.60 | 78 | 130.0 | 256K |
Open-source | z-ai | $0.30 | $0.90 | $0.60 | 58 | 96.7 | 131K |
Open-source | DeepSeek | $0.27 | $0.95 | $0.61 | 86 | 141.0 | 164K |
Speed & cost | microsoft | $0.62 | $0.62 | $0.62 | 62 | 100.0 | 66K |
Speed & cost | minimax | $0.29 | $0.95 | $0.62 | 58 | 93.5 | 197K |
Open-source | Meta | $0.51 | $0.74 | $0.63 | 66 | 105.6 | 8K |
Speed & cost | minimax | $0.26 | $1.00 | $0.63 | 58 | 92.4 | 197K |
Speed & cost | minimax | $0.15 | $1.15 | $0.65 | 58 | 89.2 | 197K |
Code generation | arcee | $0.50 | $0.80 | $0.65 | 66 | 101.5 | 33K |
Open-source | | $0.65 | $0.65 | $0.65 | 65 | 100.0 | 8K |
Speed & cost | prime-intellect | $0.20 | $1.10 | $0.65 | 58 | 89.2 | 131K |
Speed & cost | minimax | $0.20 | $1.10 | $0.65 | 58 | 89.2 | 1M |
Speed & cost | Alibaba Cloud | $0.19 | $1.13 | $0.66 | 80 | 121.9 | 1M |
Speed & cost | thedrummer | $0.55 | $0.80 | $0.68 | 66 | 97.8 | 33K |
Best open-source value | DeepSeek | $0.27 | $1.10 | $0.69 | 86 | 125.5 | 128K |
Speed & cost | baidu | $0.28 | $1.10 | $0.69 | 58 | 84.1 | 123K |
Hard reasoning | sao10k | $0.65 | $0.75 | $0.70 | 66 | 94.3 | 131K |
Hard reasoning | tngtech | $0.30 | $1.10 | $0.70 | 91 | 130.0 | 164K |
Speed & cost | OpenAI | $0.20 | $1.25 | $0.72 | 72 | 99.3 | 400K |
Speed & cost | minimax | $0.28 | $1.20 | $0.74 | 58 | 78.4 | 205K |
Hard reasoning | Alibaba Cloud | $0.12 | $1.36 | $0.74 | 82 | 110.7 | 131K |
Hard reasoning | DeepSeek | $0.70 | $0.80 | $0.75 | 91 | 121.3 | 131K |
Speed & cost | Anthropic | $0.25 | $1.25 | $0.75 | 72 | 96.0 | 200K |
Code generation | kwaipilot | $0.30 | $1.20 | $0.75 | 58 | 77.3 | 256K |
Speed & cost | minimax | $0.30 | $1.20 | $0.75 | 58 | 77.3 | 66K |
Hard reasoning | Alibaba Cloud | $0.15 | $1.50 | $0.82 | 82 | 99.7 | 131K |
Speed & cost | perceptron | $0.15 | $1.50 | $0.82 | 58 | 70.3 | 33K |
Speed & cost | baidu | $0.42 | $1.25 | $0.83 | 66 | 79.0 | 123K |
Hard reasoning | Alibaba Cloud | $0.13 | $1.56 | $0.84 | 82 | 97.0 | 131K |
Hard reasoning | sao10k | $0.85 | $0.85 | $0.85 | 66 | 77.6 | 131K |
Speed & cost | xAI | $0.20 | $1.50 | $0.85 | 58 | 68.2 | 256K |
Speed & cost | mancer | $0.75 | $1.00 | $0.88 | 66 | 75.4 | 8K |
Speed & cost | | $0.25 | $1.50 | $0.88 | 62 | 70.9 | 1M |
Speed & cost | | $0.25 | $1.50 | $0.88 | 58 | 66.3 | 1M |
Open-source | Alibaba Cloud | $0.20 | $1.56 | $0.88 | 82 | 93.4 | 262K |
Open-source | Alibaba Cloud | $0.26 | $1.56 | $0.91 | 82 | 90.1 | 1M |
Speed & cost | arcee | $0.75 | $1.20 | $0.97 | 66 | 67.7 | 131K |
Open-source | Mistral AI | $0.50 | $1.50 | $1.00 | 85 | 85.0 | 262K |
Speed & cost | OpenAI | $0.40 | $1.60 | $1.00 | 80 | 80.0 | 1M |
Speed & cost | morph | $0.80 | $1.20 | $1.00 | 66 | 66.0 | 82K |
Speed & cost | eleutherai | $0.80 | $1.20 | $1.00 | 66 | 66.0 | 4K |
Open-source | alfredpros | $0.80 | $1.20 | $1.00 | 66 | 66.0 | 4K |
Search + citations | Perplexity | $1.00 | $1.00 | $1.00 | 66 | 66.0 | 127K |
Search + citations | nousresearch | $1.00 | $1.00 | $1.00 | 66 | 66.0 | 131K |
Speed & cost | OpenAI | $0.50 | $1.50 | $1.00 | 66 | 66.0 | 16K |
Code generation | Alibaba Cloud | $0.22 | $1.80 | $1.01 | 82 | 81.2 | 262K |
Speed & cost | aion | $0.70 | $1.40 | $1.05 | 66 | 62.9 | 131K |
Open-source | Alibaba Cloud | $0.30 | $1.80 | $1.05 | 80 | 76.2 | 1M |
Speed & cost | relace | $0.85 | $1.25 | $1.05 | 66 | 62.9 | 256K |
Open-source | z-ai | $0.40 | $1.75 | $1.07 | 66 | 61.4 | 203K |
Open-source | z-ai | $0.43 | $1.74 | $1.08 | 66 | 60.8 | 205K |
Speed & cost | OpenAI | $0.25 | $2.00 | $1.13 | 83 | 73.8 | 400K |
Speed & cost | bytedance | $0.25 | $2.00 | $1.13 | 58 | 51.6 | 262K |
Speed & cost | bytedance | $0.25 | $2.00 | $1.13 | 58 | 51.6 | 262K |
Code generation | OpenAI | $0.25 | $2.00 | $1.13 | 58 | 51.6 | 400K |
Open-source | Alibaba Cloud | $0.46 | $1.82 | $1.14 | 82 | 72.1 | 131K |
Speed & cost | Moonshot AI | $0.40 | $1.90 | $1.15 | 89 | 77.4 | 262K |
Open-source | Alibaba Cloud | $0.26 | $2.08 | $1.17 | 82 | 70.1 | 262K |
Open-source | nvidia | $1.20 | $1.20 | $1.20 | 74 | 61.7 | 131K |
Speed & cost | xiaomi | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 262K |
Open-source | Mistral AI | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 262K |
Open-source | Mistral AI | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 131K |
Open-source | z-ai | $0.60 | $1.80 | $1.20 | 66 | 55.0 | 66K |
Open-source | Mistral AI | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 131K |
Open-source | Mistral AI | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 131K |
Open-source | nvidia | $0.60 | $1.80 | $1.20 | 66 | 55.0 | 131K |
Open-source | xiaomi | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 1M |
Speed & cost | aion | $0.80 | $1.60 | $1.20 | 66 | 55.0 | 131K |
Open-source | aion | $0.80 | $1.60 | $1.20 | 65 | 54.2 | 33K |
General purpose | deepcogito | $1.25 | $1.25 | $1.25 | 74 | 59.2 | 128K |
Z.ai: GLM 5 · OSS · Open-source | z-ai | $0.60 | $1.92 | $1.26 | 88 | 69.8 | 80K |
Speed & cost | minimax | $0.40 | $2.20 | $1.30 | 66 | 50.8 | 1M |
Open-source | Alibaba Cloud | $0.52 | $2.08 | $1.30 | 66 | 50.8 | 131K |
Hard reasoning | DeepSeek | $0.50 | $2.15 | $1.32 | 91 | 68.7 | 164K |
Open-source | Alibaba Cloud | $0.39 | $2.34 | $1.36 | 82 | 60.1 | 262K |
Image generation | | $0.30 | $2.50 | $1.40 | 80 | 57.1 | 33K |
Speed & cost | | $0.30 | $2.50 | $1.40 | 80 | 57.1 | 1M |
Speed & cost | morph | $0.90 | $1.90 | $1.40 | 66 | 47.1 | 262K |
Speed & cost | Amazon | $0.30 | $2.50 | $1.40 | 58 | 41.4 | 1M |
Open-source | z-ai | $0.60 | $2.20 | $1.40 | 66 | 47.1 | 131K |
Hard reasoning | Alibaba Cloud | $0.26 | $2.60 | $1.43 | 82 | 57.3 | 131K |
Speed & cost | Moonshot AI | $0.57 | $2.30 | $1.43 | 66 | 46.0 | 131K |
Hard reasoning | sao10k | $1.48 | $1.48 | $1.48 | 74 | 50.0 | 8K |
Open-weight agentic coding | minimax | $0.60 | $2.40 | $1.50 | 89 | 59.3 | 1M |
Speed & cost | OpenAI | $0.60 | $2.40 | $1.50 | 66 | 44.0 | 128K |
Speed & cost | OpenAI | $1.00 | $2.00 | $1.50 | 66 | 44.0 | 4K |
Code generation | xAI | $1.00 | $2.00 | $1.50 | 66 | 44.0 | 256K |
Hard reasoning | Moonshot AI | $0.60 | $2.50 | $1.55 | 66 | 42.6 | 131K |
Speed & cost | Moonshot AI | $0.60 | $2.50 | $1.55 | 66 | 42.6 | 131K |
DeepSeek: R1 · OSS · Hard reasoning | DeepSeek | $0.70 | $2.50 | $1.60 | 91 | 56.9 | 64K |
Speed & cost | baidu | $0.68 | $2.81 | $1.75 | 66 | 37.8 | 66K |
Speed & cost | | $0.50 | $3.00 | $1.75 | 80 | 45.7 | 1M |
Open-source | Alibaba Cloud | $0.30 | $3.20 | $1.75 | 80 | 45.7 | 262K |
General purpose | OpenAI | $1.50 | $2.00 | $1.75 | 74 | 42.3 | 4K |
Image generation | | $0.50 | $3.00 | $1.75 | 66 | 37.7 | 66K |
Agentic tasks & real-time info | xAI | $1.25 | $2.50 | $1.88 | 93 | 49.6 | 1M |
General purpose | xAI | $1.25 | $2.50 | $1.88 | 93 | 49.6 | 2M |
Code generation | Alibaba Cloud | $0.65 | $3.25 | $1.95 | 82 | 42.1 | 1M |
Speed & cost | xiaomi | $1.00 | $3.00 | $2.00 | 66 | 33.0 | 1M |
Search + citations | relace | $1.00 | $3.00 | $2.00 | 66 | 33.0 | 256K |
Search + citations | nousresearch | $1.00 | $3.00 | $2.00 | 66 | 33.0 | 131K |
Speed & cost | Amazon | $0.80 | $3.20 | $2.00 | 66 | 33.0 | 300K |
Open-source | xiaomi | $1.00 | $3.00 | $2.00 | 66 | 33.0 | 1M |
Open-weight agentic & tool use | z-ai | $0.98 | $3.08 | $2.03 | 88 | 43.3 | 200K |
Speed & cost | arcee | $0.90 | $3.30 | $2.10 | 66 | 31.4 | 131K |
Frontier quality at low cost | Moonshot AI | $0.73 | $3.49 | $2.11 | 92 | 43.6 | 256K |
Speed & cost | switchpoint | $0.85 | $3.40 | $2.13 | 66 | 31.1 | 131K |
Image generation | OpenAI | $2.50 | $2.00 | $2.25 | 74 | 32.9 | 400K |
Hard reasoning | Alibaba Cloud | $0.78 | $3.90 | $2.34 | 82 | 35.0 | 262K |
Open-source | Alibaba Cloud | $0.78 | $3.90 | $2.34 | 82 | 35.0 | 262K |
Speed & cost | Anthropic | $0.80 | $4.00 | $2.40 | 75 | 31.3 | 200K |
Open-source | z-ai | $1.20 | $4.00 | $2.60 | 74 | 28.5 | 203K |
Open-source | z-ai | $1.20 | $4.00 | $2.60 | 74 | 28.5 | 203K |
Qwen: Qwen-Max · OSS · Open-source | Alibaba Cloud | $1.04 | $4.16 | $2.60 | 74 | 28.5 | 33K |
Open-source value leader | DeepSeek | $1.74 | $3.48 | $2.61 | 90 | 34.5 | 1M |
Speed & cost | OpenAI | $0.75 | $4.50 | $2.63 | 83 | 31.6 | 400K |
Reasoning & math | OpenAI | $1.10 | $4.40 | $2.75 | 88 | 32.0 | 200K |
Hard reasoning | OpenAI | $1.10 | $4.40 | $2.75 | 82 | 29.8 | 200K |
Hard reasoning | OpenAI | $1.10 | $4.40 | $2.75 | 82 | 29.8 | 200K |
Hard reasoning | OpenAI | $1.10 | $4.40 | $2.75 | 82 | 29.8 | 200K |
Speed & cost | Anthropic | $1.00 | $5.00 | $3.00 | 76 | 25.3 | 200K |
Hard reasoning | sao10k | $3.00 | $3.00 | $3.00 | 74 | 24.7 | 16K |
Speed & cost | writer | $0.60 | $6.00 | $3.30 | 66 | 20.0 | 1M |
Multilingual & APAC | Alibaba Cloud | $1.40 | $5.60 | $3.50 | 86 | 24.6 | 256K |
General purpose | OpenAI | $3.00 | $4.00 | $3.50 | 74 | 21.1 | 16K |
Open-source | Alibaba Cloud | $1.04 | $6.24 | $3.64 | 90 | 24.7 | 262K |
Open-source | Mistral AI | $2.00 | $6.00 | $4.00 | 85 | 21.3 | 131K |
Open-source | Mistral AI | $2.00 | $6.00 | $4.00 | 85 | 21.3 | 128K |
Multilingual | Mistral AI | $2.00 | $6.00 | $4.00 | 79 | 19.8 | 128K |
General purpose | xAI | $2.00 | $6.00 | $4.00 | 74 | 18.5 | 2M |
Open-source | Mistral AI | $2.00 | $6.00 | $4.00 | 74 | 18.5 | 131K |
General purpose | anthracite-org | $3.00 | $5.00 | $4.00 | 74 | 18.5 | 16K |
Open-source | Mistral AI | $2.00 | $6.00 | $4.00 | 72 | 18.0 | 66K |
Open-source | Mistral AI | $1.50 | $7.50 | $4.50 | 82 | 18.2 | 262K |
Deep research | OpenAI | $2.00 | $8.00 | $5.00 | 96 | 19.2 | 200K |
Long autonomous agentic runs | Alibaba Cloud | $2.50 | $7.50 | $5.00 | 94 | 18.8 | 1M |
Long context | OpenAI | $2.00 | $8.00 | $5.00 | 89 | 17.8 | 1M |
General purpose | ai21 | $2.00 | $8.00 | $5.00 | 74 | 14.8 | 256K |
Search + citations | Perplexity | $2.00 | $8.00 | $5.00 | 74 | 14.8 | 128K |
Deep research | Perplexity | $2.00 | $8.00 | $5.00 | 74 | 14.8 | 128K |
Speed & cost | | $1.50 | $9.00 | $5.25 | 84 | 16.0 | 1M |
Code generation | OpenAI | $1.25 | $10.00 | $5.63 | 93 | 16.5 | 400K |
General purpose | OpenAI | $1.25 | $10.00 | $5.63 | 93 | 16.5 | 400K |
General purpose | OpenAI | $1.25 | $10.00 | $5.63 | 93 | 16.5 | 128K |
Code generation | OpenAI | $1.25 | $10.00 | $5.63 | 93 | 16.5 | 400K |
Multimodal + value | | $1.25 | $10.00 | $5.63 | 92 | 16.4 | 1M |
Speed & cost | | $1.25 | $10.00 | $5.63 | 91 | 16.2 | 1M |
Speed & cost | | $1.25 | $10.00 | $5.63 | 91 | 16.2 | 1M |
General purpose | OpenAI | $1.25 | $10.00 | $5.63 | 90 | 16.0 | 400K |
General purpose | alpindale | $3.75 | $7.50 | $5.63 | 82 | 14.6 | 6K |
Code generation | OpenAI | $1.25 | $10.00 | $5.63 | 74 | 13.2 | 400K |
General purpose | OpenAI | $1.25 | $10.00 | $5.63 | 74 | 13.2 | 128K |
General purpose | aion | $4.00 | $8.00 | $6.00 | 82 | 13.7 | 131K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 88 | 14.1 | 128K |
Search + citations | OpenAI | $2.50 | $10.00 | $6.25 | 88 | 14.1 | 128K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 88 | 14.1 | 128K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 88 | 14.1 | 128K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 85 | 13.6 | 128K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 74 | 11.8 | 128K |
General purpose | Cohere | $2.50 | $10.00 | $6.25 | 74 | 11.8 | 256K |
General purpose | inflection | $2.50 | $10.00 | $6.25 | 74 | 11.8 | 8K |
General purpose | inflection | $2.50 | $10.00 | $6.25 | 74 | 11.8 | 8K |
Enterprise RAG | Cohere | $2.50 | $10.00 | $6.25 | 68 | 10.9 | 128K |
Speed & cost | | $2.00 | $12.00 | $7.00 | 96 | 13.7 | 1M |
Science & long-context | | $2.00 | $12.00 | $7.00 | 96 | 13.7 | 1M |
Image generation | | $2.00 | $12.00 | $7.00 | 94 | 13.4 | 66K |
General purpose | Amazon | $2.50 | $12.50 | $7.50 | 74 | 9.9 | 1M |
General purpose | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 128K |
Code generation | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 400K |
Code generation | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 400K |
General purpose | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 128K |
General purpose | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 400K |
General purpose | OpenAI | $2.50 | $15.00 | $8.75 | 93 | 10.6 | 1M |
Coding & balance | Anthropic | $3.00 | $15.00 | $9.00 | 90 | 10.0 | 1M |
General purpose | xAI | $3.00 | $15.00 | $9.00 | 90 | 10.0 | 131K |
General purpose | Anthropic | $3.00 | $15.00 | $9.00 | 88 | 9.8 | 1M |
Coding & balance | Anthropic | $3.00 | $15.00 | $9.00 | 88 | 9.8 | 200K |
Real-time info | xAI | $3.00 | $15.00 | $9.00 | 87 | 9.7 | 131K |
General purpose | Anthropic | $3.00 | $15.00 | $9.00 | 86 | 9.6 | 200K |
Hard reasoning | Anthropic | $3.00 | $15.00 | $9.00 | 86 | 9.6 | 200K |
Search + citations | Perplexity | $3.00 | $15.00 | $9.00 | 78 | 8.7 | 200K |
Search + citations | Perplexity | $3.00 | $15.00 | $9.00 | 74 | 8.2 | 200K |
General purpose | xAI | $3.00 | $15.00 | $9.00 | 74 | 8.2 | 256K |
Multimodal | OpenAI | $10.00 | $10.00 | $10.00 | 88 | 8.8 | 400K |
General purpose | OpenAI | $5.00 | $15.00 | $10.00 | 88 | 8.8 | 128K |
Complex analysis | OpenAI | $8.00 | $15.00 | $11.50 | 93 | 8.1 | 272K |
Multimodal | OpenAI | $6.00 | $18.00 | $12.00 | 88 | 7.3 | 128K |
Coding, agents & computer use | Anthropic | $5.00 | $25.00 | $15.00 | 99 | 6.6 | 1M |
Coding & agentic workflows | Anthropic | $5.00 | $25.00 | $15.00 | 96 | 6.4 | 1M |
General purpose | Anthropic | $5.00 | $25.00 | $15.00 | 95 | 6.3 | 1M |
General purpose | Anthropic | $5.00 | $25.00 | $15.00 | 95 | 6.3 | 200K |
Frontier general purpose | OpenAI | $5.00 | $30.00 | $17.50 | 97 | 5.5 | 1M |
Multimodal | OpenAI | $10.00 | $30.00 | $20.00 | 88 | 4.4 | 128K |
Complex analysis | OpenAI | $10.00 | $30.00 | $20.00 | 88 | 4.4 | 128K |
Multimodal | OpenAI | $10.00 | $30.00 | $20.00 | 88 | 4.4 | 128K |
Deep research | OpenAI | $10.00 | $40.00 | $25.00 | 96 | 3.8 | 200K |
Hard reasoning | OpenAI | $10.00 | $40.00 | $25.00 | 94 | 3.8 | 200K |
Frontier agentic coding & knowledge work | Anthropic | $10.00 | $50.00 | $30.00 | 100 | 3.3 | 1M |
Hard reasoning | OpenAI | $15.00 | $60.00 | $37.50 | 88 | 2.3 | 200K |
Multimodal | Anthropic | $15.00 | $75.00 | $45.00 | 94 | 2.1 | 200K |
Complex analysis | OpenAI | $30.00 | $60.00 | $45.00 | 93 | 2.1 | 8K |
Multimodal | OpenAI | $30.00 | $60.00 | $45.00 | 93 | 2.1 | 8K |
Complex analysis | Anthropic | $15.00 | $75.00 | $45.00 | 91 | 2.0 | 200K |
Hard reasoning | OpenAI | $20.00 | $80.00 | $50.00 | 96 | 1.9 | 200K |
Complex analysis | OpenAI | $15.00 | $120.00 | $67.50 | 88 | 1.3 | 400K |
Complex analysis | Anthropic | $30.00 | $150.00 | $90.00 | 97 | 1.1 | 1M |
Complex analysis | Anthropic | $30.00 | $150.00 | $90.00 | 95 | 1.1 | 1M |
Complex analysis | OpenAI | $21.00 | $168.00 | $94.50 | 97 | 1.0 | 400K |
Reasoning at any cost | OpenAI | $30.00 | $180.00 | $105.00 | 98 | 0.9 | 1M |
Complex analysis | OpenAI | $30.00 | $180.00 | $105.00 | 97 | 0.9 | 1M |
Hard reasoning | OpenAI | $150.00 | $600.00 | $375.00 | 93 | 0.2 | 200K |
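The table does not say how the Blended and Value columns are computed. The published figures are consistent with Blended being the simple average of the input and output prices, and Value being Quality divided by Blended. The sketch below checks that reading against three rows copied from the table; both formulas and the function names are assumptions inferred from the numbers, not a documented method.

```python
def blended_price(input_per_m: float, output_per_m: float) -> float:
    """Assumed formula: unweighted mean of input and output price per 1M tokens."""
    return (input_per_m + output_per_m) / 2


def value_score(quality: float, blended: float) -> float:
    """Assumed formula: quality points per blended dollar."""
    return quality / blended


# (input $/1M, output $/1M, quality) copied from three rows of the table.
rows = [
    (0.01, 0.03, 50),      # listed as Blended $0.02, Value 2500.0
    (3.00, 15.00, 90),     # listed as Blended $9.00, Value 10.0
    (150.00, 600.00, 93),  # listed as Blended $375.00, Value 0.2
]

for input_per_m, output_per_m, quality in rows:
    blended = blended_price(input_per_m, output_per_m)
    value = value_score(quality, blended)
    print(f"blended=${blended:.2f}  value={value:.1f}")
```

Because the table shows prices rounded to the cent, recomputing Value from the displayed prices can differ slightly from the listed figure on some rows.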
Estimate Your Monthly Cost
Costs below are projected over one month at current public list-price API rates. The figures shown correspond to a request shape of 500 input tokens and 300 output tokens per request, at 100,000 requests per month.
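The projection itself is simple arithmetic on per-1M-token prices. The sketch below assumes a request shape of 500 input tokens, 300 output tokens and 100,000 requests per month; those values are inferred from the breakdown rows rather than stated by the page, and the `RequestShape` and `monthly_cost` names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RequestShape:
    input_tokens: int        # tokens sent per request
    output_tokens: int       # tokens generated per request
    requests_per_month: int  # monthly request volume


def monthly_cost(input_per_m: float, output_per_m: float, shape: RequestShape):
    """Return (cost per request, input cost/mo, output cost/mo, total/mo) in dollars."""
    input_per_request = shape.input_tokens * input_per_m / 1_000_000
    output_per_request = shape.output_tokens * output_per_m / 1_000_000
    input_month = input_per_request * shape.requests_per_month
    output_month = output_per_request * shape.requests_per_month
    return (
        input_per_request + output_per_request,
        input_month,
        output_month,
        input_month + output_month,
    )


# Inferred shape; prices are the $0.01 in / $0.03 out row (Ling-2.6-flash).
shape = RequestShape(input_tokens=500, output_tokens=300, requests_per_month=100_000)
per_request, input_mo, output_mo, total = monthly_cost(0.01, 0.03, shape)
print(f"${per_request:.6f} per request, ${total:.2f} per month")
# prints "$0.000014 per request, $1.40 per month"
```

Swapping in the $150.00 in / $600.00 out prices gives $25,500 per month, which matches the most expensive entry.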
| Pick | Model | Cost per month at this volume |
|---|---|---|
| Cheapest | inclusionAI: Ling-2.6-flash | $1.40 |
| Best value (quality ≥ 80) | Qwen: Qwen3.5-9B · Q 82 | $6.50 |
| Most expensive | OpenAI: o1-pro | $25,500 |
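The three picks can be reproduced with a filter and a sort. This sketch assumes that "best value" means the lowest monthly cost among models with quality of at least 80; the page does not define the rule. Apart from the Q 82 shown for Qwen3.5-9B, quality scores are matched to model names by provider and price, which is an inference, and the `Model` type and `monthly_total` helper are hypothetical.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    input_per_m: float   # $ per 1M input tokens
    output_per_m: float  # $ per 1M output tokens
    quality: int


def monthly_total(m: Model, input_tokens=500, output_tokens=300, requests=100_000) -> float:
    """Monthly dollar cost at the assumed request shape."""
    per_request = (input_tokens * m.input_per_m + output_tokens * m.output_per_m) / 1_000_000
    return per_request * requests


# A small sample of rows; quality values are matched by provider and price.
models = [
    Model("inclusionAI: Ling-2.6-flash", 0.01, 0.03, 50),
    Model("Mistral: Mistral Nemo", 0.02, 0.03, 72),
    Model("Qwen: Qwen3.5-9B", 0.04, 0.15, 82),
    Model("Qwen: Qwen3 235B A22B Instruct 2507", 0.071, 0.10, 82),
    Model("OpenAI: o1-pro", 150.00, 600.00, 93),
]

cheapest = min(models, key=monthly_total)
# Assumed rule: cheapest model that clears the quality threshold.
best_value = min((m for m in models if m.quality >= 80), key=monthly_total)
most_expensive = max(models, key=monthly_total)

for label, m in [("Cheapest", cheapest), ("Best value", best_value), ("Most expensive", most_expensive)]:
    print(f"{label}: {m.name} at ${monthly_total(m):,.2f}/mo")
```

On this sample the sketch selects the same three models as the summary above.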
Save 30-60% with Mixture-of-Routers
Most production traffic is of mixed difficulty. Send the easy 60% of requests to a cheap model and the hard 10% to a frontier model: the same quality at a fraction of the cost.
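A rough way to see the effect of routing is to weight each tier's per-request cost by its share of traffic. The sketch below is illustrative only: the page does not say where the remaining 30% of requests go, so a mid-priced tier is assumed for them, and the three price points are picked from the table as stand-ins. The function names and the 500/300/100,000 request shape are also assumptions.

```python
import math


def cost_per_request(input_per_m, output_per_m, input_tokens=500, output_tokens=300):
    """Dollar cost of one request at the given per-1M-token prices."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


def routed_monthly_cost(tiers, requests_per_month=100_000):
    """Monthly cost when traffic is split across tiers.

    tiers: list of (share of traffic, input $/1M, output $/1M); shares must sum to 1.
    """
    total_share = sum(share for share, _, _ in tiers)
    if not math.isclose(total_share, 1.0):
        raise ValueError(f"traffic shares sum to {total_share}, expected 1.0")
    return requests_per_month * sum(
        share * cost_per_request(inp, out) for share, inp, out in tiers
    )


tiers = [
    (0.60, 0.04, 0.15),   # easy traffic -> a cheap model
    (0.30, 1.25, 10.00),  # ASSUMPTION: remaining traffic -> a mid-priced model
    (0.10, 5.00, 25.00),  # hard traffic -> a frontier model
]
routed = routed_monthly_cost(tiers)
all_frontier = routed_monthly_cost([(1.0, 5.00, 25.00)])
saving = 1 - routed / all_frontier
print(f"routed ${routed:,.2f}/mo vs all-frontier ${all_frontier:,.2f}/mo ({saving:.0%} lower)")
```

The resulting saving depends heavily on the baseline, the split and the models chosen, so this is not a reproduction of the 30-60% figure.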
Full breakdown by model
Sorted cheapest to most expensive
| Model | Cost / request | Input cost / mo | Output cost / mo | Total / mo |
|---|---|---|---|---|
inclusionAI: Ling-2.6-flash $0.01 in / $0.03 out per 1M | $0.000014 | $0.50 | $0.90 | $1.40 |
Mistral: Mistral Nemo $0.02 in / $0.03 out per 1M | $0.000019 | $1.00 | $0.90 | $1.90 |
Meta: Llama 3.1 8B Instruct $0.02 in / $0.05 out per 1M | $0.000025 | $1.00 | $1.50 | $2.50 |
Meta: Llama 3 8B Instruct $0.04 in / $0.04 out per 1M | $0.000032 | $2.00 | $1.20 | $3.20 |
Sao10K: Llama 3 8B Lunaris $0.04 in / $0.05 out per 1M | $0.000035 | $2.00 | $1.50 | $3.50 |
Google: Gemma 2 9B $0.03 in / $0.09 out per 1M | $0.000042 | $1.50 | $2.70 | $4.20 |
Qwen: Qwen2.5 Coder 7B Instruct $0.03 in / $0.09 out per 1M | $0.000042 | $1.50 | $2.70 | $4.20 |
IBM: Granite 4.0 Micro $0.017 in / $0.112 out per 1M | $0.000042 | $0.85 | $3.36 | $4.21 |
Google: Gemma 3 4B $0.04 in / $0.08 out per 1M | $0.000044 | $2.00 | $2.40 | $4.40 |
MythoMax 13B $0.06 in / $0.06 out per 1M | $0.000048 | $3.00 | $1.80 | $4.80 |
Mistral: Mistral Small 3 $0.05 in / $0.08 out per 1M | $0.000049 | $2.50 | $2.40 | $4.90 |
Qwen: Qwen2.5 7B Instruct $0.04 in / $0.1 out per 1M | $0.000050 | $2.00 | $3.00 | $5.00 |
LiquidAI: LFM2-24B-A2B $0.03 in / $0.12 out per 1M | $0.000051 | $1.50 | $3.60 | $5.10 |
IBM: Granite 4.1 8B $0.05 in / $0.1 out per 1M | $0.000055 | $2.50 | $3.00 | $5.50 |
Qwen: Qwen-Turbo $0.0325 in / $0.13 out per 1M | $0.000055 | $1.63 | $3.90 | $5.53 |
OpenAI: gpt-oss-20b $0.03 in / $0.14 out per 1M | $0.000057 | $1.50 | $4.20 | $5.70 |
Google: Gemma 3 12B $0.04 in / $0.13 out per 1M | $0.000059 | $2.00 | $3.90 | $5.90 |
Amazon: Nova Micro 1.0 $0.035 in / $0.14 out per 1M | $0.000060 | $1.75 | $4.20 | $5.95 |
Cohere: Command R7B (12-2024) $0.0375 in / $0.15 out per 1M | $0.000064 | $1.88 | $4.50 | $6.38 |
Qwen: Qwen3.5-9B $0.04 in / $0.15 out per 1M | $0.000065 | $2.00 | $4.50 | $6.50 |
Qwen: Qwen3 235B A22B Instruct 2507 $0.071 in / $0.1 out per 1M | $0.000065 | $3.55 | $3.00 | $6.55 |
Google: Gemma 3n 4B $0.06 in / $0.12 out per 1M | $0.000066 | $3.00 | $3.60 | $6.60 |
Arcee AI: Trinity Mini $0.045 in / $0.15 out per 1M | $0.000068 | $2.25 | $4.50 | $6.75 |
NVIDIA: Nemotron Nano 9B V2 $0.04 in / $0.16 out per 1M | $0.000068 | $2.00 | $4.80 | $6.80 |
OpenAI: gpt-oss-120b $0.039 in / $0.18 out per 1M | $0.000073 | $1.95 | $5.40 | $7.35 |
Meta: Llama 3.2 1B Instruct $0.027 in / $0.201 out per 1M | $0.000074 | $1.35 | $6.03 | $7.38 |
Microsoft: Phi 4 $0.065 in / $0.14 out per 1M | $0.000075 | $3.25 | $4.20 | $7.45 |
Reka Edge $0.1 in / $0.1 out per 1M | $0.000080 | $5.00 | $3.00 | $8.00 |
Mistral: Ministral 3 3B 2512 $0.1 in / $0.1 out per 1M | $0.000080 | $5.00 | $3.00 | $8.00 |
Z.ai: GLM 4 32B $0.1 in / $0.1 out per 1M | $0.000080 | $5.00 | $3.00 | $8.00 |
NVIDIA: Nemotron 3 Nano 30B A3B $0.05 in / $0.2 out per 1M | $0.000085 | $2.50 | $6.00 | $8.50 |
AllenAI: Olmo 2 32B Instruct $0.05 in / $0.2 out per 1M | $0.000085 | $2.50 | $6.00 | $8.50 |
Google: Gemma 3 27B $0.08 in / $0.16 out per 1M | $0.000088 | $4.00 | $4.80 | $8.80 |
Mistral: Mistral Small 3.2 24B $0.075 in / $0.2 out per 1M | $0.000097 | $3.75 | $6.00 | $9.75 |
Amazon: Nova Lite 1.0 $0.06 in / $0.24 out per 1M | $0.000102 | $3.00 | $7.20 | $10.20 |
DeepSeek: DeepSeek V4 Flash $0.1 in / $0.2 out per 1M | $0.000110 | $5.00 | $6.00 | $11.00 |
ByteDance: UI-TARS 7B $0.1 in / $0.2 out per 1M | $0.000110 | $5.00 | $6.00 | $11.00 |
Reka Flash 3 $0.1 in / $0.2 out per 1M | $0.000110 | $5.00 | $6.00 | $11.00 |
Qwen: Qwen3.5-Flash $0.065 in / $0.26 out per 1M | $0.000111 | $3.25 | $7.80 | $11.05 |
Tencent: Hy3 preview $0.066 in / $0.26 out per 1M | $0.000111 | $3.30 | $7.80 | $11.10 |
Mistral: Mistral 7B Instruct v0.1 $0.11 in / $0.19 out per 1M | $0.000112 | $5.50 | $5.70 | $11.20 |
NousResearch: Hermes 2 Pro - Llama-3 8B $0.14 in / $0.14 out per 1M | $0.000112 | $7.00 | $4.20 | $11.20 |
Qwen: Qwen3 Coder 30B A3B Instruct $0.07 in / $0.27 out per 1M | $0.000116 | $3.50 | $8.10 | $11.60 |
Baidu: ERNIE 4.5 21B A3B Thinking $0.07 in / $0.28 out per 1M | $0.000119 | $3.50 | $8.40 | $11.90 |
Baidu: ERNIE 4.5 21B A3B $0.07 in / $0.28 out per 1M | $0.000119 | $3.50 | $8.40 | $11.90 |
EssentialAI: Rnj 1 Instruct $0.15 in / $0.15 out per 1M | $0.000120 | $7.50 | $4.50 | $12.00 |
Mistral: Ministral 3 8B 2512 $0.15 in / $0.15 out per 1M | $0.000120 | $7.50 | $4.50 | $12.00 |
Qwen: Qwen3 14B $0.1 in / $0.24 out per 1M | $0.000122 | $5.00 | $7.20 | $12.20 |
Qwen: Qwen3 32B $0.08 in / $0.28 out per 1M | $0.000124 | $4.00 | $8.40 | $12.40 |
Meta: Llama 3.2 3B Instruct $0.0509 in / $0.335 out per 1M | $0.000126 | $2.54 | $10.05 | $12.60 |
Google: Gemini 2.0 Flash Lite $0.075 in / $0.3 out per 1M | $0.000128 | $3.75 | $9.00 | $12.75 |
ByteDance Seed: Seed 1.6 Flash $0.075 in / $0.3 out per 1M | $0.000128 | $3.75 | $9.00 | $12.75 |
OpenAI: gpt-oss-safeguard-20b $0.075 in / $0.3 out per 1M | $0.000128 | $3.75 | $9.00 | $12.75 |
Google: Gemma 4 26B A4B $0.06 in / $0.33 out per 1M | $0.000129 | $3.00 | $9.90 | $12.90 |
Qwen: Qwen3 30B A3B Instruct 2507 $0.09 in / $0.3 out per 1M | $0.000135 | $4.50 | $9.00 | $13.50 |
StepFun: Step 3.5 Flash $0.09 in / $0.3 out per 1M | $0.000135 | $4.50 | $9.00 | $13.50 |
Mistral: Mistral Small Creative $0.1 in / $0.3 out per 1M | $0.000140 | $5.00 | $9.00 | $14.00 |
Xiaomi: MiMo-V2-Flash $0.1 in / $0.3 out per 1M | $0.000140 | $5.00 | $9.00 | $14.00 |
Mistral: Voxtral Small 24B 2507 $0.1 in / $0.3 out per 1M | $0.000140 | $5.00 | $9.00 | $14.00 |
Mistral: Devstral Small 1.1 $0.1 in / $0.3 out per 1M | $0.000140 | $5.00 | $9.00 | $14.00 |
Arcee AI: Spotlight $0.18 in / $0.18 out per 1M | $0.000144 | $9.00 | $5.40 | $14.40 |
Meta: Llama Guard 4 12B $0.18 in / $0.18 out per 1M | $0.000144 | $9.00 | $5.40 | $14.40 |
Qwen: Qwen3 8B $0.05 in / $0.4 out per 1M | $0.000145 | $2.50 | $12.00 | $14.50 |
OpenAI: GPT-5 Nano $0.05 in / $0.4 out per 1M | $0.000145 | $2.50 | $12.00 | $14.50 |
Microsoft: Phi 4 Mini Instruct $0.08 in / $0.35 out per 1M | $0.000145 | $4.00 | $10.50 | $14.50 |
Meta: Llama 3.3 70B Instruct $0.1 in / $0.32 out per 1M | $0.000146 | $5.00 | $9.60 | $14.60 |
Z.ai: GLM 4.7 Flash $0.06 in / $0.4 out per 1M | $0.000150 | $3.00 | $12.00 | $15.00 |
Qwen: Qwen3 30B A3B Thinking 2507 $0.08 in / $0.4 out per 1M | $0.000160 | $4.00 | $12.00 | $16.00 |
Mistral: Ministral 3 14B 2512 $0.2 in / $0.2 out per 1M | $0.000160 | $10.00 | $6.00 | $16.00 |
Google: Gemini 2.5 Flash Lite Preview 09-2025 $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
Google: Gemini 2.5 Flash Lite $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
Google: Gemini 2.0 Flash $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
OpenAI: GPT-4.1 Nano $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
ByteDance Seed: Seed-2.0-Mini $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
Google: Gemma 4 31B $0.12 in / $0.37 out per 1M | $0.000171 | $6.00 | $11.10 | $17.10 |
Qwen: Qwen3 VL 32B Instruct $0.104 in / $0.416 out per 1M | $0.000177 | $5.20 | $12.48 | $17.68 |
Qwen: Qwen3 30B A3B $0.09 in / $0.45 out per 1M | $0.000180 | $4.50 | $13.50 | $18.00 |
NVIDIA: Nemotron 3 Super $0.09 in / $0.45 out per 1M | $0.000180 | $4.50 | $13.50 | $18.00 |
Tongyi DeepResearch 30B A3B $0.09 in / $0.45 out per 1M | $0.000180 | $4.50 | $13.50 | $18.00 |
Nous: Hermes 4 70B $0.13 in / $0.4 out per 1M | $0.000185 | $6.50 | $12.00 | $18.50 |
Qwen: Qwen3 VL 8B Instruct $0.08 in / $0.5 out per 1M | $0.000190 | $4.00 | $15.00 | $19.00 |
Qwen: Qwen VL Plus $0.1365 in / $0.4095 out per 1M | $0.000191 | $6.83 | $12.29 | $19.11 |
Meta: Llama 4 Scout $0.15 in / $0.4 out per 1M | $0.000195 | $7.50 | $12.00 | $19.50 |
Meta: Llama 3.2 11B Vision Instruct $0.245 in / $0.245 out per 1M | $0.000196 | $12.25 | $7.35 | $19.60 |
Qwen2.5 Coder 32B Instruct $0.15 in / $0.45 out per 1M | $0.000210 | $7.50 | $13.50 | $21.00 |
TheDrummer: Rocinante 12B $0.17 in / $0.43 out per 1M | $0.000214 | $8.50 | $12.90 | $21.40 |
Nex AGI: DeepSeek V3.1 Nex N1 $0.135 in / $0.5 out per 1M | $0.000218 | $6.75 | $15.00 | $21.75 |
Qwen: Qwen3 VL 30B A3B Instruct $0.13 in / $0.52 out per 1M | $0.000221 | $6.50 | $15.60 | $22.10 |
AllenAI: Olmo 3 32B Think $0.15 in / $0.5 out per 1M | $0.000225 | $7.50 | $15.00 | $22.50 |
inclusionAI: Ling-2.6-1T $0.075 in / $0.625 out per 1M | $0.000225 | $3.75 | $18.75 | $22.50 |
inclusionAI: Ring-2.6-1T $0.075 in / $0.625 out per 1M | $0.000225 | $3.75 | $18.75 | $22.50 |
DeepSeek: R1 Distill Qwen 32B $0.29 in / $0.29 out per 1M | $0.000232 | $14.50 | $8.70 | $23.20 |
Baidu: ERNIE 4.5 VL 28B A3B $0.14 in / $0.56 out per 1M | $0.000238 | $7.00 | $16.80 | $23.80 |
DeepSeek: DeepSeek V3.2 $0.252 in / $0.378 out per 1M | $0.000239 | $12.60 | $11.34 | $23.94 |
Nous: Hermes 3 70B Instruct $0.3 in / $0.3 out per 1M | $0.000240 | $15.00 | $9.00 | $24.00 |
Tencent: Hunyuan A13B Instruct $0.14 in / $0.57 out per 1M | $0.000241 | $7.00 | $17.10 | $24.10 |
Qwen: QwQ 32B $0.15 in / $0.58 out per 1M | $0.000249 | $7.50 | $17.40 | $24.90 |
xAI: Grok 4.1 Fast $0.2 in / $0.5 out per 1M | $0.000250 | $10.00 | $15.00 | $25.00 |
xAI: Grok 4 Fast $0.2 in / $0.5 out per 1M | $0.000250 | $10.00 | $15.00 | $25.00 |
Llama Guard 3 8B $0.484 in / $0.03 out per 1M | $0.000251 | $24.20 | $0.90 | $25.10 |
OpenAI: GPT-4o-mini Search Preview $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
OpenAI: GPT-4o-mini (2024-07-18) $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
OpenAI: GPT-4o-mini $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
Mistral: Mistral Small 4 $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
Upstage: Solar Pro 3 $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
Cohere: Command R (08-2024) $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
DeepSeek: DeepSeek V3.2 Exp $0.27 in / $0.41 out per 1M | $0.000258 | $13.50 | $12.30 | $25.80 |
DeepSeek: DeepSeek V3.2 Speciale $0.287 in / $0.431 out per 1M | $0.000273 | $14.35 | $12.93 | $27.28 |
Meta: Llama 4 Maverick $0.2 in / $0.6 out per 1M | $0.000280 | $10.00 | $18.00 | $28.00 |
NVIDIA: Nemotron Nano 12B 2 VL $0.2 in / $0.6 out per 1M | $0.000280 | $10.00 | $18.00 | $28.00 |
AllenAI: Olmo 3.1 32B Instruct $0.2 in / $0.6 out per 1M | $0.000280 | $10.00 | $18.00 | $28.00 |
Qwen: Qwen2.5 VL 32B Instruct $0.2 in / $0.6 out per 1M | $0.000280 | $10.00 | $18.00 | $28.00 |
Mistral: Saba $0.2 in / $0.6 out per 1M | $0.000280 | $10.00 | $18.00 | $28.00 |
Qwen: Qwen3 Next 80B A3B Thinking $0.0975 in / $0.78 out per 1M | $0.000283 | $4.88 | $23.40 | $28.28 |
Qwen: Qwen3 Coder Next $0.11 in / $0.8 out per 1M | $0.000295 | $5.50 | $24.00 | $29.50 |
xAI: Grok 3 Mini Beta $0.3 in / $0.5 out per 1M | $0.000300 | $15.00 | $15.00 | $30.00 |
xAI: Grok 3 Mini $0.3 in / $0.5 out per 1M | $0.000300 | $15.00 | $15.00 | $30.00 |
TheDrummer: Cydonia 24B V4.1 $0.3 in / $0.5 out per 1M | $0.000300 | $15.00 | $15.00 | $30.00 |
Meta: Llama 3.1 70B Instruct $0.4 in / $0.4 out per 1M | $0.000320 | $20.00 | $12.00 | $32.00 |
TheDrummer: UnslopNemo 12B $0.4 in / $0.4 out per 1M | $0.000320 | $20.00 | $12.00 | $32.00 |
Z.ai: GLM 4.5 Air $0.13 in / $0.85 out per 1M | $0.000320 | $6.50 | $25.50 | $32.00 |
DeepSeek: DeepSeek V3 0324 $0.2 in / $0.77 out per 1M | $0.000331 | $10.00 | $23.10 | $33.10 |
Meituan: LongCat Flash Chat $0.2 in / $0.8 out per 1M | $0.000340 | $10.00 | $24.00 | $34.00 |
DeepSeek: DeepSeek V3.1 $0.21 in / $0.79 out per 1M | $0.000342 | $10.50 | $23.70 | $34.20 |
Mistral: Mistral Small 3.1 24B $0.351 in / $0.555 out per 1M | $0.000342 | $17.55 | $16.65 | $34.20 |
Qwen: Qwen2.5 VL 72B Instruct $0.25 in / $0.75 out per 1M | $0.000350 | $12.50 | $22.50 | $35.00 |
Inception: Mercury 2 $0.25 in / $0.75 out per 1M | $0.000350 | $12.50 | $22.50 | $35.00 |
Inception: Mercury $0.25 in / $0.75 out per 1M | $0.000350 | $12.50 | $22.50 | $35.00 |
Inception: Mercury Coder $0.25 in / $0.75 out per 1M | $0.000350 | $12.50 | $22.50 | $35.00 |
Qwen: Qwen3 VL 235B A22B Instruct $0.2 in / $0.88 out per 1M | $0.000364 | $10.00 | $26.40 | $36.40 |
Qwen: Qwen Plus 0728 (thinking) $0.26 in / $0.78 out per 1M | $0.000364 | $13.00 | $23.40 | $36.40 |
Qwen: Qwen Plus 0728 $0.26 in / $0.78 out per 1M | $0.000364 | $13.00 | $23.40 | $36.40 |
Qwen: Qwen-Plus $0.26 in / $0.78 out per 1M | $0.000364 | $13.00 | $23.40 | $36.40 |
Arcee AI: Trinity Large Thinking $0.22 in / $0.85 out per 1M | $0.000365 | $11.00 | $25.50 | $36.50 |
Qwen: Qwen3.5-35B-A3B $0.139 in / $1 out per 1M | $0.000370 | $6.95 | $30.00 | $36.95 |
Qwen: Qwen3 Next 80B A3B Instruct $0.09 in / $1.1 out per 1M | $0.000375 | $4.50 | $33.00 | $37.50 |
Qwen: Qwen3.6 35B A3B $0.15 in / $1 out per 1M | $0.000375 | $7.50 | $30.00 | $37.50 |
Qwen: Qwen3 Coder Flash $0.195 in / $0.975 out per 1M | $0.000390 | $9.75 | $29.25 | $39.00 |
DeepSeek: DeepSeek V3.1 Terminus $0.27 in / $0.95 out per 1M | $0.000420 | $13.50 | $28.50 | $42.00 |
Qwen2.5 72B Instruct $0.3 in / $0.9 out per 1M | $0.000420 | $15.00 | $27.00 | $42.00 |
Mistral: Codestral 2508 $0.3 in / $0.9 out per 1M | $0.000420 | $15.00 | $27.00 | $42.00 |
ReMM SLERP 13B $0.45 in / $0.65 out per 1M | $0.000420 | $22.50 | $19.50 | $42.00 |
MiniMax: MiniMax M2.5 $0.15 in / $1.15 out per 1M | $0.000420 | $7.50 | $34.50 | $42.00 |
Z.ai: GLM 4.6V $0.3 in / $0.9 out per 1M | $0.000420 | $15.00 | $27.00 | $42.00 |
MiniMax: MiniMax M2 $0.255 in / $1 out per 1M | $0.000427 | $12.75 | $30.00 | $42.75 |
MiniMax: MiniMax M2.1 $0.29 in / $0.95 out per 1M | $0.000430 | $14.50 | $28.50 | $43.00 |
Prime Intellect: INTELLECT-3 $0.2 in / $1.1 out per 1M | $0.000430 | $10.00 | $33.00 | $43.00 |
MiniMax: MiniMax-01 $0.2 in / $1.1 out per 1M | $0.000430 | $10.00 | $33.00 | $43.00 |
Qwen: Qwen3.6 Flash $0.1875 in / $1.125 out per 1M | $0.000431 | $9.38 | $33.75 | $43.13 |
Mistral: Mixtral 8x7B Instruct $0.54 in / $0.54 out per 1M | $0.000432 | $27.00 | $16.20 | $43.20 |
DeepSeek: DeepSeek V3 $0.27 in / $1.1 out per 1M | $0.000465 | $13.50 | $33.00 | $46.50 |
Qwen: Qwen3 VL 8B Thinking $0.117 in / $1.365 out per 1M | $0.000468 | $5.85 | $40.95 | $46.80 |
Baidu: ERNIE 4.5 300B A47B $0.28 in / $1.1 out per 1M | $0.000470 | $14.00 | $33.00 | $47.00 |
OpenAI: GPT-5.4 Nano $0.2 in / $1.25 out per 1M | $0.000475 | $10.00 | $37.50 | $47.50 |
Meta: Llama 3 70B Instruct $0.51 in / $0.74 out per 1M | $0.000477 | $25.50 | $22.20 | $47.70 |
TNG: DeepSeek R1T2 Chimera $0.3 in / $1.1 out per 1M | $0.000480 | $15.00 | $33.00 | $48.00 |
Arcee AI: Coder Large $0.5 in / $0.8 out per 1M | $0.000490 | $25.00 | $24.00 | $49.00 |
WizardLM-2 8x22B $0.62 in / $0.62 out per 1M | $0.000496 | $31.00 | $18.60 | $49.60 |
MiniMax: MiniMax M2.7 $0.279 in / $1.2 out per 1M | $0.000500 | $13.95 | $36.00 | $49.95 |
Anthropic: Claude 3 Haiku $0.25 in / $1.25 out per 1M | $0.000500 | $12.50 | $37.50 | $50.00 |
Kwaipilot: KAT-Coder-Pro V2 $0.3 in / $1.2 out per 1M | $0.000510 | $15.00 | $36.00 | $51.00 |
MiniMax: MiniMax M2-her $0.3 in / $1.2 out per 1M | $0.000510 | $15.00 | $36.00 | $51.00 |
TheDrummer: Skyfall 36B V2 $0.55 in / $0.8 out per 1M | $0.000515 | $27.50 | $24.00 | $51.50 |
Google: Gemma 2 27B $0.65 in / $0.65 out per 1M | $0.000520 | $32.50 | $19.50 | $52.00 |
Qwen: Qwen3 235B A22B Thinking 2507 $0.1495 in / $1.495 out per 1M | $0.000523 | $7.47 | $44.85 | $52.33 |
Perceptron: Perceptron Mk1 $0.15 in / $1.5 out per 1M | $0.000525 | $7.50 | $45.00 | $52.50 |
Qwen: Qwen3 VL 30B A3B Thinking $0.13 in / $1.56 out per 1M | $0.000533 | $6.50 | $46.80 | $53.30 |
Sao10K: Llama 3.3 Euryale 70B $0.65 in / $0.75 out per 1M | $0.000550 | $32.50 | $22.50 | $55.00 |
xAI: Grok Code Fast 1 $0.2 in / $1.5 out per 1M | $0.000550 | $10.00 | $45.00 | $55.00 |
Qwen: Qwen3.5-27B $0.195 in / $1.56 out per 1M | $0.000566 | $9.75 | $46.80 | $56.55 |
Google: Gemini 3.1 Flash Lite Preview $0.25 in / $1.5 out per 1M | $0.000575 | $12.50 | $45.00 | $57.50 |
Google: Gemini 3.1 Flash Lite $0.25 in / $1.5 out per 1M | $0.000575 | $12.50 | $45.00 | $57.50 |
Baidu: ERNIE 4.5 VL 424B A47B $0.42 in / $1.25 out per 1M | $0.000585 | $21.00 | $37.50 | $58.50 |
DeepSeek: R1 Distill Llama 70B $0.7 in / $0.8 out per 1M | $0.000590 | $35.00 | $24.00 | $59.00 |
Qwen: Qwen3.5 Plus 2026-02-15 $0.26 in / $1.56 out per 1M | $0.000598 | $13.00 | $46.80 | $59.80 |
Qwen: Qwen3 Coder 480B A35B $0.22 in / $1.8 out per 1M | $0.000650 | $11.00 | $54.00 | $65.00 |
Mancer: Weaver (alpha) $0.75 in / $1 out per 1M | $0.000675 | $37.50 | $30.00 | $67.50 |
OpenAI: GPT-4.1 Mini $0.4 in / $1.6 out per 1M | $0.000680 | $20.00 | $48.00 | $68.00 |
Sao10K: Llama 3.1 Euryale 70B v2.2 $0.85 in / $0.85 out per 1M | $0.000680 | $42.50 | $25.50 | $68.00 |
Qwen: Qwen3.5 Plus 2026-04-20 $0.3 in / $1.8 out per 1M | $0.000690 | $15.00 | $54.00 | $69.00 |
Mistral: Mistral Large 3 2512 $0.5 in / $1.5 out per 1M | $0.000700 | $25.00 | $45.00 | $70.00 |
OpenAI: GPT-3.5 Turbo $0.5 in / $1.5 out per 1M | $0.000700 | $25.00 | $45.00 | $70.00 |
OpenAI: GPT-5 Mini $0.25 in / $2 out per 1M | $0.000725 | $12.50 | $60.00 | $72.50 |
Z.ai: GLM 4.7 $0.4 in / $1.75 out per 1M | $0.000725 | $20.00 | $52.50 | $72.50 |
ByteDance Seed: Seed-2.0-Lite $0.25 in / $2 out per 1M | $0.000725 | $12.50 | $60.00 | $72.50 |
ByteDance Seed: Seed 1.6 $0.25 in / $2 out per 1M | $0.000725 | $12.50 | $60.00 | $72.50 |
OpenAI: GPT-5.1-Codex-Mini $0.25 in / $2 out per 1M | $0.000725 | $12.50 | $60.00 | $72.50 |
Arcee AI: Virtuoso Large $0.75 in / $1.2 out per 1M | $0.000735 | $37.50 | $36.00 | $73.50 |
Z.ai: GLM 4.6 $0.43 in / $1.74 out per 1M | $0.000737 | $21.50 | $52.20 | $73.70 |
Qwen: Qwen3.5-122B-A10B $0.26 in / $2.08 out per 1M | $0.000754 | $13.00 | $62.40 | $75.40 |
Morph: Morph V3 Fast $0.8 in / $1.2 out per 1M | $0.000760 | $40.00 | $36.00 | $76.00 |
EleutherAI: Llemma 7b $0.8 in / $1.2 out per 1M | $0.000760 | $40.00 | $36.00 | $76.00 |
AlfredPros: CodeLLaMa 7B Instruct Solidity $0.8 in / $1.2 out per 1M | $0.000760 | $40.00 | $36.00 | $76.00 |
MoonshotAI: Kimi K2.5 $0.4 in / $1.9 out per 1M | $0.000770 | $20.00 | $57.00 | $77.00 |
AionLabs: Aion-1.0-Mini $0.7 in / $1.4 out per 1M | $0.000770 | $35.00 | $42.00 | $77.00 |
Qwen: Qwen3 235B A22B $0.455 in / $1.82 out per 1M | $0.000773 | $22.75 | $54.60 | $77.35 |
Xiaomi: MiMo-V2-Omni $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Mistral: Devstral 2 2512 $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Relace: Relace Apply 3 $0.85 in / $1.25 out per 1M | $0.000800 | $42.50 | $37.50 | $80.00 |
Mistral: Mistral Medium 3.1 $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Mistral: Devstral Medium $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Mistral: Mistral Medium 3 $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Perplexity: Sonar $1 in / $1 out per 1M | $0.000800 | $50.00 | $30.00 | $80.00 |
Nous: Hermes 3 405B Instruct $1 in / $1 out per 1M | $0.000800 | $50.00 | $30.00 | $80.00 |
Xiaomi: MiMo-V2.5 $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Z.ai: GLM 4.5V $0.6 in / $1.8 out per 1M | $0.000840 | $30.00 | $54.00 | $84.00 |
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 $0.6 in / $1.8 out per 1M | $0.000840 | $30.00 | $54.00 | $84.00 |
MiniMax: MiniMax M1 $0.4 in / $2.2 out per 1M | $0.000860 | $20.00 | $66.00 | $86.00 |
Z.ai: GLM 5 $0.6 in / $1.92 out per 1M | $0.000876 | $30.00 | $57.60 | $87.60 |
AionLabs: Aion-2.0 $0.8 in / $1.6 out per 1M | $0.000880 | $40.00 | $48.00 | $88.00 |
AionLabs: Aion-RP 1.0 (8B) $0.8 in / $1.6 out per 1M | $0.000880 | $40.00 | $48.00 | $88.00 |
Qwen: Qwen VL Max $0.52 in / $2.08 out per 1M | $0.000884 | $26.00 | $62.40 | $88.40 |
DeepSeek: R1 0528 $0.5 in / $2.15 out per 1M | $0.000895 | $25.00 | $64.50 | $89.50 |
Qwen: Qwen3.5 397B A17B $0.39 in / $2.34 out per 1M | $0.000897 | $19.50 | $70.20 | $89.70 |
Google: Nano Banana (Gemini 2.5 Flash Image) $0.3 in / $2.5 out per 1M | $0.000900 | $15.00 | $75.00 | $90.00 |
Google: Gemini 2.5 Flash $0.3 in / $2.5 out per 1M | $0.000900 | $15.00 | $75.00 | $90.00 |
Amazon: Nova 2 Lite $0.3 in / $2.5 out per 1M | $0.000900 | $15.00 | $75.00 | $90.00 |
Qwen: Qwen3 VL 235B A22B Thinking $0.26 in / $2.6 out per 1M | $0.000910 | $13.00 | $78.00 | $91.00 |
NVIDIA: Llama 3.1 Nemotron 70B Instruct $1.2 in / $1.2 out per 1M | $0.000960 | $60.00 | $36.00 | $96.00 |
Z.ai: GLM 4.5 $0.6 in / $2.2 out per 1M | $0.000960 | $30.00 | $66.00 | $96.00 |
MoonshotAI: Kimi K2 0711 $0.57 in / $2.3 out per 1M | $0.000975 | $28.50 | $69.00 | $97.50 |
Deep Cogito: Cogito v2.1 671B $1.25 in / $1.25 out per 1M | $0.001000 | $62.50 | $37.50 | $100.00 |
MiniMax: MiniMax M3 $0.6 in / $2.4 out per 1M | $0.001020 | $30.00 | $72.00 | $102.00 |
OpenAI: GPT Audio Mini $0.6 in / $2.4 out per 1M | $0.001020 | $30.00 | $72.00 | $102.00 |
Morph: Morph V3 Large $0.9 in / $1.9 out per 1M | $0.001020 | $45.00 | $57.00 | $102.00 |
MoonshotAI: Kimi K2 Thinking $0.6 in / $2.5 out per 1M | $0.001050 | $30.00 | $75.00 | $105.00 |
MoonshotAI: Kimi K2 0905 $0.6 in / $2.5 out per 1M | $0.001050 | $30.00 | $75.00 | $105.00 |
DeepSeek: R1 $0.7 in / $2.5 out per 1M | $0.001100 | $35.00 | $75.00 | $110.00 |
OpenAI: GPT-3.5 Turbo (older v0613) $1 in / $2 out per 1M | $0.001100 | $50.00 | $60.00 | $110.00 |
xAI: Grok Build 0.1 $1 in / $2 out per 1M | $0.001100 | $50.00 | $60.00 | $110.00 |
Qwen: Qwen3.6 27B $0.3 in / $3.2 out per 1M | $0.001110 | $15.00 | $96.00 | $111.00 |
Google: Gemini 3 Flash Preview $0.5 in / $3 out per 1M | $0.001150 | $25.00 | $90.00 | $115.00 |
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) $0.5 in / $3 out per 1M | $0.001150 | $25.00 | $90.00 | $115.00 |
Baidu: Qianfan-OCR-Fast $0.68 in / $2.81 out per 1M | $0.001183 | $34.00 | $84.30 | $118.30 |
Sao10k: Llama 3 Euryale 70B v2.1 $1.48 in / $1.48 out per 1M | $0.001184 | $74.00 | $44.40 | $118.40 |
Qwen: Qwen3 Coder Plus $0.65 in / $3.25 out per 1M | $0.001300 | $32.50 | $97.50 | $130.00 |
OpenAI: GPT-3.5 Turbo Instruct $1.5 in / $2 out per 1M | $0.001350 | $75.00 | $60.00 | $135.00 |
Amazon: Nova Pro 1.0 $0.8 in / $3.2 out per 1M | $0.001360 | $40.00 | $96.00 | $136.00 |
xAI: Grok 4.3 $1.25 in / $2.5 out per 1M | $0.001375 | $62.50 | $75.00 | $137.50 |
xAI: Grok 4.20 $1.25 in / $2.5 out per 1M | $0.001375 | $62.50 | $75.00 | $137.50 |
Xiaomi: MiMo-V2-Pro $1 in / $3 out per 1M | $0.001400 | $50.00 | $90.00 | $140.00 |
Relace: Relace Search $1 in / $3 out per 1M | $0.001400 | $50.00 | $90.00 | $140.00 |
Nous: Hermes 4 405B $1 in / $3 out per 1M | $0.001400 | $50.00 | $90.00 | $140.00 |
Xiaomi: MiMo-V2.5-Pro $1 in / $3 out per 1M | $0.001400 | $50.00 | $90.00 | $140.00 |
MoonshotAI: Kimi K2.6 $0.73 in / $3.49 out per 1M | $0.001412 | $36.50 | $104.70 | $141.20 |
Z.ai: GLM 5.1 $0.98 in / $3.08 out per 1M | $0.001414 | $49.00 | $92.40 | $141.40 |
Arcee AI: Maestro Reasoning $0.9 in / $3.3 out per 1M | $0.001440 | $45.00 | $99.00 | $144.00 |
Switchpoint Router $0.85 in / $3.4 out per 1M | $0.001445 | $42.50 | $102.00 | $144.50 |
Qwen: Qwen3 Max Thinking $0.78 in / $3.9 out per 1M | $0.001560 | $39.00 | $117.00 | $156.00 |
Qwen: Qwen3 Max $0.78 in / $3.9 out per 1M | $0.001560 | $39.00 | $117.00 | $156.00 |
Anthropic: Claude 3.5 Haiku $0.8 in / $4 out per 1M | $0.001600 | $40.00 | $120.00 | $160.00 |
OpenAI: GPT-5.4 Mini $0.75 in / $4.5 out per 1M | $0.001725 | $37.50 | $135.00 | $172.50 |
Qwen: Qwen-Max $1.04 in / $4.16 out per 1M | $0.001768 | $52.00 | $124.80 | $176.80 |
Z.ai: GLM 5V Turbo $1.2 in / $4 out per 1M | $0.001800 | $60.00 | $120.00 | $180.00 |
Z.ai: GLM 5 Turbo $1.2 in / $4 out per 1M | $0.001800 | $60.00 | $120.00 | $180.00 |
OpenAI: GPT-5 Image Mini $2.5 in / $2 out per 1M | $0.001850 | $125.00 | $60.00 | $185.00 |
OpenAI: o3 Mini $1.1 in / $4.4 out per 1M | $0.001870 | $55.00 | $132.00 | $187.00 |
OpenAI: o4 Mini High $1.1 in / $4.4 out per 1M | $0.001870 | $55.00 | $132.00 | $187.00 |
OpenAI: o4 Mini $1.1 in / $4.4 out per 1M | $0.001870 | $55.00 | $132.00 | $187.00 |
OpenAI: o3 Mini High $1.1 in / $4.4 out per 1M | $0.001870 | $55.00 | $132.00 | $187.00 |
DeepSeek: DeepSeek V4 Pro $1.74 in / $3.48 out per 1M | $0.001914 | $87.00 | $104.40 | $191.40 |
Anthropic: Claude Haiku 4.5 $1 in / $5 out per 1M | $0.002000 | $50.00 | $150.00 | $200.00 |
Writer: Palmyra X5 $0.6 in / $6 out per 1M | $0.002100 | $30.00 | $180.00 | $210.00 |
Qwen: Qwen3.6 Plus $1.4 in / $5.6 out per 1M | $0.002380 | $70.00 | $168.00 | $238.00 |
Qwen: Qwen3.6 Max Preview $1.04 in / $6.24 out per 1M | $0.002392 | $52.00 | $187.20 | $239.20 |
Sao10K: Llama 3.1 70B Hanami x1 $3 in / $3 out per 1M | $0.002400 | $150.00 | $90.00 | $240.00 |
OpenAI: GPT-3.5 Turbo 16k $3 in / $4 out per 1M | $0.002700 | $150.00 | $120.00 | $270.00 |
Mistral Large 2407 $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Mistral Large $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Mistral Large 2411 $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
xAI: Grok 4.20 Multi-Agent $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Mistral: Pixtral Large 2411 $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Mistral: Mixtral 8x22B Instruct $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Mistral: Mistral Medium 3.5 $1.5 in / $7.5 out per 1M | $0.003000 | $75.00 | $225.00 | $300.00 |
Magnum v4 72B $3 in / $5 out per 1M | $0.003000 | $150.00 | $150.00 | $300.00 |
OpenAI: o4 Mini Deep Research $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
OpenAI: GPT-4.1 $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
AI21: Jamba Large 1.7 $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
Perplexity: Sonar Reasoning Pro $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
Perplexity: Sonar Deep Research $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
Google: Gemini 3.5 Flash $1.5 in / $9 out per 1M | $0.003450 | $75.00 | $270.00 | $345.00 |
Qwen: Qwen3.7 Max $2.5 in / $7.5 out per 1M | $0.003500 | $125.00 | $225.00 | $350.00 |
OpenAI: GPT-5.1-Codex-Max $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5.1 $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5.1 Chat $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5.1-Codex $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
Google: Gemini 2.5 Pro $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
Google: Gemini 2.5 Pro Preview 06-05 $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
Google: Gemini 2.5 Pro Preview 05-06 $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5 $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5 Codex $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5 Chat $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
Goliath 120B $3.75 in / $7.5 out per 1M | $0.004125 | $187.50 | $225.00 | $412.50 |
OpenAI: GPT-4o Audio $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT-4o Search Preview $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT-4o (2024-11-20) $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT-4o $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT-4o (2024-08-06) $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT Audio $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
Cohere: Command A $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
Inflection: Inflection 3 Pi $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
Inflection: Inflection 3 Productivity $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
Cohere: Command R+ (08-2024) $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
AionLabs: Aion-1.0 $4 in / $8 out per 1M | $0.004400 | $200.00 | $240.00 | $440.00 |
Google: Gemini 3.1 Pro Preview Custom Tools $2 in / $12 out per 1M | $0.004600 | $100.00 | $360.00 | $460.00 |
Google: Gemini 3.1 Pro Preview $2 in / $12 out per 1M | $0.004600 | $100.00 | $360.00 | $460.00 |
Google: Nano Banana Pro (Gemini 3 Pro Image Preview) $2 in / $12 out per 1M | $0.004600 | $100.00 | $360.00 | $460.00 |
Amazon: Nova Premier 1.0 $2.5 in / $12.5 out per 1M | $0.005000 | $125.00 | $375.00 | $500.00 |
OpenAI: GPT-5.3 Chat $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.3-Codex $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.2-Codex $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.2 Chat $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.2 $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.4 $2.5 in / $15 out per 1M | $0.005750 | $125.00 | $450.00 | $575.00 |
Anthropic: Claude Sonnet 4.6 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
xAI: Grok 3 Beta $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Anthropic: Claude Sonnet 4.5 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Anthropic: Claude Sonnet 4 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
xAI: Grok 3 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Anthropic: Claude 3.7 Sonnet $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Anthropic: Claude 3.7 Sonnet (thinking) $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Perplexity: Sonar Pro $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Perplexity: Sonar Pro Search $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
xAI: Grok 4 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
OpenAI: GPT-4o (2024-05-13) $5 in / $15 out per 1M | $0.007000 | $250.00 | $450.00 | $700.00 |
OpenAI: GPT-5 Image $10 in / $10 out per 1M | $0.008000 | $500.00 | $300.00 | $800.00 |
OpenAI: GPT-4o (extended) $6 in / $18 out per 1M | $0.008400 | $300.00 | $540.00 | $840.00 |
OpenAI: GPT-5.4 Image 2 $8 in / $15 out per 1M | $0.008500 | $400.00 | $450.00 | $850.00 |
Anthropic: Claude Opus 4.8 $5 in / $25 out per 1M | $0.0100 | $250.00 | $750.00 | $1000.00 |
Anthropic: Claude Opus 4.7 $5 in / $25 out per 1M | $0.0100 | $250.00 | $750.00 | $1000.00 |
Anthropic: Claude Opus 4.6 $5 in / $25 out per 1M | $0.0100 | $250.00 | $750.00 | $1000.00 |
Anthropic: Claude Opus 4.5 $5 in / $25 out per 1M | $0.0100 | $250.00 | $750.00 | $1000.00 |
OpenAI: GPT-5.5 $5 in / $30 out per 1M | $0.0115 | $250.00 | $900.00 | $1150.00 |
OpenAI: GPT-4 Turbo $10 in / $30 out per 1M | $0.0140 | $500.00 | $900.00 | $1400.00 |
OpenAI: GPT-4 Turbo Preview $10 in / $30 out per 1M | $0.0140 | $500.00 | $900.00 | $1400.00 |
OpenAI: GPT-4 Turbo (older v1106) $10 in / $30 out per 1M | $0.0140 | $500.00 | $900.00 | $1400.00 |
OpenAI: o3 Deep Research $10 in / $40 out per 1M | $0.0170 | $500.00 | $1200.00 | $1700.00 |
OpenAI: o3 $10 in / $40 out per 1M | $0.0170 | $500.00 | $1200.00 | $1700.00 |
Anthropic: Claude Fable 5 $10 in / $50 out per 1M | $0.0200 | $500.00 | $1500.00 | $2000.00 |
OpenAI: o1 $15 in / $60 out per 1M | $0.0255 | $750.00 | $1800.00 | $2550.00 |
Anthropic: Claude Opus 4.1 $15 in / $75 out per 1M | $0.0300 | $750.00 | $2250.00 | $3000.00 |
Anthropic: Claude Opus 4 $15 in / $75 out per 1M | $0.0300 | $750.00 | $2250.00 | $3000.00 |
OpenAI: GPT-4 (older v0314) $30 in / $60 out per 1M | $0.0330 | $1500.00 | $1800.00 | $3300.00 |
OpenAI: GPT-4 $30 in / $60 out per 1M | $0.0330 | $1500.00 | $1800.00 | $3300.00 |
OpenAI: o3 Pro $20 in / $80 out per 1M | $0.0340 | $1000.00 | $2400.00 | $3400.00 |
OpenAI: GPT-5 Pro $15 in / $120 out per 1M | $0.0435 | $750.00 | $3600.00 | $4350.00 |
Anthropic: Claude Opus 4.7 (Fast) $30 in / $150 out per 1M | $0.0600 | $1500.00 | $4500.00 | $6000.00 |
Anthropic: Claude Opus 4.6 (Fast) $30 in / $150 out per 1M | $0.0600 | $1500.00 | $4500.00 | $6000.00 |
OpenAI: GPT-5.2 Pro $21 in / $168 out per 1M | $0.0609 | $1050.00 | $5040.00 | $6090.00 |
OpenAI: GPT-5.5 Pro $30 in / $180 out per 1M | $0.0690 | $1500.00 | $5400.00 | $6900.00 |
OpenAI: GPT-5.4 Pro $30 in / $180 out per 1M | $0.0690 | $1500.00 | $5400.00 | $6900.00 |
OpenAI: o1-pro $150 in / $600 out per 1M | $0.2550 | $7500.00 | $18000.00 | $25500.00 |
These figures are list-price estimates. Real bills typically run 1.3-1.7x higher after retries, system-prompt re-sends, and tool-call round-trips. See per-million-tokens true cost for a breakdown of these adders.
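The rows above appear consistent with a fixed workload of 500 input tokens and 300 output tokens per request at 100,000 requests per month. That workload is inferred from the figures, not stated here, so treat it as an assumption. A minimal Python sketch of the arithmetic, with the 1.3-1.7x overhead from the note applied on top (the names `Workload`, `per_request_cost`, `monthly_costs`, and `real_bill_range` are hypothetical):

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Workload:
    """Assumed traffic profile, inferred from the table rows."""
    input_tokens: int = 500            # tokens sent per request
    output_tokens: int = 300           # tokens generated per request
    requests_per_month: int = 100_000


def per_request_cost(input_price: float, output_price: float, w: Workload) -> float:
    """List-price cost of one request; prices are USD per 1M tokens."""
    return (w.input_tokens * input_price
            + w.output_tokens * output_price) / TOKENS_PER_MILLION


def monthly_costs(input_price: float, output_price: float,
                  w: Workload) -> tuple[float, float, float]:
    """Return (input, output, total) monthly list-price cost in USD."""
    monthly_in = (w.input_tokens * w.requests_per_month
                  * input_price / TOKENS_PER_MILLION)
    monthly_out = (w.output_tokens * w.requests_per_month
                   * output_price / TOKENS_PER_MILLION)
    return monthly_in, monthly_out, monthly_in + monthly_out


def real_bill_range(list_total: float, low: float = 1.3,
                    high: float = 1.7) -> tuple[float, float]:
    """Scale a list-price total by the 1.3-1.7x overhead quoted in the note."""
    return list_total * low, list_total * high


# Example: the "$3 in / $15 out per 1M" rows (e.g. Claude Sonnet 4.6).
workload = Workload()
cost_in, cost_out, total = monthly_costs(3.0, 15.0, workload)
print(f"per request: ${per_request_cost(3.0, 15.0, workload):.6f}")
print(f"monthly: ${cost_in:.2f} in + ${cost_out:.2f} out = ${total:.2f}")
print("with overhead: ${:.2f} to ${:.2f}".format(*real_bill_range(total)))
```

Under these assumptions the $3 / $15 example reproduces the table's $0.006000 per request and $600.00 per month, and the overhead multiplier puts a realistic bill between roughly $780 and $1,020.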
Recent Price Changes
| Model | Date | Input / Output per 1M |
|---|---|---|
| MiniMax: MiniMax M3 | Jun 8, 2026 | $0.6 / $2.4 |
| xAI: Grok 4.20 | May 25, 2026 | $1.25 / $2.5 |
| MoonshotAI: Kimi K2.6 | May 25, 2026 | $0.73 / $3.49 |
| DeepSeek: R1 0528 | May 25, 2026 | $0.5 / $2.15 |
| MoonshotAI: Kimi K2.5 | May 25, 2026 | $0.4 / $1.9 |
| Z.ai: GLM 5.1 | May 25, 2026 | $0.98 / $3.08 |
| Z.ai: GLM 5 | May 25, 2026 | $0.6 / $1.92 |
| DeepSeek: DeepSeek V3.2 | May 25, 2026 | $0.252 / $0.378 |
| DeepSeek: DeepSeek V3.2 Speciale | May 25, 2026 | $0.287 / $0.431 |
| DeepSeek: DeepSeek V3.1 Terminus | May 25, 2026 | $0.27 / $0.95 |
Understanding AI API Pricing in 2026
AI model pricing has changed dramatically. Since GPT-4 launched in March 2023 at $30 per million input tokens, the price of flagship-class models has fallen by over 90% (GPT-5 is listed above at $1.25 per million input tokens), driven by competition from Anthropic, Google, and open-source challengers such as DeepSeek and Meta's Llama.
Even between two mainstream models, input prices differ by 150x: Google's Gemini 2.0 Flash costs $0.10/1M input tokens, while Claude Opus 4 costs $15/1M input tokens. Across the full index the spread is far wider, about 15,000x, from $0.01 to $150 per 1M input tokens (OpenAI's o1-pro). The key insight is that price doesn't always correlate with quality: DeepSeek V3 reaches a quality score of 86 at just $0.27/1M input tokens, while some premium models charge 50x more for marginal quality gains.
How to Optimize AI API Costs
The most effective strategy is model routing: sending simple queries to cheap, fast models and complex queries to premium models. A gateway like Swfte Connect automates this, typically reducing costs by 30-60% without sacrificing quality.
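As a rough illustration of the routing idea, the sketch below sends short, simple prompts to a cheap tier and everything else to a premium tier, then estimates the blended cost per request. The two prices are taken from the table above. The tier choice, the keyword heuristic, the `route` and `blended_cost_per_request` names, and the 500/300-token request size are assumptions for illustration, not a description of how any particular gateway works.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelTier:
    name: str
    input_price: float    # USD per 1M input tokens
    output_price: float   # USD per 1M output tokens


# Prices copied from the table above; picking these two tiers is illustrative.
CHEAP = ModelTier("Google: Gemini 2.5 Flash Lite", 0.10, 0.40)
PREMIUM = ModelTier("Anthropic: Claude Sonnet 4.6", 3.00, 15.00)

# Hypothetical markers of a "complex" request. A production router would
# more likely use a trained classifier or a confidence signal.
COMPLEX_HINTS = ("prove", "refactor", "debug", "step by step", "analyze")


def route(prompt: str, max_simple_chars: int = 400) -> ModelTier:
    """Crude heuristic: long prompts or prompts with hint words go to PREMIUM."""
    text = prompt.lower()
    if len(prompt) > max_simple_chars or any(h in text for h in COMPLEX_HINTS):
        return PREMIUM
    return CHEAP


def blended_cost_per_request(premium_share: float, input_tokens: int = 500,
                             output_tokens: int = 300) -> float:
    """Expected list-price cost per request when `premium_share` of traffic
    (a fraction between 0 and 1) is routed to PREMIUM and the rest to CHEAP."""
    def cost(m: ModelTier) -> float:
        return (input_tokens * m.input_price
                + output_tokens * m.output_price) / 1_000_000
    return premium_share * cost(PREMIUM) + (1.0 - premium_share) * cost(CHEAP)


all_premium = blended_cost_per_request(1.0)
half_routed = blended_cost_per_request(0.5)
print(route("What is the capital of France?").name)
print(route("Please debug this stack trace").name)
print(f"saving with half the traffic on the cheap tier: "
      f"{1 - half_routed / all_premium:.1%}")
```

With these assumed tiers, moving half of the traffic to the cheaper model cuts the list-price cost by about 49%. The actual saving depends on what share of requests can be served by the cheap tier without a quality loss.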
Other strategies include leveraging cached-input pricing (offered by providers such as Google and DeepSeek), batching requests to reduce per-call overhead, and self-hosting open-source models for predictable workloads.
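To see how cached-input pricing changes the effective rate, the sketch below blends the list price with a discounted price for the cached share of the prompt. The function name and both parameters are hypothetical. Discount levels vary by provider and are not listed in this index, so the 75% figure in the example is a placeholder, not a quoted rate.

```python
def effective_input_price(list_price: float, cached_fraction: float,
                          cached_discount: float) -> float:
    """Blended USD per 1M input tokens when part of the prompt hits the cache.

    cached_fraction: share of input tokens served from cache (0 to 1).
    cached_discount: fractional discount on cached tokens
                     (0.75 means cached tokens cost 25% of list price).
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    if not 0.0 <= cached_discount <= 1.0:
        raise ValueError("cached_discount must be between 0 and 1")
    cached_price = list_price * (1.0 - cached_discount)
    return cached_fraction * cached_price + (1.0 - cached_fraction) * list_price


# Example: a $0.30/1M input price (the Gemini 2.5 Flash row), with 80% of
# each prompt being a repeated system prompt and an assumed 75% discount.
price = effective_input_price(0.30, cached_fraction=0.8, cached_discount=0.75)
print(f"effective input price: ${price:.3f} per 1M tokens")
```

Under those placeholder numbers the effective input price drops from $0.30 to about $0.12 per 1M tokens. Output tokens are unaffected, so the benefit is largest for workloads with long, repeated prompts and short responses.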
Pricing Trends to Watch
- Price compression continues: Expect another 50%+ reduction across flagship models by the end of 2026
- Reasoning premium: Models with extended thinking (o3, R1) cost more due to higher compute per request
- Open-source pressure: Llama 4 and DeepSeek are forcing closed providers to cut prices faster
- Cached pricing: More providers are offering discounted rates for repeated context
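The reasoning premium in the list above comes partly from the per-token price and partly from the extra tokens a reasoning model generates before it answers. The sketch below assumes those reasoning tokens are billed at the output rate, which is common but should be checked against each provider's documentation. The function name and the 2,000-token reasoning budget are hypothetical.

```python
def request_cost(input_tokens: int, visible_output_tokens: int,
                 reasoning_tokens: int, input_price: float,
                 output_price: float) -> float:
    """List-price cost of one request; prices are USD per 1M tokens.

    Assumes reasoning tokens are billed like ordinary output tokens.
    """
    billed_output = visible_output_tokens + reasoning_tokens
    return (input_tokens * input_price
            + billed_output * output_price) / 1_000_000


# OpenAI o3 at $10 in / $40 out per 1M (from the table above).
without_reasoning = request_cost(500, 300, 0, 10.0, 40.0)
with_reasoning = request_cost(500, 300, 2_000, 10.0, 40.0)
print(f"no reasoning tokens:    ${without_reasoning:.4f}")
print(f"2,000 reasoning tokens: ${with_reasoning:.4f}")
print(f"multiplier: {with_reasoning / without_reasoning:.1f}x")
```

With no reasoning tokens the request matches the table's $0.0170 for o3. Adding an assumed 2,000 reasoning tokens raises it to about $0.097, which is why per-request estimates based only on visible output can understate the bill for these models.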