Updated Apr 6, 2026

AI Model Pricing Index

Compare API pricing across every major AI provider. Sortable table, historical trends, and an interactive cost calculator to estimate your monthly spend.

Models Tracked: 318
Providers: 52
Cheapest Input: $0.02
Price Range: 8824x

Full Pricing Table

318 models
Model | Provider | Input / 1M | Output / 1M | Blended | Quality | Value | Context

Open-source

Mistral AI$0.02$0.04$0.03
72
2400.0131K

Open-source

Google$0.02$0.04$0.03
50
1666.733K

Open-source

Meta$0.02$0.05$0.04
65
1857.116K

Open-source

Meta$0.03$0.04$0.04
65
1857.18K

Open-source

Meta$0.02$0.06$0.04
50
1250.0131K

Hard reasoning

sao10k$0.04$0.05$0.04
50
1111.18K

Open-source

Meta$0.05$0.05$0.05
50
1020.4131K

Open-source

Google$0.04$0.08$0.06
65
1083.3131K

Open-source

Google$0.03$0.09$0.06
65
1083.38K

Speed & cost

gryphe$0.06$0.06$0.06
58
966.74K

Code generation

Alibaba Cloud$0.03$0.09$0.06
50
833.333K

Speed & cost

ibm-granite$0.02$0.11$0.06
62
976.4131K

Open-source

Mistral AI$0.05$0.08$0.07
72
1107.733K

Open-source

Mistral AI$0.03$0.11$0.07
72
1028.6131K

Speed & cost

OpenAI$0.03$0.11$0.07
50
714.3131K

Open-source

Alibaba Cloud$0.04$0.10$0.07
50
714.333K

Speed & cost

liquid$0.03$0.12$0.07
50
666.733K

Open-source

Alibaba Cloud$0.03$0.13$0.08
50
615.4131K

Open-source

Google$0.04$0.13$0.09
74
870.6131K

Open-source

Alibaba Cloud$0.07$0.10$0.09
82
959.1262K

Speed & cost

Amazon$0.04$0.14$0.09
62
708.6128K

Open-source

Cohere$0.04$0.15$0.09
50
533.3128K

Speed & cost

arcee$0.04$0.15$0.10
50
512.8131K

Open-source

Alibaba Cloud$0.05$0.15$0.10
82
820.0256K

Speed & cost

nvidia$0.04$0.16$0.10
62
620.0131K

Speed & cost

rekaai$0.10$0.10$0.10
58
580.016K

Speed & cost

Mistral AI$0.10$0.10$0.10
58
580.0131K

Open-source

z-ai$0.10$0.10$0.10
58
580.0128K

Speed & cost

microsoft$0.07$0.14$0.10
65
634.116K

Open-source

Meta$0.03$0.20$0.11
50
440.560K

Speed & cost

OpenAI$0.04$0.19$0.11
50
436.7131K

Open-source

Google$0.08$0.16$0.12
74
616.7131K

Speed & cost

nvidia$0.05$0.20$0.13
62
496.0262K

Open-source

allenai$0.05$0.20$0.13
50
400.0128K

Open-source

Mistral AI$0.07$0.20$0.14
72
523.6128K

Search + citations

nousresearch$0.14$0.14$0.14
65
464.38K

Open-source

Alibaba Cloud$0.06$0.24$0.15
82
546.741K

Speed & cost

essentialai$0.15$0.15$0.15
58
386.733K

Speed & cost

Mistral AI$0.15$0.15$0.15
58
386.7262K

Speed & cost

Amazon$0.06$0.24$0.15
58
386.7300K

Open-source

Mistral AI$0.11$0.19$0.15
58
386.73K

Speed & cost

bytedance$0.10$0.20$0.15
58
386.7128K

Speed & cost

rekaai$0.10$0.20$0.15
58
386.766K

Open-source

Alibaba Cloud$0.08$0.24$0.16
82
512.541K

Speed & cost

Alibaba Cloud$0.07$0.26$0.16
82
504.61M

Code generation

Alibaba Cloud$0.07$0.27$0.17
82
482.4160K

Hard reasoning

baidu$0.07$0.28$0.18
58
331.4131K

Speed & cost

baidu$0.07$0.28$0.18
58
331.4120K

Speed & cost

arcee$0.18$0.18$0.18
58
322.2131K

Open-source

Meta$0.18$0.18$0.18
58
322.2164K

Open-source

Alibaba Cloud$0.08$0.28$0.18
82
455.641K

Speed & cost

Google$0.07$0.30$0.19
73
389.31M

Speed & cost

bytedance$0.07$0.30$0.19
58
309.3262K

Speed & cost

OpenAI$0.07$0.30$0.19
58
309.3131K

Speed & cost

Meta$0.08$0.30$0.19
74
389.5328K

Speed & cost

xiaomi$0.09$0.29$0.19
58
305.3262K

Open-source

Alibaba Cloud$0.09$0.30$0.20
82
420.5262K

Open-source

Meta$0.05$0.34$0.20
58
296.780K

Open-source

Mistral AI$0.10$0.30$0.20
72
360.033K

Speed & cost

stepfun$0.10$0.30$0.20
58
290.0262K

Speed & cost

Mistral AI$0.20$0.20$0.20
58
290.0262K

Open-source

Mistral AI$0.10$0.30$0.20
58
290.032K

Open-source

Mistral AI$0.10$0.30$0.20
58
290.0131K

Open-source

Meta$0.10$0.32$0.21
74
352.4131K

Open-source

Alibaba Cloud$0.05$0.40$0.23
82
364.441K

Speed & cost

OpenAI$0.05$0.40$0.23
72
320.0400K

Speed & cost

z-ai$0.06$0.40$0.23
58
252.2203K

Hard reasoning

Alibaba Cloud$0.08$0.40$0.24
82
341.7131K

Speed & cost

Google$0.10$0.40$0.25
80
320.01M

Speed & cost

Google$0.10$0.40$0.25
80
320.01M

Open-source

nvidia$0.10$0.40$0.25
74
296.0131K

Speed & cost

Google$0.10$0.40$0.25
73
292.01M

Speed & cost

OpenAI$0.10$0.40$0.25
72
288.01M

Speed & cost

bytedance$0.10$0.40$0.25
58
232.0262K

Open-source

Alibaba Cloud$0.12$0.39$0.26
82
321.633K

Open-source

Alibaba Cloud$0.10$0.42$0.26
82
315.4131K

Open-source

Google$0.13$0.40$0.27
76
286.8262K

Search + citations

nousresearch$0.13$0.40$0.27
58
218.9131K

Open-source

Google$0.14$0.40$0.27
76
281.5262K

Search + citations

Alibaba Cloud$0.09$0.45$0.27
58
214.8131K

Open-source

Alibaba Cloud$0.14$0.41$0.27
58
212.5131K

Hard reasoning

DeepSeek$0.29$0.29$0.29
91
313.833K

Open-source

Alibaba Cloud$0.08$0.50$0.29
82
282.8131K

Search + citations

nousresearch$0.30$0.30$0.30
74
246.7131K

Speed & cost

nvidia$0.10$0.50$0.30
58
193.3262K

Speed & cost

thedrummer$0.17$0.43$0.30
58
193.333K

Open-source

nex-agi$0.14$0.50$0.32
86
270.9131K

Open-source

DeepSeek$0.26$0.38$0.32
86
268.8164K

Open-source

Alibaba Cloud$0.13$0.52$0.33
82
252.3131K

Hard reasoning

allenai$0.15$0.50$0.33
58
178.566K

Open-source

DeepSeek$0.27$0.41$0.34
86
252.9164K

Speed & cost

xAI$0.20$0.50$0.35
58
165.72M

Speed & cost

xAI$0.20$0.50$0.35
58
165.72M

Speed & cost

baidu$0.14$0.56$0.35
58
165.730K

Speed & cost

tencent$0.14$0.57$0.35
58
163.4131K

Hard reasoning

Alibaba Cloud$0.15$0.58$0.36
58
158.9131K

Open-source

Meta$0.15$0.60$0.38
82
218.71M

Search + citations

OpenAI$0.15$0.60$0.38
80
213.3128K

Speed & cost

OpenAI$0.15$0.60$0.38
80
213.3128K

Speed & cost

OpenAI$0.15$0.60$0.38
80
213.3128K

Open-source

Mistral AI$0.15$0.60$0.38
72
192.0262K

Speed & cost

upstage$0.15$0.60$0.38
58
154.7128K

Open-source

Cohere$0.15$0.60$0.38
58
154.7128K

Speed & cost

xAI$0.30$0.50$0.40
82
205.0131K

Speed & cost

xAI$0.30$0.50$0.40
82
205.0131K

Open-source

Meta$0.40$0.40$0.40
74
185.0131K

Speed & cost

thedrummer$0.40$0.40$0.40
66
165.033K

Speed & cost

nvidia$0.20$0.60$0.40
62
155.0131K

Open-source

allenai$0.20$0.60$0.40
58
145.066K

Speed & cost

thedrummer$0.30$0.50$0.40
58
145.0131K

Open-source

Alibaba Cloud$0.20$0.60$0.40
58
145.0128K

Open-source

Mistral AI$0.20$0.60$0.40
58
145.033K

Code generation

Alibaba Cloud$0.12$0.75$0.43
82
188.5262K

Hard reasoning

Alibaba Cloud$0.10$0.78$0.44
82
186.9131K

Open-source

DeepSeek$0.15$0.75$0.45
86
191.133K

Open-source

DeepSeek$0.20$0.77$0.48
86
177.3164K

Open-source

z-ai$0.13$0.85$0.49
58
118.4131K

Open-source

DeepSeek$0.21$0.79$0.50
86
172.0164K

Speed & cost

inception$0.25$0.75$0.50
58
116.0128K

Speed & cost

meituan$0.20$0.80$0.50
58
116.0131K

Speed & cost

inception$0.25$0.75$0.50
58
116.0128K

Code generation

inception$0.25$0.75$0.50
58
116.0128K

Hard reasoning

Alibaba Cloud$0.26$0.78$0.52
58
111.51M

Open-source

Alibaba Cloud$0.26$0.78$0.52
58
111.51M

Open-source

Alibaba Cloud$0.26$0.78$0.52
58
111.51M

Hard reasoning

arcee$0.22$0.85$0.54
58
108.4262K

Open-source

Alibaba Cloud$0.20$0.88$0.54
82
151.9262K

Open-source

Mistral AI$0.54$0.54$0.54
72
133.333K

Speed & cost

undi95$0.45$0.65$0.55
66
120.06K

Speed & cost

minimax$0.12$0.99$0.55
58
104.7197K

Code generation

Alibaba Cloud$0.20$0.97$0.58
82
140.21M

Open-source

Alibaba Cloud$0.09$1.10$0.60
82
137.8262K

Code generation

Mistral AI$0.30$0.90$0.60
78
130.0256K

Open-source

z-ai$0.30$0.90$0.60
58
96.7131K

Open-source

DeepSeek$0.32$0.89$0.60
86
142.1164K

Code generation

Alibaba Cloud$0.22$1.00$0.61
82
134.4262K

Speed & cost

minimax$0.27$0.95$0.61
58
95.1197K

Speed & cost

microsoft$0.62$0.62$0.62
62
100.066K

Open-source

Meta$0.51$0.74$0.63
66
105.68K

Speed & cost

minimax$0.26$1.00$0.63
58
92.4197K

Code generation

arcee$0.50$0.80$0.65
66
101.533K

Open-source

Google$0.65$0.65$0.65
65
100.08K

Speed & cost

prime-intellect$0.20$1.10$0.65
58
89.2131K

Speed & cost

minimax$0.20$1.10$0.65
58
89.21M

Speed & cost

thedrummer$0.55$0.80$0.68
66
97.833K

Speed & cost

baidu$0.28$1.10$0.69
58
84.1123K

Hard reasoning

sao10k$0.65$0.75$0.70
66
94.3131K

Hard reasoning

tngtech$0.30$1.10$0.70
91
130.0164K

Speed & cost

OpenAI$0.20$1.25$0.72
72
99.3400K

Open-source

Alibaba Cloud$0.16$1.30$0.73
82
112.1262K

Hard reasoning

Alibaba Cloud$0.12$1.36$0.74
82
110.7131K

Hard reasoning

DeepSeek$0.70$0.80$0.75
91
121.3131K

Speed & cost

Anthropic$0.25$1.25$0.75
72
96.0200K

Code generation

kwaipilot$0.30$1.20$0.75
58
77.3256K

Speed & cost

minimax$0.30$1.20$0.75
58
77.3205K

Speed & cost

minimax$0.30$1.20$0.75
58
77.366K

Open-source

DeepSeek$0.40$1.20$0.80
86
107.5164K

Open-source

Alibaba Cloud$0.80$0.80$0.80
66
82.533K

Hard reasoning

Alibaba Cloud$0.15$1.50$0.82
82
99.7131K

Code generation

Alibaba Cloud$0.66$1.00$0.83
66
79.533K

Speed & cost

baidu$0.42$1.25$0.83
66
79.0123K

Hard reasoning

Alibaba Cloud$0.13$1.56$0.84
82
97.0131K

Hard reasoning

sao10k$0.85$0.85$0.85
66
77.6131K

Speed & cost

xAI$0.20$1.50$0.85
58
68.2256K

Speed & cost

mancer$0.75$1.00$0.88
66
75.48K

Speed & cost

Google$0.25$1.50$0.88
62
70.91M

Open-source

Alibaba Cloud$0.20$1.56$0.88
82
93.4262K

Open-source

Alibaba Cloud$0.26$1.56$0.91
82
90.11M

Speed & cost

arcee$0.75$1.20$0.97
66
67.7131K

Open-source

Mistral AI$0.50$1.50$1.00
85
85.0262K

Speed & cost

OpenAI$0.40$1.60$1.00
80
80.01M

Speed & cost

morph$0.80$1.20$1.00
66
66.082K

Speed & cost

eleutherai$0.80$1.20$1.00
66
66.04K

Open-source

alfredpros$0.80$1.20$1.00
66
66.04K

Search + citations

Perplexity$1.00$1.00$1.00
66
66.0127K

Search + citations

nousresearch$1.00$1.00$1.00
66
66.0131K

Speed & cost

OpenAI$0.50$1.50$1.00
66
66.016K

Speed & cost

aion$0.70$1.40$1.05
66
62.9131K

Speed & cost

relace$0.85$1.25$1.05
66
62.9256K

Speed & cost

moonshot$0.38$1.72$1.05
66
62.8262K

Open-source

z-ai$0.39$1.75$1.07
66
61.7203K

Speed & cost

OpenAI$0.25$2.00$1.13
83
73.8400K

Speed & cost

bytedance$0.25$2.00$1.13
58
51.6262K

Speed & cost

bytedance$0.25$2.00$1.13
58
51.6262K

Code generation

OpenAI$0.25$2.00$1.13
58
51.6400K

Open-source

Alibaba Cloud$0.46$1.82$1.14
82
72.1131K

Open-source

z-ai$0.39$1.90$1.15
66
57.6205K

Open-source

Alibaba Cloud$0.26$2.08$1.17
82
70.1262K

Open-source

nvidia$1.20$1.20$1.20
74
61.7131K

Speed & cost

xiaomi$0.40$2.00$1.20
66
55.0262K

Open-source

Mistral AI$0.40$2.00$1.20
66
55.0262K

Speed & cost

moonshot$0.40$2.00$1.20
66
55.0131K

Open-source

Mistral AI$0.40$2.00$1.20
66
55.0131K

Open-source

z-ai$0.60$1.80$1.20
66
55.066K

Open-source

Mistral AI$0.40$2.00$1.20
66
55.0131K

Open-source

Mistral AI$0.40$2.00$1.20
66
55.0131K

Open-source

nvidia$0.60$1.80$1.20
66
55.0131K

Speed & cost

aion$0.80$1.60$1.20
66
55.0131K

Open-source

aion$0.80$1.60$1.20
65
54.233K

Hard reasoning

moonshot$0.47$2.00$1.23
66
53.4131K

General purpose

deepcogito$1.25$1.25$1.25
74
59.2128K

Hard reasoning

DeepSeek$0.45$2.15$1.30
91
70.0164K

Speed & cost

minimax$0.40$2.20$1.30
66
50.81M

Open-source

Alibaba Cloud$0.52$2.08$1.30
66
50.8131K

Open-source

Alibaba Cloud$0.39$2.34$1.36
82
60.1262K

Image generation

Google$0.30$2.50$1.40
80
57.133K

Speed & cost

Google$0.30$2.50$1.40
80
57.11M

Speed & cost

morph$0.90$1.90$1.40
66
47.1262K

Speed & cost

Amazon$0.30$2.50$1.40
58
41.41M

Open-source

z-ai$0.60$2.20$1.40
66
47.1131K

Hard reasoning

Alibaba Cloud$0.26$2.60$1.43
82
57.3131K

Speed & cost

moonshot$0.57$2.30$1.43
66
46.0131K

Hard reasoning

sao10k$1.48$1.48$1.48
74
50.08K

Speed & cost

OpenAI$0.60$2.40$1.50
66
44.0128K

Speed & cost

OpenAI$1.00$2.00$1.50
66
44.04K

Open-source

z-ai$0.72$2.30$1.51
66
43.780K

Hard reasoning

DeepSeek$0.70$2.50$1.60
91
56.964K

Speed & cost

Google$0.50$3.00$1.75
80
45.71M

General purpose

OpenAI$1.50$2.00$1.75
74
42.34K

Image generation

Google$0.50$3.00$1.75
66
37.766K

Code generation

Alibaba Cloud$0.65$3.25$1.95
82
42.11M

Speed & cost

xiaomi$1.00$3.00$2.00
66
33.01M

Search + citations

relace$1.00$3.00$2.00
66
33.0256K

Search + citations

nousresearch$1.00$3.00$2.00
66
33.0131K

Speed & cost

Amazon$0.80$3.20$2.00
66
33.0300K

Speed & cost

arcee$0.90$3.30$2.10
66
31.4131K

Speed & cost

switchpoint$0.85$3.40$2.13
66
31.1131K

Image generation

OpenAI$2.50$2.00$2.25
74
32.9400K

Hard reasoning

Alibaba Cloud$0.78$3.90$2.34
82
35.0262K

Open-source

Alibaba Cloud$0.78$3.90$2.34
82
35.0262K

Speed & cost

Anthropic$0.80$4.00$2.40
76
31.7200K

Open-source

z-ai$1.20$4.00$2.60
74
28.5203K

Open-source

z-ai$1.20$4.00$2.60
74
28.5203K

Open-source

Alibaba Cloud$1.04$4.16$2.60
74
28.533K

Speed & cost

OpenAI$0.75$4.50$2.63
83
31.6400K

Hard reasoning

OpenAI$1.10$4.40$2.75
82
29.8200K

Hard reasoning

OpenAI$1.10$4.40$2.75
82
29.8200K

Hard reasoning

OpenAI$1.10$4.40$2.75
82
29.8200K

Hard reasoning

OpenAI$1.10$4.40$2.75
82
29.8200K

Speed & cost

Anthropic$1.00$5.00$3.00
76
25.3200K

Hard reasoning

sao10k$3.00$3.00$3.00
74
24.716K

Speed & cost

writer$0.60$6.00$3.30
66
20.01M

General purpose

OpenAI$3.00$4.00$3.50
74
21.116K

Open-source

Mistral AI$2.00$6.00$4.00
85
21.3131K

Open-source

Mistral AI$2.00$6.00$4.00
85
21.3131K

Open-source

Mistral AI$2.00$6.00$4.00
85
21.3128K

General purpose

xAI$2.00$6.00$4.00
74
18.52M

General purpose

xAI$2.00$6.00$4.00
74
18.52M

Open-source

Mistral AI$2.00$6.00$4.00
74
18.5131K

General purpose

anthracite-org$3.00$5.00$4.00
74
18.516K

Open-source

Mistral AI$2.00$6.00$4.00
72
18.066K

Deep research

OpenAI$2.00$8.00$5.00
96
19.2200K

Hard reasoning

OpenAI$2.00$8.00$5.00
92
18.4200K

General purpose

OpenAI$2.00$8.00$5.00
89
17.81M

General purpose

ai21$2.00$8.00$5.00
74
14.8256K

Search + citations

Perplexity$2.00$8.00$5.00
74
14.8128K

Deep research

Perplexity$2.00$8.00$5.00
74
14.8128K

Code generation

OpenAI$1.25$10.00$5.63
93
16.5400K

General purpose

OpenAI$1.25$10.00$5.63
93
16.5400K

General purpose

OpenAI$1.25$10.00$5.63
93
16.5128K

Code generation

OpenAI$1.25$10.00$5.63
93
16.5400K

General purpose

OpenAI$1.25$10.00$5.63
92
16.4400K

Speed & cost

Google$1.25$10.00$5.63
91
16.21M

Speed & cost

Google$1.25$10.00$5.63
91
16.21M

Speed & cost

Google$1.25$10.00$5.63
91
16.21M

General purpose

alpindale$3.75$7.50$5.63
82
14.66K

Code generation

OpenAI$1.25$10.00$5.63
74
13.2400K

General purpose

OpenAI$1.25$10.00$5.63
74
13.2128K

General purpose

aion$4.00$8.00$6.00
82
13.7131K

General purpose

OpenAI$2.50$10.00$6.25
88
14.1128K

Search + citations

OpenAI$2.50$10.00$6.25
88
14.1128K

General purpose

OpenAI$2.50$10.00$6.25
88
14.1128K

General purpose

OpenAI$2.50$10.00$6.25
88
14.1128K

General purpose

OpenAI$2.50$10.00$6.25
88
14.1128K

Open-source

Cohere$2.50$10.00$6.25
84
13.4128K

General purpose

OpenAI$2.50$10.00$6.25
74
11.8128K

General purpose

Cohere$2.50$10.00$6.25
74
11.8256K

General purpose

inflection$2.50$10.00$6.25
74
11.88K

General purpose

inflection$2.50$10.00$6.25
74
11.88K

Speed & cost

Google$2.00$12.00$7.00
94
13.41M

Speed & cost

Google$2.00$12.00$7.00
94
13.41M

Image generation

Google$2.00$12.00$7.00
94
13.466K

General purpose

Amazon$2.50$12.50$7.50
74
9.91M

General purpose

OpenAI$1.75$14.00$7.88
93
11.8128K

Code generation

OpenAI$1.75$14.00$7.88
93
11.8400K

Code generation

OpenAI$1.75$14.00$7.88
93
11.8400K

General purpose

OpenAI$1.75$14.00$7.88
93
11.8128K

General purpose

OpenAI$1.75$14.00$7.88
93
11.8400K

General purpose

OpenAI$2.50$15.00$8.75
93
10.61M

General purpose

xAI$3.00$15.00$9.00
90
10.0131K

General purpose

xAI$3.00$15.00$9.00
90
10.0131K

General purpose

Anthropic$3.00$15.00$9.00
88
9.81M

General purpose

Anthropic$3.00$15.00$9.00
88
9.81M

General purpose

Anthropic$3.00$15.00$9.00
86
9.6200K

General purpose

Anthropic$3.00$15.00$9.00
86
9.6200K

Hard reasoning

Anthropic$3.00$15.00$9.00
86
9.6200K

Search + citations

Perplexity$3.00$15.00$9.00
74
8.2200K

General purpose

xAI$3.00$15.00$9.00
74
8.2256K

Search + citations

Perplexity$3.00$15.00$9.00
74
8.2200K

Multimodal

OpenAI$10.00$10.00$10.00
88
8.8400K

General purpose

OpenAI$5.00$15.00$10.00
88
8.8128K

Multimodal

OpenAI$6.00$18.00$12.00
88
7.3128K

General purpose

Anthropic$5.00$25.00$15.00
95
6.31M

General purpose

Anthropic$5.00$25.00$15.00
95
6.3200K

Multimodal

OpenAI$10.00$30.00$20.00
88
4.4128K

Complex analysis

OpenAI$10.00$30.00$20.00
88
4.4128K

Multimodal

OpenAI$10.00$30.00$20.00
88
4.4128K

Deep research

OpenAI$10.00$40.00$25.00
96
3.8200K

Hard reasoning

OpenAI$15.00$60.00$37.50
88
2.3200K

Multimodal

Anthropic$15.00$75.00$45.00
94
2.1200K

Multimodal

Anthropic$15.00$75.00$45.00
94
2.1200K

Complex analysis

OpenAI$30.00$60.00$45.00
93
2.18K

Multimodal

OpenAI$30.00$60.00$45.00
93
2.18K

Hard reasoning

OpenAI$20.00$80.00$50.00
96
1.9200K

Complex analysis

OpenAI$15.00$120.00$67.50
88
1.3400K

Complex analysis

OpenAI$21.00$168.00$94.50
97
1.0400K

Complex analysis

OpenAI$30.00$180.00$105.00
97
0.91M

Hard reasoning

OpenAI$150.00$600.00$375.00
93
0.2200K
Blended = avg of input + output per 1M tokens
Quality = composite benchmark score (0-100)
Value = quality per dollar (higher is better)
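The Blended and Value columns can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the function names are made up for this example, and it assumes Value is computed from the unrounded blended price, which is what the table's figures suggest (for instance, $0.02 input and $0.05 output at quality 65 gives 1857.1).

```python
def blended_price(input_per_m: float, output_per_m: float) -> float:
    """Average of the input and output price per 1M tokens."""
    return (input_per_m + output_per_m) / 2


def value_score(quality: float, blended: float) -> float:
    """Quality points per blended dollar; higher is better."""
    return quality / blended


# First row of the table: $0.02 input, $0.04 output, quality 72.
blended = blended_price(0.02, 0.04)
value = value_score(72, blended)
print(f"blended=${blended:.2f}  value={value:.1f}")
```

Because the displayed blended price is rounded to cents, recomputing Value from the displayed figure can differ slightly from the table.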

Estimate Your Monthly Cost

Cost Calculator

Cheapest

$2.20/mo

Mistral: Mistral Nemo

Best Value

$6.55/mo

Qwen: Qwen3 235B A22B Instruct 2507 (quality >= 80)

Most Expensive

$25.5K/mo

OpenAI: o1-pro
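The page does not state the workload behind these monthly figures. They are consistent with roughly 50M input tokens and 30M output tokens per month, so the sketch below uses those volumes as an assumption inferred from the listed numbers, not a documented setting. The function name and defaults are hypothetical.

```python
# Assumed monthly workload (inferred from the listed figures, not stated by the page).
INPUT_M_TOKENS = 50   # millions of input tokens per month
OUTPUT_M_TOKENS = 30  # millions of output tokens per month


def monthly_cost(
    input_per_m: float,
    output_per_m: float,
    input_m: float = INPUT_M_TOKENS,
    output_m: float = OUTPUT_M_TOKENS,
) -> float:
    """Estimated monthly spend in dollars for per-1M-token prices."""
    return input_per_m * input_m + output_per_m * output_m


# Cheapest row in the table: $0.02 input, $0.04 output.
print(f"${monthly_cost(0.02, 0.04):,.2f}/mo")
# Most expensive row: $150.00 input, $600.00 output.
print(f"${monthly_cost(150.00, 600.00):,.2f}/mo")
```

Substitute your own token volumes for the two defaults to get an estimate for a different workload.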

Save 30-60% with smart model routing

Swfte Connect automatically routes each request to the optimal model based on complexity, reducing costs without sacrificing quality.


All Models — Estimated Monthly Cost

Mistral: Mistral Nemo
$2.20/mo
Google: Gemma 3n 4B
$2.20/mo
Meta: Llama 3.1 8B Instruct
$2.50/mo
Meta: Llama 3 8B Instruct
$2.70/mo
Llama Guard 3 8B
$2.80/mo
Sao10K: Llama 3 8B Lunaris
$3.50/mo
Meta: Llama 3.2 11B Vision Instruct
$3.92/mo
IBM: Granite 4.0 Micro
$4.15/mo
Google: Gemma 2 9B
$4.20/mo
Qwen: Qwen2.5 Coder 7B Instruct
$4.20/mo
Google: Gemma 3 4B
$4.40/mo
Mistral: Mistral Small 3.1 24B
$4.80/mo
MythoMax 13B
$4.80/mo
OpenAI: gpt-oss-20b
$4.80/mo
Mistral: Mistral Small 3
$4.90/mo
Qwen: Qwen2.5 7B Instruct
$5.00/mo
LiquidAI: LFM2-24B-A2B
$5.10/mo
Qwen: Qwen-Turbo
$5.53/mo
Google: Gemma 3 12B
$5.90/mo
Amazon: Nova Micro 1.0
$5.95/mo
Cohere: Command R7B (12-2024)
$6.38/mo
Qwen: Qwen3 235B A22B Instruct 2507
$6.55/mo
Arcee AI: Trinity Mini
$6.75/mo
NVIDIA: Nemotron Nano 9B V2
$6.80/mo
Qwen: Qwen3.5-9B
$7.00/mo
Meta: Llama 3.2 1B Instruct
$7.35/mo
Microsoft: Phi 4
$7.45/mo
OpenAI: gpt-oss-120b
$7.65/mo
Reka Edge
$8.00/mo
Mistral: Ministral 3 3B 2512
$8.00/mo
Z.ai: GLM 4 32B
$8.00/mo
NVIDIA: Nemotron 3 Nano 30B A3B
$8.50/mo
AllenAI: Olmo 2 32B Instruct
$8.50/mo
Google: Gemma 3 27B
$8.80/mo
Mistral: Mistral Small 3.2 24B
$9.75/mo
Qwen: Qwen3 14B
$10.20/mo
Amazon: Nova Lite 1.0
$10.20/mo
ByteDance: UI-TARS 7B
$11.00/mo
Reka Flash 3
$11.00/mo
Qwen: Qwen3.5-Flash
$11.05/mo
Qwen: Qwen3 32B
$11.20/mo
Mistral: Mistral 7B Instruct v0.1
$11.20/mo
NousResearch: Hermes 2 Pro - Llama-3 8B
$11.20/mo
Qwen: Qwen3 Coder 30B A3B Instruct
$11.60/mo
Baidu: ERNIE 4.5 21B A3B Thinking
$11.90/mo
Baidu: ERNIE 4.5 21B A3B
$11.90/mo
EssentialAI: Rnj 1 Instruct
$12.00/mo
Mistral: Ministral 3 8B 2512
$12.00/mo
Qwen: Qwen3 30B A3B
$12.40/mo
Google: Gemini 2.0 Flash Lite
$12.75/mo
ByteDance Seed: Seed 1.6 Flash
$12.75/mo
OpenAI: gpt-oss-safeguard-20b
$12.75/mo
Meta: Llama 3.2 3B Instruct
$12.75/mo
Meta: Llama 4 Scout
$13.00/mo
Xiaomi: MiMo-V2-Flash
$13.20/mo
Qwen: Qwen3 30B A3B Instruct 2507
$13.50/mo
Mistral: Mistral Small Creative
$14.00/mo
StepFun: Step 3.5 Flash
$14.00/mo
Mistral: Voxtral Small 24B 2507
$14.00/mo
Mistral: Devstral Small 1.1
$14.00/mo
Arcee AI: Spotlight
$14.40/mo
Meta: Llama Guard 4 12B
$14.40/mo
Qwen: Qwen3 8B
$14.50/mo
OpenAI: GPT-5 Nano
$14.50/mo
Meta: Llama 3.3 70B Instruct
$14.60/mo
Z.ai: GLM 4.7 Flash
$15.00/mo
Qwen: Qwen3 30B A3B Thinking 2507
$16.00/mo
Mistral: Ministral 3 14B 2512
$16.00/mo
Google: Gemini 2.5 Flash Lite Preview 09-2025
$17.00/mo
Google: Gemini 2.5 Flash Lite
$17.00/mo
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
$17.00/mo
Google: Gemini 2.0 Flash
$17.00/mo
OpenAI: GPT-4.1 Nano
$17.00/mo
ByteDance Seed: Seed-2.0-Mini
$17.00/mo
Qwen: Qwen3 VL 32B Instruct
$17.68/mo
Qwen2.5 72B Instruct
$17.70/mo
Tongyi DeepResearch 30B A3B
$18.00/mo
Google: Gemma 4 26B A4B
$18.50/mo
Nous: Hermes 4 70B
$18.50/mo
Qwen: Qwen3 VL 8B Instruct
$19.00/mo
Google: Gemma 4 31B
$19.00/mo
Qwen: Qwen VL Plus
$19.11/mo
NVIDIA: Nemotron 3 Super
$20.00/mo
TheDrummer: Rocinante 12B
$21.40/mo
Nex AGI: DeepSeek V3.1 Nex N1
$21.75/mo
Qwen: Qwen3 VL 30B A3B Instruct
$22.10/mo
AllenAI: Olmo 3 32B Think
$22.50/mo
DeepSeek: R1 Distill Qwen 32B
$23.20/mo
Baidu: ERNIE 4.5 VL 28B A3B
$23.80/mo
Nous: Hermes 3 70B Instruct
$24.00/mo
Tencent: Hunyuan A13B Instruct
$24.10/mo
DeepSeek: DeepSeek V3.2
$24.40/mo
Qwen: QwQ 32B
$24.90/mo
xAI: Grok 4.1 Fast
$25.00/mo
xAI: Grok 4 Fast
$25.00/mo
Meta: Llama 4 Maverick
$25.50/mo
OpenAI: GPT-4o-mini Search Preview
$25.50/mo
OpenAI: GPT-4o-mini (2024-07-18)
$25.50/mo
OpenAI: GPT-4o-mini
$25.50/mo
Mistral: Mistral Small 4
$25.50/mo
Upstage: Solar Pro 3
$25.50/mo
Cohere: Command R (08-2024)
$25.50/mo
DeepSeek: DeepSeek V3.2 Exp
$25.80/mo
NVIDIA: Nemotron Nano 12B 2 VL
$28.00/mo
AllenAI: Olmo 3.1 32B Instruct
$28.00/mo
Qwen: Qwen2.5 VL 32B Instruct
$28.00/mo
Mistral: Saba
$28.00/mo
Qwen: Qwen3 Next 80B A3B Thinking
$28.28/mo
Qwen: Qwen3 Coder Next
$28.50/mo
DeepSeek: DeepSeek V3.1
$30.00/mo
xAI: Grok 3 Mini
$30.00/mo
xAI: Grok 3 Mini Beta
$30.00/mo
TheDrummer: Cydonia 24B V4.1
$30.00/mo
Meta: Llama 3.1 70B Instruct
$32.00/mo
TheDrummer: UnslopNemo 12B
$32.00/mo
Z.ai: GLM 4.5 Air
$32.00/mo
DeepSeek: DeepSeek V3 0324
$33.10/mo
Meituan: LongCat Flash Chat
$34.00/mo
DeepSeek: DeepSeek V3.1 Terminus
$34.20/mo
Inception: Mercury 2
$35.00/mo
Inception: Mercury
$35.00/mo
Inception: Mercury Coder
$35.00/mo
MiniMax: MiniMax M2.5
$35.60/mo
Qwen: Qwen3 VL 235B A22B Instruct
$36.40/mo
Qwen: Qwen Plus 0728 (thinking)
$36.40/mo
Qwen: Qwen Plus 0728
$36.40/mo
Qwen: Qwen-Plus
$36.40/mo
Arcee AI: Trinity Large Thinking
$36.50/mo
Qwen: Qwen3 Next 80B A3B Instruct
$37.50/mo
Qwen: Qwen3 Coder Flash
$39.00/mo
Qwen: Qwen3 Coder 480B A35B
$41.00/mo
Mistral: Codestral 2508
$42.00/mo
ReMM SLERP 13B
$42.00/mo
MiniMax: MiniMax M2.1
$42.00/mo
Z.ai: GLM 4.6V
$42.00/mo
DeepSeek: DeepSeek V3
$42.70/mo
MiniMax: MiniMax M2
$42.75/mo
Prime Intellect: INTELLECT-3
$43.00/mo
MiniMax: MiniMax-01
$43.00/mo
Mistral: Mixtral 8x7B Instruct
$43.20/mo
Qwen: Qwen3 VL 8B Thinking
$46.80/mo
Baidu: ERNIE 4.5 300B A47B
$47.00/mo
Qwen: Qwen3.5-35B-A3B
$47.13/mo
OpenAI: GPT-5.4 Nano
$47.50/mo
Meta: Llama 3 70B Instruct
$47.70/mo
TNG: DeepSeek R1T2 Chimera
$48.00/mo
Arcee AI: Coder Large
$49.00/mo
WizardLM-2 8x22B
$49.60/mo
Anthropic: Claude 3 Haiku
$50.00/mo
Kwaipilot: KAT-Coder-Pro V2
$51.00/mo
MiniMax: MiniMax M2.7
$51.00/mo
MiniMax: MiniMax M2-her
$51.00/mo
TheDrummer: Skyfall 36B V2
$51.50/mo
Google: Gemma 2 27B
$52.00/mo
Qwen: Qwen3 235B A22B Thinking 2507
$52.33/mo
Qwen: Qwen3 VL 30B A3B Thinking
$53.30/mo
Sao10K: Llama 3.3 Euryale 70B
$55.00/mo
xAI: Grok Code Fast 1
$55.00/mo
DeepSeek: DeepSeek V3.2 Speciale
$56.00/mo
Qwen: Qwen3.5-27B
$56.55/mo
Google: Gemini 3.1 Flash Lite Preview
$57.50/mo
Baidu: ERNIE 4.5 VL 424B A47B
$58.50/mo
DeepSeek: R1 Distill Llama 70B
$59.00/mo
Qwen: Qwen3.5 Plus 2026-02-15
$59.80/mo
Qwen2.5 Coder 32B Instruct
$63.00/mo
Qwen: Qwen2.5 VL 72B Instruct
$64.00/mo
Mancer: Weaver (alpha)
$67.50/mo
OpenAI: GPT-4.1 Mini
$68.00/mo
Sao10K: Llama 3.1 Euryale 70B v2.2
$68.00/mo
Mistral: Mistral Large 3 2512
$70.00/mo
OpenAI: GPT-3.5 Turbo
$70.00/mo
MoonshotAI: Kimi K2.5
$70.73/mo
Z.ai: GLM 4.7
$72.00/mo
OpenAI: GPT-5 Mini
$72.50/mo
ByteDance Seed: Seed-2.0-Lite
$72.50/mo
ByteDance Seed: Seed 1.6
$72.50/mo
OpenAI: GPT-5.1-Codex-Mini
$72.50/mo
Arcee AI: Virtuoso Large
$73.50/mo
Qwen: Qwen3.5-122B-A10B
$75.40/mo
Morph: Morph V3 Fast
$76.00/mo
EleutherAI: Llemma 7b
$76.00/mo
AlfredPros: CodeLLaMa 7B Instruct Solidity
$76.00/mo
Z.ai: GLM 4.6
$76.50/mo
AionLabs: Aion-1.0-Mini
$77.00/mo
Qwen: Qwen3 235B A22B
$77.35/mo
Xiaomi: MiMo-V2-Omni
$80.00/mo
Mistral: Devstral 2 2512
$80.00/mo
Relace: Relace Apply 3
$80.00/mo
MoonshotAI: Kimi K2 0905
$80.00/mo
Mistral: Mistral Medium 3.1
$80.00/mo
Mistral: Devstral Medium
$80.00/mo
Mistral: Mistral Medium 3
$80.00/mo
Perplexity: Sonar
$80.00/mo
Nous: Hermes 3 405B Instruct
$80.00/mo
MoonshotAI: Kimi K2 Thinking
$83.50/mo
Z.ai: GLM 4.5V
$84.00/mo
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1
$84.00/mo
MiniMax: MiniMax M1
$86.00/mo
DeepSeek: R1 0528
$87.00/mo
AionLabs: Aion-2.0
$88.00/mo
AionLabs: Aion-RP 1.0 (8B)
$88.00/mo
Qwen: Qwen VL Max
$88.40/mo
Qwen: Qwen3.5 397B A17B
$89.70/mo
Google: Nano Banana (Gemini 2.5 Flash Image)
$90.00/mo
Google: Gemini 2.5 Flash
$90.00/mo
Amazon: Nova 2 Lite
$90.00/mo
Qwen: Qwen3 VL 235B A22B Thinking
$91.00/mo
NVIDIA: Llama 3.1 Nemotron 70B Instruct
$96.00/mo
Z.ai: GLM 4.5
$96.00/mo
MoonshotAI: Kimi K2 0711
$97.50/mo
Deep Cogito: Cogito v2.1 671B
$100.00/mo
OpenAI: GPT Audio Mini
$102.00/mo
Morph: Morph V3 Large
$102.00/mo
Z.ai: GLM 5
$105.00/mo
DeepSeek: R1
$110.00/mo
OpenAI: GPT-3.5 Turbo (older v0613)
$110.00/mo
Google: Gemini 3 Flash Preview
$115.00/mo
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
$115.00/mo
Sao10k: Llama 3 Euryale 70B v2.1
$118.40/mo
Qwen: Qwen3 Coder Plus
$130.00/mo
OpenAI: GPT-3.5 Turbo Instruct
$135.00/mo
Amazon: Nova Pro 1.0
$136.00/mo
Xiaomi: MiMo-V2-Pro
$140.00/mo
Relace: Relace Search
$140.00/mo
Nous: Hermes 4 405B
$140.00/mo
Arcee AI: Maestro Reasoning
$144.00/mo
Switchpoint Router
$144.50/mo
Qwen: Qwen3 Max Thinking
$156.00/mo
Qwen: Qwen3 Max
$156.00/mo
Anthropic: Claude 3.5 Haiku
$160.00/mo
OpenAI: GPT-5.4 Mini
$172.50/mo
Qwen: Qwen-Max
$176.80/mo
Z.ai: GLM 5V Turbo
$180.00/mo
Z.ai: GLM 5 Turbo
$180.00/mo
OpenAI: GPT-5 Image Mini
$185.00/mo
OpenAI: o4 Mini High
$187.00/mo
OpenAI: o4 Mini
$187.00/mo
OpenAI: o3 Mini High
$187.00/mo
OpenAI: o3 Mini
$187.00/mo
Anthropic: Claude Haiku 4.5
$200.00/mo
Writer: Palmyra X5
$210.00/mo
Sao10K: Llama 3.1 70B Hanami x1
$240.00/mo
OpenAI: GPT-3.5 Turbo 16k
$270.00/mo
Mistral Large 2411
$280.00/mo
Mistral Large 2407
$280.00/mo
Mistral Large
$280.00/mo
xAI: Grok 4.20 Multi-Agent
$280.00/mo
xAI: Grok 4.20
$280.00/mo
Mistral: Pixtral Large 2411
$280.00/mo
Mistral: Mixtral 8x22B Instruct
$280.00/mo
Magnum v4 72B
$300.00/mo
OpenAI: o4 Mini Deep Research
$340.00/mo
OpenAI: o3
$340.00/mo
OpenAI: GPT-4.1
$340.00/mo
AI21: Jamba Large 1.7
$340.00/mo
Perplexity: Sonar Reasoning Pro
$340.00/mo
Perplexity: Sonar Deep Research
$340.00/mo
OpenAI: GPT-5.1-Codex-Max
$362.50/mo
OpenAI: GPT-5.1
$362.50/mo
OpenAI: GPT-5.1 Chat
$362.50/mo
OpenAI: GPT-5.1-Codex
$362.50/mo
OpenAI: GPT-5
$362.50/mo
Google: Gemini 2.5 Pro
$362.50/mo
Google: Gemini 2.5 Pro Preview 06-05
$362.50/mo
Google: Gemini 2.5 Pro Preview 05-06
$362.50/mo
OpenAI: GPT-5 Codex
$362.50/mo
OpenAI: GPT-5 Chat
$362.50/mo
Goliath 120B
$412.50/mo
OpenAI: GPT-4o Audio
$425.00/mo
OpenAI: GPT-4o Search Preview
$425.00/mo
OpenAI: GPT-4o (2024-11-20)
$425.00/mo
OpenAI: GPT-4o (2024-08-06)
$425.00/mo
OpenAI: GPT-4o
$425.00/mo
Cohere: Command R+ (08-2024)
$425.00/mo
OpenAI: GPT Audio
$425.00/mo
Cohere: Command A
$425.00/mo
Inflection: Inflection 3 Pi
$425.00/mo
Inflection: Inflection 3 Productivity
$425.00/mo
AionLabs: Aion-1.0
$440.00/mo
Google: Gemini 3.1 Pro Preview Custom Tools
$460.00/mo
Google: Gemini 3.1 Pro Preview
$460.00/mo
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)
$460.00/mo
Amazon: Nova Premier 1.0
$500.00/mo
OpenAI: GPT-5.3 Chat
$507.50/mo
OpenAI: GPT-5.3-Codex
$507.50/mo
OpenAI: GPT-5.2-Codex
$507.50/mo
OpenAI: GPT-5.2 Chat
$507.50/mo
OpenAI: GPT-5.2
$507.50/mo
OpenAI: GPT-5.4
$575.00/mo
xAI: Grok 3
$600.00/mo
xAI: Grok 3 Beta
$600.00/mo
Anthropic: Claude Sonnet 4.6
$600.00/mo
Anthropic: Claude Sonnet 4.5
$600.00/mo
Anthropic: Claude Sonnet 4
$600.00/mo
Anthropic: Claude 3.7 Sonnet
$600.00/mo
Anthropic: Claude 3.7 Sonnet (thinking)
$600.00/mo
Perplexity: Sonar Pro Search
$600.00/mo
xAI: Grok 4
$600.00/mo
Perplexity: Sonar Pro
$600.00/mo
OpenAI: GPT-4o (2024-05-13)
$700.00/mo
OpenAI: GPT-5 Image
$800.00/mo
OpenAI: GPT-4o (extended)
$840.00/mo
Anthropic: Claude Opus 4.6
$1.0K/mo
Anthropic: Claude Opus 4.5
$1.0K/mo
OpenAI: GPT-4 Turbo
$1.4K/mo
OpenAI: GPT-4 Turbo Preview
$1.4K/mo
OpenAI: GPT-4 Turbo (older v1106)
$1.4K/mo
OpenAI: o3 Deep Research
$1.7K/mo
OpenAI: o1
$2.5K/mo
Anthropic: Claude Opus 4.1
$3.0K/mo
Anthropic: Claude Opus 4
$3.0K/mo
OpenAI: GPT-4 (older v0314)
$3.3K/mo
OpenAI: GPT-4
$3.3K/mo
OpenAI: o3 Pro
$3.4K/mo
OpenAI: GPT-5 Pro
$4.3K/mo
OpenAI: GPT-5.2 Pro
$6.1K/mo
OpenAI: GPT-5.4 Pro
$6.9K/mo
OpenAI: o1-pro
$25.5K/mo
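The three summary cards (Cheapest, Best Value, Most Expensive) can be derived from a list like the one above. This is a minimal sketch, assuming "Best Value" means the lowest monthly cost among models at or above a quality floor of 80, as the card's "(quality >= 80)" note suggests. The quality scores in the sample are matched to table rows by price and should be treated as assumptions.

```python
from typing import NamedTuple, Optional, Sequence, Tuple


class Model(NamedTuple):
    name: str
    monthly_cost: float  # dollars per month, from the list above
    quality: int         # assumed: matched to table rows by price


def summarize(
    models: Sequence[Model], quality_floor: int = 80
) -> Tuple[Model, Optional[Model], Model]:
    """Return (cheapest, best value above the quality floor, most expensive)."""
    cheapest = min(models, key=lambda m: m.monthly_cost)
    eligible = [m for m in models if m.quality >= quality_floor]
    best_value = min(eligible, key=lambda m: m.monthly_cost) if eligible else None
    priciest = max(models, key=lambda m: m.monthly_cost)
    return cheapest, best_value, priciest


# A small sample of the list; not the full 318 models.
MODELS = [
    Model("Mistral: Mistral Nemo", 2.20, 72),
    Model("Google: Gemma 3n 4B", 2.20, 50),
    Model("Qwen: Qwen3 235B A22B Instruct 2507", 6.55, 82),
    Model("OpenAI: o1-pro", 25500.00, 93),
]

cheapest, best_value, priciest = summarize(MODELS)
print("Cheapest:", cheapest.name)
print("Best value:", best_value.name if best_value else "none")
print("Most expensive:", priciest.name)
```

Raising or lowering `quality_floor` changes which model counts as best value, so the result depends on how much quality a workload actually needs.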

Understanding AI API Pricing in 2026

AI model pricing has changed dramatically. GPT-4 launched in March 2023 at $30 per million input tokens; GPT-4.1 is listed in this index at $2 per million, a drop of over 90%. The decline has been driven by competition from Anthropic, Google, and open-source challengers like DeepSeek and Meta's Llama.

Even among well-known models, input prices span a 150x range: from Google's Gemini 2.0 Flash at $0.10 per 1M input tokens to Claude Opus 4 at $15 per 1M input tokens. Across the full index the spread is far wider (the 8824x Price Range reported at the top of this page). The key insight is that price doesn't always correlate with quality: DeepSeek V3 earns a quality score of 86 at just $0.27 per 1M input tokens, while some premium models charge 50x more for marginal quality gains.

How to Optimize AI API Costs

The most effective strategy is model routing: sending simple queries to cheap, fast models and complex queries to premium models. A gateway like Swfte Connect automates this, typically reducing costs by 30-60% without sacrificing quality.
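The routing idea can be sketched in a few lines. This is a toy heuristic under stated assumptions, not a description of how Swfte Connect or any other gateway decides: the tier prices are taken from two rows of the table ($0.10/$0.40 and $3.00/$15.00), while the keyword list, length threshold, and all names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    name: str
    input_per_m: float   # dollars per 1M input tokens
    output_per_m: float  # dollars per 1M output tokens


CHEAP = Tier("cheap", 0.10, 0.40)
PREMIUM = Tier("premium", 3.00, 15.00)

# Hypothetical signals that a prompt needs a stronger model.
COMPLEX_HINTS = ("prove", "refactor", "analyze", "step by step", "debug")


def route(prompt: str, max_simple_chars: int = 500) -> Tier:
    """Send long or complex-looking prompts to the premium tier."""
    text = prompt.lower()
    looks_complex = len(text) > max_simple_chars or any(
        hint in text for hint in COMPLEX_HINTS
    )
    return PREMIUM if looks_complex else CHEAP


def request_cost(tier: Tier, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one request on the given tier."""
    return (
        tier.input_per_m * input_tokens + tier.output_per_m * output_tokens
    ) / 1_000_000


for prompt in ("What is the capital of France?", "Debug this stack trace for me."):
    tier = route(prompt)
    print(f"{tier.name}: ${request_cost(tier, 200, 300):.6f}")
```

A production router would more likely use a classifier or a confidence signal than keyword matching, but the cost structure is the same: savings come from the share of traffic that can safely stay on the cheap tier.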

Other strategies include using cached input pricing (offered by Google and DeepSeek, among others), batching requests to reduce per-call overhead, and self-hosting open-source models for predictable workloads.
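To see how much cached input pricing can matter, the sketch below estimates input spend when part of the prompt is served from cache. The cached price ratio (0.25) and the 60% cache-hit share are placeholder assumptions for illustration; actual discounts vary by provider and are not given on this page.

```python
def input_cost_with_cache(
    input_m_tokens: float,
    price_per_m: float,
    cached_fraction: float,
    cached_price_ratio: float,
) -> float:
    """Input cost in dollars when a share of tokens is billed at a cached rate.

    cached_fraction: share of input tokens served from cache (0 to 1).
    cached_price_ratio: cached price as a fraction of the normal price.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    cached = input_m_tokens * cached_fraction
    fresh = input_m_tokens - cached
    return fresh * price_per_m + cached * price_per_m * cached_price_ratio


# 50M input tokens per month at $0.10 per 1M, with assumed cache parameters.
baseline = input_cost_with_cache(50, 0.10, cached_fraction=0.0, cached_price_ratio=0.25)
with_cache = input_cost_with_cache(50, 0.10, cached_fraction=0.6, cached_price_ratio=0.25)
print(f"no cache: ${baseline:.2f}  with cache: ${with_cache:.2f}")
```

The saving scales with how much of each prompt is repeated context, so workloads with long shared system prompts or documents benefit most.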

Pricing Trends to Watch

  • Price compression continues: Expect another 50%+ reduction across flagship models by end of 2026
  • Reasoning premium: Models with extended thinking (o3, R1) cost more due to higher compute per request
  • Open-source pressure: Llama 4 and DeepSeek are forcing closed providers to cut prices faster
  • Cached pricing: More providers are offering discounted rates for repeated context