What is the best AI model in June 2026?

By quality index, Anthropic: Claude Fable 5 (100/100) currently leads our June 2026 leaderboard. The "best" model depends on the workload — text chat, code, reasoning, image generation, speed, or cost. See the rankings below for the top model in each category.

What is the LLM leaderboard for June 2026?

Our LLM leaderboard ranks 353 large language models by composite quality (MMLU Pro, HumanEval, MATH), LMSys Arena Elo, pricing per million tokens, and inference speed. See /ai/llm/leaderboard for the LLM-only view.

What is the LMSys Chatbot Arena leaderboard June 2026?

LMArena (formerly LMSys Chatbot Arena) ranks models by pairwise human preference votes. Our June 2026 snapshot mirrors the latest published Elo with provider pricing and benchmark cross-validation. See /lmarena for the full breakdown.

Which AI model has the best price-to-quality in 2026?

DeepSeek V4 Pro (Apache 2.0) currently leads price-per-quality. Its launch-promo rate — about $0.44 input / $0.87 output per 1M tokens — is a small fraction of frontier pricing for a comparable Arena Elo band (Gemini 3.1 Pro is $2 / $12; GPT-5.5 is $5 / $30). Grok 4.3 ($1.25 / $2.50) and Kimi K2.6 ($0.73 / $3.49) are close behind on value.

Which AI image model is best in 2026?

For 2026 image generation, the leaderboard tracks Imagen 4, Flux 2, DALL-E 4, and Stable Diffusion 4 Ultra. See our Image Model Leaderboard section below for ranked quality scores.

Which AI coding model is best in June 2026?

Claude Opus 4.8 leads on coding benchmarks — SWE-bench Verified 88.6%, SWE-bench Pro 69.2%, Terminal-Bench 2.1 74.6% — and tops the Artificial Analysis Intelligence Index at 61.4 (the new #1). GPT-5.5, Gemini 3.1 Pro, and Qwen 3.7 Max follow closely. For open weights, DeepSeek V4 Pro is the leader. See /ai/llm/leaderboard for the coding-specific ranking.

How are AI models ranked on this leaderboard?

The leaderboard uses a composite quality index (0-100) drawn from MMLU Pro (knowledge), HumanEval (coding), and MATH (reasoning), validated against LMSys Chatbot Arena Elo. Pricing comes from official provider pages and OpenRouter; speed (tok/s, TTFT) from Artificial Analysis.

What is the best open-source AI model in June 2026?

DeepSeek V4 Pro (Apache 2.0, 1.6T MoE / 49B active, 1M context) is the June 2026 leader for open-weights. Gemma 4 (Google) and NVIDIA Nemotron 3 Nano Omni are strong alternatives.

Updated Jun 9, 2026

AI Model Leaderboard — June 2026

Every major AI model ranked by quality, speed, pricing, and value. Filter by category, sort by any metric, and find the right model for your use case. Live data refreshed regularly with LMSys Arena Elo, official provider pricing, and Artificial Analysis benchmarks.

In short:As of June 2026, Anthropic: Claude Fable 5 leads the AI model leaderboard at 100/100 on our composite quality index, across 353+ models ranked by quality, price, speed, and value. The best pick depends on the workload — sort the table by the metric that matters to you.

🥇

Anthropic: Claude Fable 5

100/100

🥈

Anthropic: Claude Opus 4.8

99/100

🥉

OpenAI: GPT-5.5 Pro

98/100

Climb the Leaderboard

Stop reading — start ranking

Three ways to put this leaderboard to work. Pick any one — they all start with a free Swfte account, no card required.

Run Anthropic: Claude Fable 5 free Get pinged on rank changes The Model-Hopper Challenge50% OFF · 6 MO

Monthly Snapshot

June 2026: Top Models, Best Value, Fastest Inference

The June 2026 ranking covers 353 models across LMSys Arena Elo, MMLU Pro, HumanEval, MATH, pricing, and inference speed. Top of the table: Anthropic: Claude Fable 5 at 100/100 quality. The full table below is sortable by any metric. Live data is refreshed regularly from official provider pricing pages and the public Arena.

Top 5 by Quality Index

Anthropic: Claude Fable 5 — 100/100
Anthropic: Claude Opus 4.8 — 99/100
OpenAI: GPT-5.5 Pro — 98/100
OpenAI: GPT-5.5 — 97/100
OpenAI: GPT-5.4 Pro — 97/100

Best Price-to-Quality

Mistral: Mistral Nemo — $0.03/1M out
Mistral: Mistral Small 3 — $0.08/1M out
Qwen: Qwen3 235B A22B Instruct 2507 — $0.1/1M out
Google: Gemma 3 12B — $0.13/1M out
Qwen: Qwen3.5-9B — $0.15/1M out

See our LMSys Arena deep dive and the monthly release roundup.

353 models

#	Model	Quality	Arena ELO	Speed	Price	Context	Value	Released
1	Anthropic: Claude Fable 5 New Anthropic · Frontier agentic coding & knowledge work	100	1525	58 t/s	$10 / $50	1M	3.3	Jun 2026
2	Anthropic: Claude Opus 4.8 New Anthropic · Coding, agents & computer use	99	1512	72 t/s	$5 / $25	1M	6.6	May 2026
3	OpenAI: GPT-5.5 Pro OpenAI · Reasoning at any cost	98	1510	68 t/s	$30 / $180	1M	0.9	Apr 2026
4	OpenAI: GPT-5.5 OpenAI · Frontier general purpose	97	1506	70 t/s	$5 / $30	1M	5.5	Apr 2026
5	OpenAI: GPT-5.4 Pro OpenAI · Complex analysis	97	—	—	$30 / $180	1M	0.9	Mar 2026
6	OpenAI: GPT-5.2 Pro OpenAI · Complex analysis	97	—	—	$21 / $168	400K	1.0	Dec 2025
7	Anthropic: Claude Opus 4.7 (Fast) Anthropic · Complex analysis	97	—	—	$30 / $150	1M	1.1	May 2026
8	Anthropic: Claude Opus 4.7 Anthropic · Coding & agentic workflows	96	1505	68 t/s	$5 / $25	1M	6.4	Apr 2026
9	OpenAI: o3 Deep Research OpenAI · Deep research	96	—	—	$10 / $40	200K	3.8	Oct 2025
10	OpenAI: o4 Mini Deep Research OpenAI · Deep research	96	—	—	$2 / $8	200K	19.2	Oct 2025
11	OpenAI: o3 Pro OpenAI · Hard reasoning	96	—	—	$20 / $80	200K	1.9	Jun 2025
12	Google: Gemini 3.1 Pro Preview Custom Tools Google · Speed & cost	96	1505	—	$2 / $12	1M	13.7	Feb 2026
13	Google: Gemini 3.1 Pro Preview Google · Science & long-context	96	1505	131 t/s	$2 / $12	1M	13.7	Apr 2026
14	Anthropic: Claude Opus 4.6 Anthropic · General purpose	95	1490	—	$5 / $25	1M	6.3	Feb 2026
15	Anthropic: Claude Opus 4.5 Anthropic · General purpose	95	—	—	$5 / $25	200K	6.3	Nov 2025
16	Anthropic: Claude Opus 4.6 (Fast) Anthropic · Complex analysis	95	—	—	$30 / $150	1M	1.1	Apr 2026
17	Google: Nano Banana Pro (Gemini 3 Pro Image Preview) Google · Image generation	94	—	—	$2 / $12	66K	13.4	Nov 2025
18	Anthropic: Claude Opus 4.1 Anthropic · Multimodal	94	—	—	$15 / $75	200K	2.1	Aug 2025
19	OpenAI: o3 OpenAI · Hard reasoning	94	1370	68 t/s	$10 / $40	200K	3.8	Apr 2025
20	Qwen: Qwen3.7 Max New Alibaba Cloud · Long autonomous agentic runs	94	1488	90 t/s	$2.5 / $7.5	1M	18.8	May 2026
21	xAI: Grok 4.3 xAI · Agentic tasks & real-time info	93	1496	83 t/s	$1.25 / $2.5	1M	49.6	May 2026
22	OpenAI: GPT-5.4 OpenAI · General purpose	93	1495	—	$2.5 / $15	1M	10.6	Mar 2026
23	OpenAI: GPT-5.3 Chat OpenAI · General purpose	93	—	—	$1.75 / $14	128K	11.8	Mar 2026
24	OpenAI: GPT-5.3-Codex OpenAI · Code generation	93	—	—	$1.75 / $14	400K	11.8	Feb 2026
25	OpenAI: GPT-5.2-Codex OpenAI · Code generation	93	—	—	$1.75 / $14	400K	11.8	Jan 2026
26	OpenAI: GPT-5.2 Chat OpenAI · General purpose	93	—	—	$1.75 / $14	128K	11.8	Dec 2025
27	OpenAI: GPT-5.2 OpenAI · General purpose	93	—	—	$1.75 / $14	400K	11.8	Dec 2025
28	OpenAI: GPT-5.1-Codex-Max OpenAI · Code generation	93	—	—	$1.25 / $10	400K	16.5	Dec 2025
29	OpenAI: GPT-5.1 OpenAI · General purpose	93	—	—	$1.25 / $10	400K	16.5	Nov 2025
30	OpenAI: GPT-5.1 Chat OpenAI · General purpose	93	—	—	$1.25 / $10	128K	16.5	Nov 2025
31	OpenAI: GPT-5.1-Codex OpenAI · Code generation	93	—	—	$1.25 / $10	400K	16.5	Nov 2025
32	OpenAI: o1-pro OpenAI · Hard reasoning	93	—	—	$150 / $600	200K	0.2	Mar 2025
33	OpenAI: GPT-4 (older v0314) OpenAI · Complex analysis	93	—	—	$30 / $60	8K	2.1	May 2023
34	OpenAI: GPT-4 OpenAI · Multimodal	93	—	—	$30 / $60	8K	2.1	May 2023
35	xAI: Grok 4.20 xAI · General purpose	93	1496	—	$1.25 / $2.5	2M	49.6	Mar 2026
36	OpenAI: GPT-5.4 Image 2 OpenAI · Complex analysis	93	—	—	$8 / $15	272K	8.1	Apr 2026
37	MoonshotAI: Kimi K2.6 Moonshot AI · Frontier quality at low cost	92	1466	48 t/s	$0.73 / $3.49	256K	43.6	Apr 2026
38	Google: Gemini 2.5 Pro Google · Multimodal + value	92	1345	87 t/s	$1.25 / $10	1M	16.4	Mar 2025
39	Anthropic: Claude Opus 4 Anthropic · Complex analysis	91	1360	52 t/s	$15 / $75	200K	2.0	May 2025
40	TNG: DeepSeek R1T2 ChimeraOSS · Hard reasoning	91	—	—	$0.3 / $1.1	164K	130.0	Jul 2025
41	Google: Gemini 2.5 Pro Preview 06-05 Google · Speed & cost	91	—	—	$1.25 / $10	1M	16.2	Jun 2025
42	DeepSeek: R1 0528OSS DeepSeek · Hard reasoning	91	—	—	$0.5 / $2.15	164K	68.7	May 2025
43	Google: Gemini 2.5 Pro Preview 05-06 Google · Speed & cost	91	—	—	$1.25 / $10	1M	16.2	May 2025
44	DeepSeek: R1 Distill Qwen 32BOSS DeepSeek · Hard reasoning	91	—	—	$0.29 / $0.29	33K	313.8	Jan 2025
45	DeepSeek: R1 Distill Llama 70BOSS DeepSeek · Hard reasoning	91	—	—	$0.7 / $0.8	131K	121.3	Jan 2025
46	DeepSeek: R1OSS DeepSeek · Hard reasoning	91	—	—	$0.7 / $2.5	64K	56.9	Jan 2025
47	DeepSeek: DeepSeek V4 ProOSS DeepSeek · Open-source value leader	90	1467	33 t/s	$1.74 / $3.48	1M	34.5	Apr 2026
48	Anthropic: Claude Sonnet 4.6 Anthropic · Coding & balance	90	1467	73 t/s	$3 / $15	1M	10.0	Feb 2026
49	OpenAI: GPT-5 OpenAI · General purpose	90	1455	—	$1.25 / $10	400K	16.0	Aug 2025
50	xAI: Grok 3 Beta xAI · General purpose	90	—	—	$3 / $15	131K	10.0	Apr 2025
51	Qwen: Qwen3.6 Max PreviewOSS Alibaba Cloud · Open-source	90	—	—	$1.04 / $6.24	262K	24.7	Apr 2026
52	OpenAI: GPT-4.1 OpenAI · Long context	89	1310	120 t/s	$2 / $8	1M	17.8	Apr 2025
53	MoonshotAI: Kimi K2.5 Moonshot AI · Speed & cost	89	1452	—	$0.4 / $1.9	262K	77.4	Jan 2026
54	MiniMax: MiniMax M3 NewOSS · Open-weight agentic coding	89	1455	80 t/s	$0.6 / $2.4	1M	59.3	Jun 2026
55	Z.ai: GLM 5.1OSS · Open-weight agentic & tool use	88	1467	48 t/s	$0.98 / $3.08	200K	43.3	Apr 2026
56	OpenAI: GPT-5 Image OpenAI · Multimodal	88	—	—	$10 / $10	400K	8.8	Oct 2025
57	OpenAI: GPT-5 Pro OpenAI · Complex analysis	88	—	—	$15 / $120	400K	1.3	Oct 2025
58	Anthropic: Claude Sonnet 4.5 Anthropic · General purpose	88	—	—	$3 / $15	1M	9.8	Sep 2025
59	OpenAI: GPT-4o Audio OpenAI · General purpose	88	—	—	$2.5 / $10	128K	14.1	Aug 2025
60	OpenAI: GPT-4o Search Preview OpenAI · Search + citations	88	—	—	$2.5 / $10	128K	14.1	Mar 2025
61	OpenAI: o1 OpenAI · Hard reasoning	88	—	—	$15 / $60	200K	2.3	Dec 2024
62	OpenAI: GPT-4o (2024-11-20) OpenAI · General purpose	88	—	—	$2.5 / $10	128K	14.1	Nov 2024
63	OpenAI: GPT-4o OpenAI · General purpose	88	—	—	$2.5 / $10	128K	14.1	May 2024
64	OpenAI: GPT-4o (extended) OpenAI · Multimodal	88	—	—	$6 / $18	128K	7.3	May 2024
65	OpenAI: GPT-4o (2024-05-13) OpenAI · General purpose	88	—	—	$5 / $15	128K	8.8	May 2024
66	OpenAI: GPT-4 Turbo OpenAI · Multimodal	88	—	—	$10 / $30	128K	4.4	Apr 2024
67	OpenAI: GPT-4 Turbo Preview OpenAI · Complex analysis	88	—	—	$10 / $30	128K	4.4	Jan 2024
68	OpenAI: GPT-4 Turbo (older v1106) OpenAI · Multimodal	88	—	—	$10 / $30	128K	4.4	Nov 2023
69	Z.ai: GLM 5OSS · Open-source	88	1450	—	$0.6 / $1.92	80K	69.8	Feb 2026
70	Anthropic: Claude Sonnet 4 Anthropic · Coding & balance	88	1320	95 t/s	$3 / $15	200K	9.8	May 2025
71	OpenAI: o3 Mini OpenAI · Reasoning & math	88	1305	155 t/s	$1.1 / $4.4	200K	32.0	Jan 2025
72	xAI: Grok 3 xAI · Real-time info	87	1330	82 t/s	$3 / $15	131K	9.7	Feb 2025
73	DeepSeek: DeepSeek V3.2OSS DeepSeek · Open-source	87	1455	—	$0.252 / $0.378	164K	276.2	Dec 2025
74	Nex AGI: DeepSeek V3.1 Nex N1OSS · Open-source	86	—	—	$0.135 / $0.5	131K	270.9	Dec 2025
75	DeepSeek: DeepSeek V3.2 SpecialeOSS DeepSeek · Open-source	86	—	—	$0.287 / $0.431	164K	239.6	Dec 2025
76	DeepSeek: DeepSeek V3.2 ExpOSS DeepSeek · Open-source	86	—	—	$0.27 / $0.41	164K	252.9	Sep 2025
77	DeepSeek: DeepSeek V3.1 TerminusOSS DeepSeek · Open-source	86	—	—	$0.27 / $0.95	164K	141.0	Sep 2025
78	DeepSeek: DeepSeek V3.1OSS DeepSeek · Open-source	86	—	—	$0.21 / $0.79	33K	172.0	Aug 2025
79	DeepSeek: DeepSeek V3 0324OSS DeepSeek · Open-source	86	—	—	$0.2 / $0.77	164K	177.3	Mar 2025
80	Anthropic: Claude 3.7 Sonnet Anthropic · General purpose	86	—	—	$3 / $15	200K	9.6	Feb 2025
81	Anthropic: Claude 3.7 Sonnet (thinking) Anthropic · Hard reasoning	86	—	—	$3 / $15	200K	9.6	Feb 2025
82	DeepSeek: DeepSeek V3OSS DeepSeek · Best open-source value	86	1310	62 t/s	$0.27 / $1.1	128K	125.5	Mar 2025
83	Qwen: Qwen3.6 Plus Alibaba Cloud · Multilingual & APAC	86	1448	124 t/s	$1.4 / $5.6	256K	24.6	Apr 2026
84	OpenAI: GPT-4o (2024-08-06) OpenAI · General purpose	85	1285	109 t/s	$2.5 / $10	128K	13.6	May 2024
85	Mistral: Mistral Large 3 2512OSS Mistral AI · Open-source	85	—	—	$0.5 / $1.5	262K	85.0	Dec 2025
86	Mistral Large 2407OSS Mistral AI · Open-source	85	—	—	$2 / $6	131K	21.3	Nov 2024
87	Mistral LargeOSS Mistral AI · Open-source	85	—	—	$2 / $6	128K	21.3	Feb 2024
88	Google: Gemini 3.5 Flash New Google · Speed & cost	84	—	—	$1.5 / $9	1M	16.0	May 2026
89	OpenAI: GPT-5.4 Mini OpenAI · Speed & cost	83	—	—	$0.75 / $4.5	400K	31.6	Mar 2026
90	OpenAI: GPT-5 Mini OpenAI · Speed & cost	83	—	—	$0.25 / $2	400K	73.8	Aug 2025
91	Qwen: Qwen3.5-9BOSS Alibaba Cloud · Open-source	82	—	—	$0.04 / $0.15	256K	863.2	Mar 2026
92	Qwen: Qwen3.5-35B-A3BOSS Alibaba Cloud · Open-source	82	—	—	$0.139 / $1	262K	144.0	Feb 2026
93	Qwen: Qwen3.5-27BOSS Alibaba Cloud · Open-source	82	—	—	$0.195 / $1.56	262K	93.4	Feb 2026
94	Qwen: Qwen3.5-122B-A10BOSS Alibaba Cloud · Open-source	82	—	—	$0.26 / $2.08	262K	70.1	Feb 2026
95	Qwen: Qwen3.5-FlashOSS Alibaba Cloud · Speed & cost	82	—	—	$0.065 / $0.26	1M	504.6	Feb 2026
96	Qwen: Qwen3.5 Plus 2026-02-15OSS Alibaba Cloud · Open-source	82	—	—	$0.26 / $1.56	1M	90.1	Feb 2026
97	Qwen: Qwen3.5 397B A17BOSS Alibaba Cloud · Open-source	82	—	—	$0.39 / $2.34	262K	60.1	Feb 2026
98	Qwen: Qwen3 Max ThinkingOSS Alibaba Cloud · Hard reasoning	82	—	—	$0.78 / $3.9	262K	35.0	Feb 2026
99	Qwen: Qwen3 Coder NextOSS Alibaba Cloud · Code generation	82	—	—	$0.11 / $0.8	262K	180.2	Feb 2026
100	Qwen: Qwen3 VL 32B InstructOSS Alibaba Cloud · Open-source	82	—	—	$0.104 / $0.416	131K	315.4	Oct 2025

Page 1 of 4 · 1–100 of 353

Quality = composite benchmark (MMLU, HumanEval, MATH)Arena ELO = LMSYS Chatbot Arena ratingValue = quality per dollarPrice = input / output per 1M tokens

LLM Leaderboard June 2026

Large language models ranked by LMSys Arena Elo, MMLU, HumanEval, MATH, pricing, and tokens-per-second. Text-only view.

LM Leaderboard June 2026

Language model rankings: LMArena Elo, price-to-Elo ratio, and open-weight vs closed-source comparison.

LMSys Arena Leaderboard June 2026

LMArena (formerly LMSys Chatbot Arena) tracker — pairwise human preference Elo scores, refreshed as the public arena publishes.

Image Model Leaderboard 2026

Generative AI image and video models — Imagen 4, Flux 2, DALL-E 4, Stable Diffusion 4 Ultra, Sora 2 ranked by quality and cost.

Coding Model Leaderboard 2026

AI coding assistants ranked: Claude Opus, GPT-5.5, Gemini 3.1 Pro, DeepSeek V4, plus HumanEval and SWE-Bench scores.

Vendor Lock-in Leaderboard 2026

AI vendors ranked by portability — license, weight availability, fine-tuning openness, and exit cost score.

How We Rank AI Models

Our leaderboard uses a composite quality index that combines three key benchmarks: MMLU Pro (measuring knowledge and reasoning across 57 subjects), HumanEval (measuring code generation ability), and MATH (measuring mathematical problem-solving). Scores are normalized to a 0-100 scale and cross-referenced against LMSYS Chatbot Arena ELO ratings for real-world validation.

We track speed (tokens per second), time-to-first-token (TTFT), pricing, and context window size to give you a complete picture. The Value Score divides quality by cost, showing you which models deliver the most capability per dollar.

Key Trends in AI Model Performance

A new frontier #1: Claude Opus 4.8 (May 28, 2026) tops the Artificial Analysis Intelligence Index at 61.4, edging out GPT-5.5, with standout gains in coding (SWE-bench Pro 69.2%) and computer-use agents (Online-Mind2Web 84%). Alibaba's Qwen 3.7 Max debuts as the highest-ranked Chinese model at #5
Open weights closing the gap: DeepSeek V4 Pro, GLM-5.1, and Kimi K2.6 now trade blows with closed frontier models on reasoning and coding — at a fraction of the price
Reasoning is built in: Frontier models like GPT-5.5, Claude Opus 4.8, and Gemini 3.1 Pro ship extended thinking by default, while deep-research variants trade latency for accuracy on the hardest tasks
Million-token context is standard: 1M-token windows are now table stakes for flagship models across OpenAI, Anthropic, Google, and xAI
Price keeps falling: Fast- and flash-tier models pair strong quality with low latency, while launch promos (such as DeepSeek V4) push frontier-class quality below $1 per million tokens

Choosing the Right Model

There is no single "best" model — it depends on your use case. For most applications, a model routing approach works best: route simple queries to fast, cheap models and complex queries to frontier models. This gives you the best of both worlds — low cost and high quality.