AI Leaderboard

AI Models & Tools Benchmark

Real-time tracking of global AI model benchmarks and rankings.

LLM Benchmarks

RankModelCode ArenaChat ArenaGPQAPricing (In/Out)
#1
Gemini 3.1 Pro
Google
Ctx: 1.0MModel
2130
1222
94.3
$2.5 / $15
#2
Claude Opus 4.6
Anthropic
Ctx: 1.0MModel
2040
1491
91.3
$5 / $25
#3
Claude Opus 4.7
Anthropic
Ctx: 1.0MModel
1900
358
94.2
$5 / $25
#4
Gemini 3 Flash
Google
Ctx: 1.0MModel
1703
1143
90.4
$0.5 / $3
#5
Claude Sonnet 4.6
Anthropic
Ctx: 200kModel
1642
956
89.9
$3 / $15
#6
Claude Opus 4.5
Anthropic
Ctx: 200kModel
1614
1342
87
$5 / $25
#7
Gemini 3 Pro
Google
Ctx: N/AModel
1579
1045
91.9
N/A
#8
GPT-5.2
OpenAI
Ctx: 400kModel
1514
1170
92.4
$1.75 / $14
#9
Gemma 4 26B-A4B
Google
Ctx: 262kModel
1233
594
82.3
$0.13 / $0.4
#10
Qwen3.5-397B-A17B
Alibaba Cloud / Qwen Team
Ctx: 262kModel
1208
963
88.4
$0.6 / $3.6
#11
Qwen3.6 Plus
Alibaba Cloud / Qwen Team
Ctx: 1.0MModel
1202
750
90.4
$0.5 / $3
#12
Claude Opus 4.1
Anthropic
Ctx: N/AModel
1189
1180
80.9
N/A
#13
Claude Sonnet 4.5
Anthropic
Ctx: 200kModel
1166
1308
83.4
$3 / $15
#14
Gemma 4 31B
Google
Ctx: 262kModel
1067
881
84.3
$0.14 / $0.4
#15
GPT-4.1 mini
OpenAI
Ctx: 1.0MModel
965
528
65
$0.4 / $1.6
#16
Claude Opus 4
Anthropic
Ctx: N/AModel
932
1088
79.6
N/A
#17
Claude Haiku 4.5
Anthropic
Ctx: 200kModel
894
1188
73
$1 / $5
#18
Claude Sonnet 4
Anthropic
Ctx: N/AModel
882
856
75.4
N/A
#19
Gemini 3.1 Flash-Lite
Google
Ctx: 1.0MModel
859
756
86.9
$0.25 / $1.5
#20
GPT-4.1
OpenAI
Ctx: 1.0MModel
778
1237
66.3
$2 / $8