████████╗██╗   ██╗██████╗ ██╗███╗   ██╗ ██████╗ ███████╗ ██████╗ ██████╗ ██████╗ ███████╗    ██████╗ ██████╗ ███╗   ███╗
╚══██╔══╝██║   ██║██╔══██╗██║████╗  ██║██╔════╝ ██╔════╝██╔════╝██╔═══██╗██╔══██╗██╔════╝   ██╔════╝██╔═══██╗████╗ ████║
   ██║   ██║   ██║██████╔╝██║██╔██╗ ██║██║  ███╗███████╗██║     ██║   ██║██████╔╝█████╗     ██║     ██║   ██║██╔████╔██║
   ██║   ██║   ██║██╔══██╗██║██║╚██╗██║██║   ██║╚════██║██║     ██║   ██║██╔══██╗██╔══╝     ██║     ██║   ██║██║╚██╔╝██║
   ██║   ╚██████╔╝██║  ██║██║██║ ╚████║╚██████╔╝███████║╚██████╗╚██████╔╝██║  ██║███████╗██╗╚██████╗╚██████╔╝██║ ╚═╝ ██║
   ╚═╝    ╚═════╝ ╚═╝  ╚═╝╚═╝╚═╝  ╚═══╝ ╚═════╝ ╚══════╝ ╚═════╝ ╚═════╝ ╚═╝  ╚═╝╚══════╝╚═╝ ╚═════╝ ╚═════╝ ╚═╝     ╚═╝

Figure out which LLM is "best". Data from LMSYS's Chatbot Arena. Click here to learn more.

Rank Model Elo
1 claude-opus-4-6-thinking 1505±10 ±3,679
2 claude-opus-4-6 1503±9 ±4,427
3 gemini-3-pro 1486±4 ±35,513
4 grok-4.1-thinking 1475±4 ±35,214
5 gemini-3-flash 1472±5 ±26,167
6 claude-opus-4-5-20251101-thinking-32k 1471±5 ±27,282
7 claude-opus-4-5-20251101 1467±4 ±32,179
8 grok-4.1 1464±4 ±39,284
9 gemini-3-flash (thinking-minimal) 1462±5 ±17,361
10 gpt-5.1-high 1458±4 ±31,519
11 ernie-5.0-0110 1452±6Preliminary ±11,080
12 claude-sonnet-4-5-20250929 1451±4 ±43,519
13 claude-sonnet-4-5-20250929-thinking-32k 1450±4 ±45,809
14 ernie-5.0-preview-1203 1449±7 ±9,773
15 gemini-2.5-pro 1449±3 ±94,838
16 claude-opus-4-1-20250805-thinking-16k 1449±4 ±49,947
17 kimi-k2.5-thinking 1448±7 ±8,083
18 claude-opus-4-1-20250805 1445±3 ±74,911
19 gpt-4.5-preview-2025-02-27 1444±6 ±14,549
20 chatgpt-4o-latest-20250326 1442±3 ±82,329
21 glm-4.7 1441±6 ±12,013
22 kimi-k2.5-instant 1438±10 ±3,830
23 gpt-5.1 1438±4 ±33,736
24 gpt-5.2 1438±6 ±12,759
25 gpt-5.2-high 1436±6 ±16,100
26 gpt-5-high 1434±5 ±32,619
27 qwen3-max-preview 1434±5 ±27,842
28 o3-2025-04-16 1432±4 ±61,351
29 grok-4-1-fast-reasoning 1431±4 ±28,142
30 kimi-k2-thinking-turbo 1429±4 ±33,118
31 gpt-5-chat 1426±4 ±31,821
32 glm-4.6 1425±4 ±35,331
33 qwen3-max-2025-09-23 1425±6 ±9,223
34 claude-opus-4-20250514-thinking-16k 1424±4 ±37,973
35 deepseek-v3.2-exp 1423±6 ±11,762
36 deepseek-v3.2-exp-thinking 1423±7 ±8,999
37 qwen3-235b-a22b-instruct-2507 1422±3 ±69,176
38 grok-4-fast-chat 1422±8 ±6,989
39 deepseek-v3.2 1421±5 ±27,695
40 deepseek-v3.2-thinking 1421±5 ±22,732
41 deepseek-r1-0528 1418±6 ±19,289
42 ernie-5.0-preview-1022 1418±9 ±4,616
43 deepseek-v3.1 1418±6 ±15,298
44 kimi-k2-0905-preview 1417±6 ±11,973
45 deepseek-v3.1-thinking 1417±7 ±11,985
46 kimi-k2-0711-preview 1417±5 ±28,662
47 mistral-large-3 1416±5 ±24,050
48 deepseek-v3.1-terminus 1416±10 ±3,761
49 deepseek-v3.1-terminus-thinking 1415±10 ±3,548
50 qwen3-vl-235b-a22b-instruct 1415±6 ±11,684
51 gpt-4.1-2025-04-14 1413±4 ±52,211
52 claude-opus-4-20250514 1413±4 ±45,573
53 grok-3-preview-02-24 1411±4 ±33,970
54 mistral-medium-2508 1411±3 ±63,013
55 gemini-2.5-flash 1410±3 ±94,114
56 glm-4.5 1410±5 ±24,792
57 grok-4-0709 1410±4 ±42,143
58 gemini-2.5-flash-preview-09-2025 1405±4 ±32,875
59 claude-haiku-4-5-20251001 1404±4 ±44,511
60 grok-4-fast-reasoning 1403±5 ±18,637
61 o1-2024-12-17 1402±4 ±27,822
62 qwen3-235b-a22b-no-thinking 1401±4 ±39,546
63 qwen3-next-80b-a3b-instruct 1401±5 ±22,856
64 claude-sonnet-4-20250514-thinking-32k 1400±4 ±36,207
65 longcat-flash-chat 1399±6 ±11,548
66 qwen3-235b-a22b-thinking-2507 1398±6 ±9,254
67 deepseek-r1 1398±5 ±18,537
68 qwen3-vl-235b-a22b-thinking 1395±7 ±7,971
69 amazon-nova-experimental-chat-12-10 1395±10 ±3,718
70 mimo-v2-flash (non-thinking) 1394±5 ±16,651
71 deepseek-v3-0324 1394±4 ±46,690
72 hunyuan-vision-1.5-thinking 1393±12 ±2,225
73 mai-1-preview 1391±5 ±18,134
74 o4-mini-2025-04-16 1391±4 ±46,676
75 gpt-5-mini-high 1390±5 ±27,145
76 claude-sonnet-4-20250514 1390±4 ±41,645
77 claude-3-7-sonnet-20250219-thinking-32k 1388±4 ±39,889
78 o1-preview 1388±5 ±31,120
79 minimax-m2.1-preview 1387±5 ±16,114
80 hunyuan-t1-20250711 1387±9 ±4,801
81 qwen3-coder-480b-a35b-instruct 1386±5 ±26,652
82 step-3.5-flash 1386±8 ±5,571
83 mistral-medium-2505 1385±5 ±34,548
84 qwen3-30b-a3b-instruct-2507 1383±5 ±24,092
85 hunyuan-turbos-20250416 1382±6 ±11,053
86 gpt-4.1-mini-2025-04-14 1382±4 ±40,532
87 gemini-2.5-flash-lite-preview-09-2025-no-thinking 1380±4 ±47,414
88 glm-4.6v 1378±11 ±2,819
89 gemini-2.5-flash-lite-preview-06-17-thinking 1375±4 ±33,929
90 qwen3-235b-a22b 1375±5 ±27,164
91 qwen2.5-max 1374±4 ±33,294
92 claude-3-5-sonnet-20241022 1373±3 ±89,471
93 claude-3-7-sonnet-20250219 1372±4 ±44,434
94 glm-4.5-air 1371±4 ±31,382
95 qwen3-next-80b-a3b-thinking 1369±6 ±13,859
96 minimax-m1 1367±4 ±36,838
97 amazon-nova-experimental-chat-11-10 1367±5 ±16,393
98 gemma-3-27b-it 1365±4 ±48,694
99 o3-mini-high 1364±5 ±18,584
100 glm-4.7-flash 1363±7 ±7,376
101 grok-3-mini-high 1363±5 ±17,523
102 gemini-2.0-flash-001 1361±4 ±44,828
103 deepseek-v3 1358±5 ±21,788
104 grok-3-mini-beta 1357±5 ±23,765
105 intellect-3 1356±8 ±5,325
106 mistral-small-2506 1356±5 ±18,343
107 gpt-oss-120b 1354±4 ±30,963
108 gemini-2.0-flash-lite-preview-02-05 1353±4 ±24,951
109 glm-4.5v 1353±8 ±4,973
110 command-a-03-2025 1353±3 ±57,439
111 gemini-1.5-pro-002 1351±3 ±55,607
112 amazon-nova-experimental-chat-10-20 1349±6 ±11,439
113 hunyuan-turbos-20250226 1349±12 ±2,226
114 o3-mini 1348±3 ±58,638
115 amazon-nova-experimental-chat-10-09 1347±11 ±2,890
116 llama-3.1-nemotron-ultra-253b-v1 1347±12 ±2,546
117 qwen3-32b 1347±9 ±3,932
118 ling-flash-2.0 1347±7 ±7,038
119 minimax-m2 1347±8 ±6,742
120 step-3 1346±7 ±6,592
121 qwen-plus-0125 1346±8 ±5,823
122 gpt-4o-2024-05-13 1346±3 ±112,863
123 glm-4-plus-0111 1343±8 ±5,760
124 claude-3-5-sonnet-20240620 1343±3 ±82,417
125 gemma-3-12b-it 1342±10 ±3,829
126 nvidia-llama-3.3-nemotron-super-49b-v1.5 1341±10 ±3,432
127 hunyuan-turbo-0110 1340±12 ±2,295
128 gpt-5-nano-high 1338±7 ±8,399
129 nova-2-lite 1337±6 ±12,258
130 o1-mini 1337±4 ±51,986
131 qwq-32b 1336±4 ±26,118
132 llama-3.1-405b-instruct-bf16 1335±4 ±41,392
133 gpt-4o-2024-08-06 1335±4 ±45,498
134 grok-2-2024-08-13 1335±4 ±63,495
135 gemini-advanced-0514 1335±5 ±50,142
136 step-2-16k-exp-202412 1334±9 ±4,829
137 llama-3.1-405b-instruct-fp8 1334±4 ±59,655
138 olmo-3.1-32b-instruct 1331±6 ±11,634
139 yi-lightning 1329±5 ±27,340
140 qwen3-30b-a3b 1328±5 ±27,431
141 llama-4-maverick-17b-128e-instruct 1328±4 ±41,136
142 llama-3.3-nemotron-49b-super-v1 1327±12 ±2,230
143 hunyuan-large-2025-02-10 1326±10 ±3,738
144 gpt-4-turbo-2024-04-09 1324±4 ±98,130
145 claude-3-5-haiku-20241022 1324±3 ±71,183
146 deepseek-v2.5-1210 1323±8 ±6,793
147 gemini-1.5-pro-001 1323±4 ±79,132
148 llama-4-scout-17b-16e-instruct 1323±5 ±31,235
149 claude-3-opus-20240229 1322±3 ±194,904
150 gpt-4.1-nano-2025-04-14 1322±8 ±6,107
151 step-1o-turbo-202506 1321±7 ±9,687
152 ring-flash-2.0 1320±7 ±7,204
153 llama-3.3-70b-instruct 1320±3 ±55,576
154 glm-4-plus 1319±5 ±26,134
155 gemma-3n-e4b-it 1319±5 ±23,342
156 qwen-max-0919 1318±6 ±16,479
157 gpt-4o-mini-2024-07-18 1318±3 ±68,801
158 gpt-oss-20b 1317±6 ±10,810
159 nvidia-nemotron-3-nano-30b-a3b-bf16 1317±6 ±15,500
160 qwen2.5-plus-1127 1315±6 ±10,179
161 athene-v2-chat 1314±5 ±24,746
162 mistral-large-2407 1314±4 ±45,460
163 gpt-4-0125-preview 1313±4 ±93,439
164 gpt-4-1106-preview 1313±4 ±100,107
165 hunyuan-standard-2025-02-10 1312±10 ±3,905
166 mercury 1310±14 ±1,920
167 gemini-1.5-flash-002 1310±4 ±34,909
168 grok-2-mini-2024-08-13 1308±4 ±52,574
169 deepseek-v2.5 1307±5 ±24,574
170 athene-70b-0725 1306±6 ±19,622
171 olmo-3-32b-think 1306±8 ±5,892
172 mistral-large-2411 1305±4 ±28,081
173 magistral-medium-2506 1305±6 ±12,065
174 mistral-small-3.1-24b-instruct-2503 1305±4 ±34,130
175 gemma-3-4b-it 1303±9 ±4,177
176 qwen2.5-72b-instruct 1303±4 ±39,409
177 llama-3.1-nemotron-70b-instruct 1299±8 ±7,136
178 hunyuan-large-vision 1296±9 ±5,600
179 llama-3.1-70b-instruct 1294±4 ±55,234
180 amazon-nova-pro-v1.0 1290±5 ±24,753
181 jamba-1.5-large 1289±7 ±8,659
182 gemma-2-27b-it 1288±3 ±75,764
183 ibm-granite-h-small 1288±8 ±5,682
184 reka-core-20240904 1288±7 ±7,309
185 gpt-4-0314 1287±5 ±54,167
186 llama-3.1-tulu-3-70b 1287±10 ±2,846
187 llama-3.1-nemotron-51b-instruct 1287±10 ±3,749
188 olmo-3.1-32b-think 1286±7 ±8,479
189 gemini-1.5-flash-001 1286±4 ±62,823
190 claude-3-sonnet-20240229 1281±4 ±109,289
191 gemma-2-9b-it-simpo 1280±7 ±10,069
192 nemotron-4-340b-instruct 1278±5 ±19,661
193 command-r-plus-08-2024 1277±7 ±9,869
194 llama-3-70b-instruct 1276±4 ±156,880
195 gpt-4-0613 1276±4 ±88,721
196 mistral-small-24b-instruct-2501 1274±6 ±14,677
197 glm-4-0520 1274±7 ±9,788
198 reka-flash-20240904 1272±7 ±7,537
199 qwen2.5-coder-32b-instruct 1271±8 ±5,430
200 c4ai-aya-expanse-32b 1267±5 ±27,123
201 gemma-2-9b-it 1266±4 ±54,615
202 deepseek-coder-v2 1264±6 ±15,147
203 command-r-plus 1262±4 ±77,556
204 qwen2-72b-instruct 1262±5 ±37,325
205 claude-3-haiku-20240307 1261±4 ±117,705
206 amazon-nova-lite-v1.0 1261±5 ±19,376
207 gemini-1.5-flash-8b-001 1259±4 ±35,556
208 phi-4 1256±5 ±24,126
209 olmo-2-0325-32b-instruct 1252±11 ±3,335
210 command-r-08-2024 1251±7 ±10,141
211 mistral-large-2402 1243±5 ±62,437
212 amazon-nova-micro-v1.0 1241±5 ±19,355
213 jamba-1.5-mini 1239±7 ±8,854
214 ministral-8b-2410 1237±9 ±4,780
215 gemini-pro-dev-api 1235±7 ±18,352
216 qwen1.5-110b-chat 1234±6 ±26,191
217 reka-flash-21b-20240226-online 1234±7 ±15,451
218 qwen1.5-72b-chat 1234±5 ±39,296
219 hunyuan-standard-256k 1233±12 ±2,729
220 mixtral-8x22b-instruct-v0.1 1230±5 ±51,417
221 command-r 1227±5 ±54,038
222 reka-flash-21b-20240226 1227±6 ±24,806
223 gpt-3.5-turbo-0125 1224±5 ±66,191
224 llama-3-8b-instruct 1223±4 ±104,636
225 mistral-medium 1223±6 ±34,552
226 c4ai-aya-expanse-8b 1223±7 ±9,827
227 gemini-pro 1222±12 ±6,390
228 llama-3.1-tulu-3-8b 1221±11 ±2,895
229 yi-1.5-34b-chat 1214±5 ±24,142
230 zephyr-orpo-141b-A35b-v0.1 1213±11 ±4,653
231 llama-3.1-8b-instruct 1212±4 ±49,605
232 granite-3.1-8b-instruct 1209±11 ±3,092
233 qwen1.5-32b-chat 1204±6 ±21,744
234 gpt-3.5-turbo-1106 1203±9 ±16,616
235 gemma-2-2b-it 1199±4 ±46,618
236 phi-3-medium-4k-instruct 1198±5 ±25,055
237 mixtral-8x7b-instruct-v0.1 1198±4 ±73,505
238 dbrx-instruct-preview 1195±6 ±32,196
239 internlm2_5-20b-chat 1191±7 ±9,902
240 qwen1.5-14b-chat 1191±7 ±17,841
241 wizardlm-70b 1185±9 ±8,214
242 deepseek-llm-67b-chat 1185±12 ±4,933
243 yi-34b-chat 1184±7 ±15,483
244 openchat-3.5-0106 1183±8 ±12,636
245 openchat-3.5 1183±10 ±7,967
246 granite-3.0-8b-instruct 1182±9 ±6,643
247 gemma-1.1-7b-it 1180±6 ±23,893
248 snowflake-arctic-instruct 1180±6 ±32,836
249 granite-3.1-2b-instruct 1180±11 ±3,191
250 tulu-2-dpo-70b 1178±10 ±6,534
251 openhermes-2.5-mistral-7b 1176±10 ±5,006
252 vicuna-33b 1173±6 ±22,479
253 starling-lm-7b-beta 1172±7 ±16,057
254 phi-3-small-8k-instruct 1172±6 ±17,763
255 llama-2-70b-chat 1171±6 ±38,491
256 starling-lm-7b-alpha 1168±8 ±10,224
257 llama-3.2-3b-instruct 1167±8 ±7,936
258 nous-hermes-2-mixtral-8x7b-dpo 1165±12 ±3,776
259 qwq-32b-preview 1157±12 ±3,233
260 granite-3.0-2b-instruct 1156±8 ±6,837
261 llama2-70b-steerlm-chat 1156±13 ±3,584
262 solar-10.7b-instruct-v1.0 1153±13 ±4,155
263 dolphin-2.2.1-mistral-7b 1152±15 ±1,679
264 mpt-30b-chat 1151±12 ±2,571
265 mistral-7b-instruct-v0.2 1150±7 ±19,402
266 wizardlm-13b 1150±9 ±7,046
267 falcon-180b-chat 1147±17 ±1,295
268 qwen1.5-7b-chat 1144±10 ±4,735
269 phi-3-mini-4k-instruct-june-2024 1143±6 ±12,296
270 llama-2-13b-chat 1142±7 ±19,171
271 vicuna-13b 1141±7 ±19,366
272 qwen-14b-chat 1139±11 ±4,964
273 palm-2 1138±9 ±8,554
274 codellama-34b-instruct 1137±9 ±7,363
275 gemma-7b-it 1136±10 ±8,925
276 zephyr-7b-beta 1131±9 ±11,116
277 phi-3-mini-128k-instruct 1130±7 ±20,691
278 phi-3-mini-4k-instruct 1129±6 ±20,115
279 guanaco-33b 1128±12 ±2,921
280 zephyr-7b-alpha 1127±16 ±1,785
281 stripedhyena-nous-7b 1121±11 ±5,184
282 codellama-70b-instruct 1119±18 ±1,143
283 vicuna-7b 1115±9 ±6,923
284 smollm2-1.7b-instruct 1115±14 ±2,201
285 gemma-1.1-2b-it 1114±8 ±10,853
286 llama-3.2-1b-instruct 1112±8 ±8,045
287 mistral-7b-instruct 1110±9 ±8,977
288 llama-2-7b-chat 1108±7 ±14,148
289 gemma-2b-it 1092±12 ±4,779
290 qwen1.5-4b-chat 1091±9 ±7,598
291 olmo-7b-instruct 1075±11 ±6,329
292 koala-13b 1071±10 ±6,964
293 alpaca-13b 1068±12 ±5,745
294 gpt4all-13b-snoozy 1066±15 ±1,743
295 mpt-7b-chat 1062±12 ±3,925
296 chatglm3-6b 1056±12 ±4,658
297 RWKV-4-Raven-14B 1042±11 ±4,845
298 chatglm2-6b 1025±14 ±2,657
299 oasst-pythia-12b 1023±11 ±6,311
300 chatglm-6b 996±13 ±4,914
301 fastchat-t5-3b 992±12 ±4,203
302 dolly-v2-12b 981±14 ±3,412
303 llama-13b 973±16 ±2,391
304 stablelm-tuned-alpha-7b 953±13 ±3,287
The turing test (or imitation game) tries to distinguish machine from human. © 2026 TuringScore
Feedback? Click here.