████████╗██╗   ██╗██████╗ ██╗███╗   ██╗ ██████╗ ███████╗ ██████╗ ██████╗ ██████╗ ███████╗    ██████╗ ██████╗ ███╗   ███╗
╚══██╔══╝██║   ██║██╔══██╗██║████╗  ██║██╔════╝ ██╔════╝██╔════╝██╔═══██╗██╔══██╗██╔════╝   ██╔════╝██╔═══██╗████╗ ████║
   ██║   ██║   ██║██████╔╝██║██╔██╗ ██║██║  ███╗███████╗██║     ██║   ██║██████╔╝█████╗     ██║     ██║   ██║██╔████╔██║
   ██║   ██║   ██║██╔══██╗██║██║╚██╗██║██║   ██║╚════██║██║     ██║   ██║██╔══██╗██╔══╝     ██║     ██║   ██║██║╚██╔╝██║
   ██║   ╚██████╔╝██║  ██║██║██║ ╚████║╚██████╔╝███████║╚██████╗╚██████╔╝██║  ██║███████╗██╗╚██████╗╚██████╔╝██║ ╚═╝ ██║
   ╚═╝    ╚═════╝ ╚═╝  ╚═╝╚═╝╚═╝  ╚═══╝ ╚═════╝ ╚══════╝ ╚═════╝ ╚═════╝ ╚═╝  ╚═╝╚══════╝╚═╝ ╚═════╝ ╚═════╝ ╚═╝     ╚═╝

Figure out which LLM is "best". Data from LMSYS's Chatbot Arena and 3DTopia's 3DGen Leaderboard.

Data last updated: May 13, 2026, 4:19 AM

Rank Model Elo
1 claude-opus-4-6-thinking 1502±5
2 claude-opus-4-7-thinking 1501±6
3 claude-opus-4-6 1498±4
4 claude-opus-4-7 1492±6
5 muse-spark 1491±6 (Preliminary)
6 gemini-3.1-pro-preview 1490±4
7 gemini-3-pro 1486±4
8 gpt-5.5-high 1484±7
9 grok-4.20-beta1 1479±5
10 gpt-5.4-high 1479±5
11 gpt-5.2-chat-latest-20260210 1477±4
12 grok-4.20-beta-0309-reasoning 1477±5
13 gpt-5.5 1476±7
14 grok-4.20-multi-agent-beta-0309 1474±5
15 gemini-3-flash 1474±4
16 ernie-5.1 1473±7 (Preliminary)
18 gpt-5.5-instant 1472±8
19 glm-5.1 1471±6
20 claude-opus-4-5-20251101 1468±3
21 grok-4.1-thinking 1467±3
22 claude-sonnet-4-6 1467±5
23 gpt-5.4 1467±5
24 mimo-v2.5-pro 1465±7
25 qwen3.5-max-preview 1465±5
26 gemini-3-flash (thinking-minimal) 1463±4
27 deepseek-v4-pro-thinking 1460±7
28 grok-4.1 1460±3
29 kimi-k2.6 1459±7
30 deepseek-v4-pro 1458±7
31 dola-seed-2.0-pro 1457±4
32 qwen3.6-max-preview 1456±9
33 glm-5 1456±5
34 gpt-5.4-mini-high 1456±5
35 gpt-5.1-high 1455±4
36 grok-4.3 1455±7
38 claude-sonnet-4-5-20250929 1453±3
39 gemma-4-31b 1451±8
40 ernie-5.0-0110 1450±4
41 gpt-5.3-chat-latest 1450±5
42 ernie-5.0-preview-1203 1449±7
43 kimi-k2.5-thinking 1449±4
45 qwen3.6-plus 1448±6
46 mimo-v2-pro 1447±5
47 claude-opus-4-1-20250805 1447±3
48 gemini-2.5-pro 1447±3
49 qwen3.5-397b-a17b 1445±5
50 gpt-4.5-preview-2025-02-27 1444±6
51 chatgpt-4o-latest-20250326 1443±3
52 glm-4.7 1443±6
53 gpt-5.2-high 1441±4
54 gpt-5.1 1439±4
56 gemma-4-26b-a4b 1438±8
57 gemini-3.1-flash-lite-preview 1438±5
58 gpt-5.2 1437±4
59 qwen3-max-preview 1435±5
60 longcat-flash-chat-2602-exp 1435±5
61 deepseek-v4-flash 1434±7
62 gpt-5-high 1434±5
63 kimi-k2.5-instant 1432±7
64 grok-4-1-fast-reasoning 1431±3
65 o3-2025-04-16 1431±4
66 kimi-k2-thinking-turbo 1430±3
68 gpt-5-chat 1426±4
69 glm-4.6 1426±4
70 mimo-v2.5 1425±7
72 qwen3-max-2025-09-23 1424±6
73 claude-opus-4-20250514-thinking-16k 1424±4
74 deepseek-v3.2 1424±4
75 deepseek-v3.2-exp 1423±6
76 qwen3-235b-a22b-instruct-2507 1423±3
77 deepseek-r1-0528 1422±6
78 deepseek-v3.2-thinking 1422±4
79 grok-4-fast-chat 1421±8
80 ernie-5.0-preview-1022 1419±9
81 qwen3.5-122b-a10b 1418±5
82 kimi-k2-0905-preview 1418±6
83 deepseek-v3.1 1418±6
85 kimi-k2-0711-preview 1417±5
86 hunyuan-hy3-preview 1417±8
87 deepseek-v3.1-thinking 1417±7
88 deepseek-v3.1-terminus 1416±10
89 qwen3-vl-235b-a22b-instruct 1415±6
91 mistral-large-3 1415±4
92 gpt-4.1-2025-04-14 1413±4
93 claude-opus-4-20250514 1412±4
94 grok-3-preview-02-24 1412±4
95 glm-4.5 1411±5
96 gemini-2.5-flash 1411±3
97 grok-4-0709 1410±4
98 claude-haiku-4-5-20251001 1410±3
99 mistral-medium-2508 1410±3
100 minimax-m2.7 1408±5
101 gpt-5.4-nano-high 1408±5
102 qwen3.5-27b 1406±5
103 gemini-2.5-flash-preview-09-2025 1405±4
104 grok-4-fast-reasoning 1404±5
105 qwen3-235b-a22b-no-thinking 1403±5
106 o1-2024-12-17 1402±4
107 qwen3-next-80b-a3b-instruct 1402±5
108 longcat-flash-chat 1401±6
109 qwen3-235b-a22b-thinking-2507 1399±7
110 claude-sonnet-4-20250514-thinking-32k 1399±4
111 deepseek-r1 1398±5
112 qwen3.5-flash 1397±5
113 qwen3.5-35b-a3b 1397±5
114 hunyuan-vision-1.5-thinking 1396±12
115 qwen3-vl-235b-a22b-thinking 1396±7
116 deepseek-v3-0324 1395±4
117 amazon-nova-experimental-chat-12-10 1395±10
118 step-3.5-flash 1394±4
119 minimax-m2.5 1393±4
121 mai-1-preview 1393±5
122 gpt-5-mini-high 1390±5
123 o4-mini-2025-04-16 1390±4
124 claude-sonnet-4-20250514 1389±4
125 o1-preview 1388±5
126 mimo-v2-flash (thinking) 1387±6
127 qwen3-coder-480b-a35b-instruct 1387±5
128 hunyuan-t1-20250711 1387±9
130 mistral-medium-2505 1386±5
131 minimax-m2.1-preview 1385±5
132 qwen3-30b-a3b-instruct-2507 1383±5
133 hunyuan-turbos-20250416 1382±6
134 gpt-4.1-mini-2025-04-14 1382±4
136 glm-4.6v 1378±11
137 trinity-large-thinking 1376±6
138 trinity-large-preview 1375±5
139 qwen3-235b-a22b 1375±5
141 qwen2.5-max 1374±4
142 glm-4.5-air 1373±4
143 claude-3-5-sonnet-20241022 1372±3
144 claude-3-7-sonnet-20250219 1371±4
145 qwen3-next-80b-a3b-thinking 1369±6
146 glm-4.7-flash 1368±6
147 amazon-nova-experimental-chat-11-10 1367±4
148 gemma-3-27b-it 1366±4
149 minimax-m1 1364±4
150 o3-mini-high 1363±5
151 grok-3-mini-high 1362±5
152 nvidia-nemotron-3-super-120b-a12b 1361±7
153 gemini-2.0-flash-001 1360±4
154 deepseek-v3 1358±5
155 mistral-small-2506 1357±5
156 grok-3-mini-beta 1357±5
157 intellect-3 1357±8
158 command-a-03-2025 1354±3
159 glm-4.5v 1353±8
160 gemini-2.0-flash-lite-preview-02-05 1353±4
161 gpt-oss-120b 1353±4
162 gemini-1.5-pro-002 1351±3
163 amazon-nova-experimental-chat-10-20 1350±6
164 hunyuan-turbos-20250226 1348±12
165 step-3 1348±7
166 amazon-nova-experimental-chat-10-09 1348±11
167 o3-mini 1347±4
168 qwen3-32b 1347±9
169 llama-3.1-nemotron-ultra-253b-v1 1347±12
170 mercury-2 1347±11
171 ling-flash-2.0 1346±7
172 minimax-m2 1346±8
173 qwen-plus-0125 1346±8
174 gpt-4o-2024-05-13 1345±3
176 glm-4-plus-0111 1343±8
177 claude-3-5-sonnet-20240620 1342±3
178 gemma-3-12b-it 1342±10
179 hunyuan-turbo-0110 1340±12
180 gpt-5-nano-high 1337±7
181 nova-2-lite 1337±6
182 o1-mini 1337±4
183 qwq-32b 1336±4
184 grok-2-2024-08-13 1335±4
185 gemini-advanced-0514 1335±5
186 gpt-4o-2024-08-06 1335±4
187 llama-3.1-405b-instruct-bf16 1334±4
188 step-2-16k-exp-202412 1334±9
189 llama-3.1-405b-instruct-fp8 1333±4
190 olmo-3.1-32b-instruct 1331±6
191 molmo-2-8b 1328±21
192 yi-lightning 1328±5
193 llama-3.3-nemotron-49b-super-v1 1328±12
194 qwen3-30b-a3b 1327±5
196 hunyuan-large-2025-02-10 1326±10
197 gpt-4-turbo-2024-04-09 1324±4
198 deepseek-v2.5-1210 1323±8
199 claude-3-5-haiku-20241022 1323±3
200 gemini-1.5-pro-001 1323±4
202 gpt-4.1-nano-2025-04-14 1322±8
203 claude-3-opus-20240229 1321±3
204 ring-flash-2.0 1321±7
205 step-1o-turbo-202506 1320±7
206 glm-4-plus 1319±5
207 llama-3.3-70b-instruct 1318±3
208 gemma-3n-e4b-it 1318±5
209 qwen-max-0919 1318±6
210 gpt-4o-mini-2024-07-18 1317±4
211 gpt-oss-20b 1317±6
212 nvidia-nemotron-3-nano-30b-a3b-bf16 1317±6
213 qwen2.5-plus-1127 1315±6
214 athene-v2-chat 1314±5
215 mistral-large-2407 1314±4
216 gpt-4-0125-preview 1312±4
217 gpt-4-1106-preview 1312±4
218 hunyuan-standard-2025-02-10 1311±10
219 gemini-1.5-flash-002 1309±4
220 grok-2-mini-2024-08-13 1308±4
221 deepseek-v2.5 1307±5
222 mercury 1306±14
223 athene-70b-0725 1306±6
224 olmo-3-32b-think 1305±8
225 mistral-large-2411 1305±4
226 magistral-medium-2506 1304±6
227 mistral-small-3.1-24b-instruct-2503 1303±5
228 gemma-3-4b-it 1303±9
229 qwen2.5-72b-instruct 1302±4
230 llama-3.1-nemotron-70b-instruct 1299±8
231 hunyuan-large-vision 1294±9
232 llama-3.1-70b-instruct 1293±4
233 amazon-nova-pro-v1.0 1290±5
234 jamba-1.5-large 1288±7
235 gemma-2-27b-it 1288±3
236 reka-core-20240904 1287±7
237 ibm-granite-h-small 1286±8
238 gpt-4-0314 1286±5
239 llama-3.1-tulu-3-70b 1286±10
240 gemini-1.5-flash-001 1285±4
241 llama-3.1-nemotron-51b-instruct 1285±10
242 olmo-3.1-32b-think 1285±7
243 claude-3-sonnet-20240229 1280±4
244 gemma-2-9b-it-simpo 1279±7
245 nemotron-4-340b-instruct 1276±5
246 command-r-plus-08-2024 1276±7
247 llama-3-70b-instruct 1275±4
248 gpt-4-0613 1274±4
249 mistral-small-24b-instruct-2501 1274±6
250 glm-4-0520 1273±7
251 reka-flash-20240904 1271±7
252 qwen2.5-coder-32b-instruct 1270±8
253 c4ai-aya-expanse-32b 1266±5
254 gemma-2-9b-it 1266±4
255 deepseek-coder-v2 1264±6
256 command-r-plus 1261±4
257 qwen2-72b-instruct 1261±5
258 claude-3-haiku-20240307 1260±4
259 amazon-nova-lite-v1.0 1260±5
260 gemini-1.5-flash-8b-001 1258±4
261 phi-4 1256±5
262 olmo-2-0325-32b-instruct 1251±11
263 command-r-08-2024 1249±7
264 mistral-large-2402 1241±5
265 amazon-nova-micro-v1.0 1240±5
266 jamba-1.5-mini 1239±7
267 ministral-8b-2410 1237±9
268 gemini-pro-dev-api 1235±7
269 qwen1.5-110b-chat 1233±6
270 hunyuan-standard-256k 1233±12
271 reka-flash-21b-20240226-online 1232±7
272 qwen1.5-72b-chat 1232±5
273 mixtral-8x22b-instruct-v0.1 1228±5
274 command-r 1226±5
275 reka-flash-21b-20240226 1226±6
276 gpt-3.5-turbo-0125 1223±5
277 llama-3-8b-instruct 1223±4
278 c4ai-aya-expanse-8b 1222±7
279 mistral-medium 1222±6
280 gemini-pro 1221±12
281 llama-3.1-tulu-3-8b 1220±11
282 yi-1.5-34b-chat 1212±5
283 zephyr-orpo-141b-A35b-v0.1 1212±11
284 llama-3.1-8b-instruct 1211±4
285 granite-3.1-8b-instruct 1207±11
286 qwen1.5-32b-chat 1203±6
287 gpt-3.5-turbo-1106 1202±9
288 gemma-2-2b-it 1199±4
289 phi-3-medium-4k-instruct 1197±5
290 mixtral-8x7b-instruct-v0.1 1196±4
291 dbrx-instruct-preview 1194±6
292 internlm2_5-20b-chat 1190±7
293 qwen1.5-14b-chat 1190±7
294 wizardlm-70b 1184±9
295 deepseek-llm-67b-chat 1183±12
296 yi-34b-chat 1183±7
297 openchat-3.5-0106 1181±8
298 granite-3.0-8b-instruct 1181±9
299 openchat-3.5 1181±10
300 gemma-1.1-7b-it 1180±6
301 snowflake-arctic-instruct 1178±6
302 granite-3.1-2b-instruct 1178±11
303 tulu-2-dpo-70b 1177±10
304 openhermes-2.5-mistral-7b 1174±10
305 vicuna-33b 1172±6
306 starling-lm-7b-beta 1171±7
307 phi-3-small-8k-instruct 1170±6
308 llama-2-70b-chat 1170±6
309 starling-lm-7b-alpha 1166±8
310 llama-3.2-3b-instruct 1166±8
311 nous-hermes-2-mixtral-8x7b-dpo 1164±12
312 qwq-32b-preview 1155±12
313 granite-3.0-2b-instruct 1155±8
314 llama2-70b-steerlm-chat 1154±13
315 solar-10.7b-instruct-v1.0 1151±13
316 dolphin-2.2.1-mistral-7b 1151±15
317 mpt-30b-chat 1149±12
318 mistral-7b-instruct-v0.2 1148±7
319 wizardlm-13b 1148±9
320 falcon-180b-chat 1146±17
321 qwen1.5-7b-chat 1143±10
323 llama-2-13b-chat 1141±7
324 vicuna-13b 1140±7
325 qwen-14b-chat 1137±11
326 palm-2 1136±9
327 gemma-7b-it 1136±10
328 codellama-34b-instruct 1136±9
329 zephyr-7b-beta 1130±9
330 phi-3-mini-128k-instruct 1128±7
331 phi-3-mini-4k-instruct 1127±6
332 guanaco-33b 1126±12
333 zephyr-7b-alpha 1126±16
334 stripedhyena-nous-7b 1120±11
335 codellama-70b-instruct 1118±18
336 gemma-1.1-2b-it 1114±8
337 vicuna-7b 1114±9
338 smollm2-1.7b-instruct 1113±14
339 llama-3.2-1b-instruct 1110±8
340 mistral-7b-instruct 1109±9
341 llama-2-7b-chat 1107±7
342 gemma-2b-it 1091±12
343 qwen1.5-4b-chat 1089±9
344 olmo-7b-instruct 1073±11
345 koala-13b 1069±10
346 alpaca-13b 1067±11
347 gpt4all-13b-snoozy 1065±15
348 mpt-7b-chat 1061±12
349 chatglm3-6b 1055±12
350 RWKV-4-Raven-14B 1040±11
351 chatglm2-6b 1023±14
352 oasst-pythia-12b 1021±11
353 chatglm-6b 994±13
354 fastchat-t5-3b 990±12
355 dolly-v2-12b 979±14
356 llama-13b 972±16
357 stablelm-tuned-alpha-7b 952±13
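
The Elo column above can be turned into a head-to-head estimate. The sketch below is illustrative only and rests on assumptions the page does not state: that ratings follow the standard Elo logistic curve (base 10, 400-point scale), and that the number after ± is a symmetric uncertainty margin. The helper names (`parse_row`, `expected_win_rate`, `margins_overlap`) are hypothetical and not part of any leaderboard API.

```python
import re

# One leaderboard row: "<rank> <model> <elo>±<margin>[ (Preliminary)]".
# Model names may contain spaces, e.g. "gemini-3-flash (thinking-minimal)".
ROW = re.compile(r"^(\d+)\s+(.+?)\s+(\d+)±(\d+)(\s+\(Preliminary\))?$")


def parse_row(line):
    """Parse one row into a dict, or return None if it is not a data row."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    rank, model, elo, margin, prelim = m.groups()
    return {
        "rank": int(rank),
        "model": model,
        "elo": int(elo),
        "margin": int(margin),
        "preliminary": prelim is not None,
    }


def expected_win_rate(elo_a, elo_b, scale=400.0):
    """Expected score of A against B under the standard Elo curve.

    ASSUMPTION: base 10 and a 400-point scale; the page does not say
    which parameters its ratings use.
    """
    return 1.0 / (1.0 + 10.0 ** ((elo_b - elo_a) / scale))


def margins_overlap(a, b):
    """Rough check: do the two ± ranges overlap?

    ASSUMPTION: ± is a symmetric margin. This is a heuristic for
    "too close to call", not a significance test.
    """
    return abs(a["elo"] - b["elo"]) <= a["margin"] + b["margin"]


first = parse_row("1 claude-opus-4-6-thinking 1502±5")
second = parse_row("2 claude-opus-4-7-thinking 1501±6")
last = parse_row("357 stablelm-tuned-alpha-7b 952±13")

print(expected_win_rate(first["elo"], second["elo"]))
print(expected_win_rate(first["elo"], last["elo"]))
print(margins_overlap(first, second))
```

Under these assumptions, the one-point gap between ranks 1 and 2 gives an expected win rate barely above 0.5 and their ± ranges overlap, so the ordering at the top should be read loosely. The 550-point gap between rank 1 and rank 357 corresponds to roughly 0.96.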
The Turing test (or imitation game) tries to distinguish machine from human. © 2026 TuringScore