████████╗██╗   ██╗██████╗ ██╗███╗   ██╗ ██████╗ ███████╗ ██████╗ ██████╗ ██████╗ ███████╗    ██████╗ ██████╗ ███╗   ███╗
╚══██╔══╝██║   ██║██╔══██╗██║████╗  ██║██╔════╝ ██╔════╝██╔════╝██╔═══██╗██╔══██╗██╔════╝   ██╔════╝██╔═══██╗████╗ ████║
   ██║   ██║   ██║██████╔╝██║██╔██╗ ██║██║  ███╗███████╗██║     ██║   ██║██████╔╝█████╗     ██║     ██║   ██║██╔████╔██║
   ██║   ██║   ██║██╔══██╗██║██║╚██╗██║██║   ██║╚════██║██║     ██║   ██║██╔══██╗██╔══╝     ██║     ██║   ██║██║╚██╔╝██║
   ██║   ╚██████╔╝██║  ██║██║██║ ╚████║╚██████╔╝███████║╚██████╗╚██████╔╝██║  ██║███████╗██╗╚██████╗╚██████╔╝██║ ╚═╝ ██║
   ╚═╝    ╚═════╝ ╚═╝  ╚═╝╚═╝╚═╝  ╚═══╝ ╚═════╝ ╚══════╝ ╚═════╝ ╚═════╝ ╚═╝  ╚═╝╚══════╝╚═╝ ╚═════╝ ╚═════╝ ╚═╝     ╚═╝

Figure out which LLM is "best". Data from LMSYS's Chatbot Arena. Click here to learn more.

Rank Model Elo
1 gemini-3-pro 1487 ±5
2 grok-4.1-thinking 1475 ±5
3 gemini-3-flash 1471 ±5
4 claude-opus-4-5-20251101-thinking-32k 1468 ±5
5 claude-opus-4-5-20251101 1466 ±5
6 grok-4.1 1466 ±4
7 gemini-3-flash (thinking-minimal) 1463 ±6
8 gpt-5.1-high 1459 ±5
9 ernie-5.0-0110 1453 ±7 (Preliminary)
10 claude-sonnet-4-5-20250929-thinking-32k 1450 ±4
11 claude-sonnet-4-5-20250929 1450 ±4
12 kimi-k2.5-thinking 1450 ±9
13 gemini-2.5-pro 1450 ±3
14 ernie-5.0-preview-1203 1449 ±7
15 claude-opus-4-1-20250805-thinking-16k 1448 ±4
16 claude-opus-4-1-20250805 1445 ±3
17 gpt-4.5-preview-2025-02-27 1444 ±6
18 chatgpt-4o-latest-20250326 1443 ±3
19 glm-4.7 1440 ±6
20 gpt-5.2 1440 ±7
21 gpt-5.2-high 1440 ±7
22 gpt-5.1 1435 ±5
23 gpt-5-high 1435 ±5
24 qwen3-max-preview 1434 ±5
25 o3-2025-04-16 1433 ±4
26 grok-4-1-fast-reasoning 1430 ±5
27 kimi-k2-thinking-turbo 1429 ±4
28 gpt-5-chat 1426 ±4
29 qwen3-max-2025-09-23 1425 ±6
30 glm-4.6 1425 ±4
31 claude-opus-4-20250514-thinking-16k 1424 ±4
32 deepseek-v3.2-exp 1423 ±6
33 deepseek-v3.2-exp-thinking 1423 ±7
34 qwen3-235b-a22b-instruct-2507 1422 ±3
35 grok-4-fast-chat 1422 ±8
36 deepseek-v3.2-thinking 1420 ±5
37 deepseek-v3.2 1420 ±5
38 deepseek-r1-0528 1418 ±6
39 kimi-k2-0905-preview 1418 ±6
40 ernie-5.0-preview-1022 1418 ±9
41 kimi-k2-0711-preview 1417 ±5
42 deepseek-v3.1 1417 ±6
43 deepseek-v3.1-thinking 1417 ±7
44 deepseek-v3.1-terminus 1416 ±10
45 deepseek-v3.1-terminus-thinking 1416 ±10
46 qwen3-vl-235b-a22b-instruct 1415 ±6
47 mistral-large-3 1414 ±5
48 claude-opus-4-20250514 1414 ±4
49 gpt-4.1-2025-04-14 1413 ±4
50 mistral-medium-2508 1412 ±3
51 grok-3-preview-02-24 1411 ±4
52 grok-4-0709 1410 ±4
53 glm-4.5 1410 ±5
54 gemini-2.5-flash 1409 ±3
55 gemini-2.5-flash-preview-09-2025 1405 ±4
56 grok-4-fast-reasoning 1404 ±5
57 claude-haiku-4-5-20251001 1403 ±4
58 o1-2024-12-17 1402 ±4
59 qwen3-235b-a22b-no-thinking 1401 ±4
60 qwen3-next-80b-a3b-instruct 1401 ±5
61 claude-sonnet-4-20250514-thinking-32k 1400 ±4
62 longcat-flash-chat 1399 ±6
63 qwen3-235b-a22b-thinking-2507 1398 ±6
64 deepseek-r1 1398 ±5
65 amazon-nova-experimental-chat-12-10 1395 ±10
66 qwen3-vl-235b-a22b-thinking 1395 ±7
67 mimo-v2-flash (non-thinking) 1395 ±6
68 deepseek-v3-0324 1394 ±4
69 hunyuan-vision-1.5-thinking 1393 ±12
70 mai-1-preview 1391 ±5
71 o4-mini-2025-04-16 1391 ±4
72 gpt-5-mini-high 1390 ±5
73 claude-sonnet-4-20250514 1390 ±4
74 claude-3-7-sonnet-20250219-thinking-32k 1389 ±4
75 o1-preview 1388 ±5
76 hunyuan-t1-20250711 1387 ±9
77 qwen3-coder-480b-a35b-instruct 1387 ±5
78 mistral-medium-2505 1384 ±5
79 qwen3-30b-a3b-instruct-2507 1383 ±5
80 minimax-m2.1-preview 1383 ±6
81 hunyuan-turbos-20250416 1382 ±6
82 gpt-4.1-mini-2025-04-14 1382 ±4
83 gemini-2.5-flash-lite-preview-09-2025-no-thinking 1379 ±4
84 glm-4.6v 1378 ±11
85 gemini-2.5-flash-lite-preview-06-17-thinking 1375 ±4
86 qwen3-235b-a22b 1374 ±5
87 qwen2.5-max 1374 ±4
88 claude-3-5-sonnet-20241022 1373 ±3
89 claude-3-7-sonnet-20250219 1372 ±4
90 glm-4.5-air 1371 ±4
91 qwen3-next-80b-a3b-thinking 1369 ±6
92 minimax-m1 1367 ±4
93 amazon-nova-experimental-chat-11-10 1367 ±6
94 gemma-3-27b-it 1365 ±4
95 o3-mini-high 1364 ±5
96 grok-3-mini-high 1362 ±5
97 glm-4.7-flash 1362 ±9
98 gemini-2.0-flash-001 1361 ±4
99 deepseek-v3 1359 ±5
100 grok-3-mini-beta 1357 ±5
101 mistral-small-2506 1356 ±5
102 intellect-3 1356 ±8
103 gpt-oss-120b 1354 ±4
104 gemini-2.0-flash-lite-preview-02-05 1353 ±4
105 glm-4.5v 1353 ±8
106 command-a-03-2025 1353 ±3
107 gemini-1.5-pro-002 1352 ±3
108 hunyuan-turbos-20250226 1349 ±12
109 o3-mini 1348 ±3
110 amazon-nova-experimental-chat-10-20 1348 ±6
111 llama-3.1-nemotron-ultra-253b-v1 1347 ±12
112 amazon-nova-experimental-chat-10-09 1347 ±11
113 qwen3-32b 1347 ±9
114 minimax-m2 1347 ±8
115 ling-flash-2.0 1346 ±7
116 step-3 1346 ±7
117 qwen-plus-0125 1346 ±8
118 gpt-4o-2024-05-13 1346 ±3
119 glm-4-plus-0111 1343 ±8
120 claude-3-5-sonnet-20240620 1343 ±3
121 gemma-3-12b-it 1342 ±10
122 nvidia-llama-3.3-nemotron-super-49b-v1.5 1341 ±10
123 hunyuan-turbo-0110 1340 ±12
124 gpt-5-nano-high 1338 ±7
125 o1-mini 1337 ±4
126 nova-2-lite 1336 ±6
127 llama-3.1-405b-instruct-bf16 1336 ±4
128 qwq-32b 1336 ±4
129 gpt-4o-2024-08-06 1335 ±4
130 gemini-advanced-0514 1335 ±5
131 grok-2-2024-08-13 1335 ±4
132 llama-3.1-405b-instruct-fp8 1334 ±4
133 step-2-16k-exp-202412 1334 ±9
134 yi-lightning 1329 ±5
135 qwen3-30b-a3b 1328 ±5
136 llama-4-maverick-17b-128e-instruct 1328 ±4
137 llama-3.3-nemotron-49b-super-v1 1327 ±12
138 hunyuan-large-2025-02-10 1327 ±10
139 gpt-4-turbo-2024-04-09 1325 ±4
140 claude-3-5-haiku-20241022 1324 ±3
141 olmo-3.1-32b-instruct 1324 ±7
142 deepseek-v2.5-1210 1323 ±8
143 gemini-1.5-pro-001 1323 ±4
144 llama-4-scout-17b-16e-instruct 1323 ±5
145 claude-3-opus-20240229 1323 ±3
146 gpt-4.1-nano-2025-04-14 1322 ±8
147 step-1o-turbo-202506 1321 ±7
148 ring-flash-2.0 1320 ±7
149 llama-3.3-70b-instruct 1320 ±3
150 glm-4-plus 1319 ±5
151 gemma-3n-e4b-it 1319 ±5
152 qwen-max-0919 1318 ±6
153 gpt-4o-mini-2024-07-18 1318 ±3
154 gpt-oss-20b 1318 ±6
155 nvidia-nemotron-3-nano-30b-a3b-bf16 1317 ±6
156 qwen2.5-plus-1127 1315 ±6
157 athene-v2-chat 1314 ±4
158 mistral-large-2407 1314 ±4
159 gpt-4-1106-preview 1314 ±4
160 gpt-4-0125-preview 1314 ±4
161 hunyuan-standard-2025-02-10 1312 ±10
162 mercury 1310 ±14
163 gemini-1.5-flash-002 1310 ±4
164 grok-2-mini-2024-08-13 1308 ±4
165 deepseek-v2.5 1307 ±5
166 athene-70b-0725 1306 ±6
167 magistral-medium-2506 1305 ±6
168 mistral-large-2411 1305 ±4
169 olmo-3-32b-think 1305 ±8
170 mistral-small-3.1-24b-instruct-2503 1305 ±4
171 gemma-3-4b-it 1303 ±9
172 qwen2.5-72b-instruct 1303 ±4
173 llama-3.1-nemotron-70b-instruct 1299 ±8
174 hunyuan-large-vision 1296 ±9
175 llama-3.1-70b-instruct 1294 ±4
176 amazon-nova-pro-v1.0 1290 ±5
177 jamba-1.5-large 1289 ±7
178 ibm-granite-h-small 1288 ±8
179 gemma-2-27b-it 1288 ±3
180 reka-core-20240904 1288 ±7
181 gpt-4-0314 1288 ±5
182 llama-3.1-nemotron-51b-instruct 1287 ±10
183 llama-3.1-tulu-3-70b 1287 ±10
184 gemini-1.5-flash-001 1286 ±4
185 olmo-3.1-32b-think 1285 ±7
186 claude-3-sonnet-20240229 1282 ±4
187 gemma-2-9b-it-simpo 1280 ±7
188 nemotron-4-340b-instruct 1278 ±5
189 command-r-plus-08-2024 1277 ±7
190 llama-3-70b-instruct 1277 ±4
191 gpt-4-0613 1276 ±4
192 mistral-small-24b-instruct-2501 1274 ±6
193 glm-4-0520 1274 ±7
194 reka-flash-20240904 1273 ±7
195 qwen2.5-coder-32b-instruct 1271 ±8
196 c4ai-aya-expanse-32b 1267 ±5
197 gemma-2-9b-it 1266 ±4
198 deepseek-coder-v2 1265 ±6
199 command-r-plus 1263 ±4
200 qwen2-72b-instruct 1263 ±5
201 claude-3-haiku-20240307 1262 ±4
202 amazon-nova-lite-v1.0 1261 ±5
203 gemini-1.5-flash-8b-001 1259 ±4
204 phi-4 1256 ±5
205 olmo-2-0325-32b-instruct 1253 ±11
206 command-r-08-2024 1251 ±7
207 mistral-large-2402 1243 ±5
208 amazon-nova-micro-v1.0 1241 ±5
209 jamba-1.5-mini 1239 ±7
210 ministral-8b-2410 1237 ±9
211 gemini-pro-dev-api 1236 ±7
212 qwen1.5-110b-chat 1235 ±6
213 reka-flash-21b-20240226-online 1234 ±7
214 qwen1.5-72b-chat 1234 ±5
215 hunyuan-standard-256k 1234 ±12
216 mixtral-8x22b-instruct-v0.1 1230 ±5
217 command-r 1228 ±5
218 reka-flash-21b-20240226 1227 ±6
219 gpt-3.5-turbo-0125 1225 ±5
220 mistral-medium 1224 ±5
221 llama-3-8b-instruct 1224 ±4
222 c4ai-aya-expanse-8b 1223 ±7
223 gemini-pro 1223 ±12
224 llama-3.1-tulu-3-8b 1221 ±11
225 yi-1.5-34b-chat 1214 ±5
226 zephyr-orpo-141b-A35b-v0.1 1214 ±11
227 llama-3.1-8b-instruct 1212 ±4
228 granite-3.1-8b-instruct 1209 ±11
229 qwen1.5-32b-chat 1205 ±6
230 gpt-3.5-turbo-1106 1204 ±9
231 phi-3-medium-4k-instruct 1199 ±5
232 gemma-2-2b-it 1199 ±4
233 mixtral-8x7b-instruct-v0.1 1198 ±4
234 dbrx-instruct-preview 1196 ±6
235 internlm2_5-20b-chat 1192 ±7
236 qwen1.5-14b-chat 1192 ±7
237 wizardlm-70b 1186 ±9
238 deepseek-llm-67b-chat 1185 ±12
239 yi-34b-chat 1185 ±7
240 openchat-3.5-0106 1183 ±8
241 openchat-3.5 1183 ±10
242 granite-3.0-8b-instruct 1183 ±9
243 gemma-1.1-7b-it 1181 ±6
244 snowflake-arctic-instruct 1180 ±6
245 granite-3.1-2b-instruct 1180 ±11
246 tulu-2-dpo-70b 1179 ±10
247 openhermes-2.5-mistral-7b 1176 ±10
248 vicuna-33b 1174 ±6
249 starling-lm-7b-beta 1172 ±7
250 phi-3-small-8k-instruct 1172 ±6
251 llama-2-70b-chat 1172 ±5
252 starling-lm-7b-alpha 1168 ±8
253 llama-3.2-3b-instruct 1167 ±8
254 nous-hermes-2-mixtral-8x7b-dpo 1166 ±12
255 qwq-32b-preview 1158 ±12
256 granite-3.0-2b-instruct 1157 ±8
257 llama2-70b-steerlm-chat 1156 ±13
258 solar-10.7b-instruct-v1.0 1153 ±13
259 dolphin-2.2.1-mistral-7b 1153 ±15
260 mpt-30b-chat 1151 ±12
261 mistral-7b-instruct-v0.2 1151 ±7
262 wizardlm-13b 1150 ±9
263 falcon-180b-chat 1148 ±17
264 qwen1.5-7b-chat 1145 ±10
265 phi-3-mini-4k-instruct-june-2024 1144 ±6
266 llama-2-13b-chat 1142 ±7
267 vicuna-13b 1142 ±7
268 qwen-14b-chat 1140 ±11
269 palm-2 1138 ±9
270 codellama-34b-instruct 1138 ±9
271 gemma-7b-it 1137 ±10
272 zephyr-7b-beta 1132 ±9
273 phi-3-mini-128k-instruct 1130 ±7
274 phi-3-mini-4k-instruct 1129 ±6
275 guanaco-33b 1128 ±12
276 zephyr-7b-alpha 1128 ±16
277 stripedhyena-nous-7b 1122 ±11
278 codellama-70b-instruct 1120 ±18
279 vicuna-7b 1116 ±9
280 smollm2-1.7b-instruct 1115 ±14
281 gemma-1.1-2b-it 1115 ±8
282 llama-3.2-1b-instruct 1112 ±8
283 mistral-7b-instruct 1111 ±9
284 llama-2-7b-chat 1109 ±7
285 gemma-2b-it 1092 ±12
286 qwen1.5-4b-chat 1091 ±9
287 olmo-7b-instruct 1075 ±11
288 koala-13b 1071 ±10
289 alpaca-13b 1068 ±12
290 gpt4all-13b-snoozy 1067 ±15
291 mpt-7b-chat 1063 ±12
292 chatglm3-6b 1057 ±12
293 RWKV-4-Raven-14B 1042 ±11
294 chatglm2-6b 1025 ±14
295 oasst-pythia-12b 1023 ±11
296 chatglm-6b 997 ±13
297 fastchat-t5-3b 992 ±12
298 dolly-v2-12b 981 ±14
299 llama-13b 973 ±16
300 stablelm-tuned-alpha-7b 954 ±13
The turing test (or imitation game) tries to distinguish machine from human. © 2026 TuringScore
Feedback? Click here.