████████╗██╗ ██╗██████╗ ██╗███╗ ██╗ ██████╗ ███████╗ ██████╗ ██████╗ ██████╗ ███████╗ ██████╗ ██████╗ ███╗ ███╗ ╚══██╔══╝██║ ██║██╔══██╗██║████╗ ██║██╔════╝ ██╔════╝██╔════╝██╔═══██╗██╔══██╗██╔════╝ ██╔════╝██╔═══██╗████╗ ████║ ██║ ██║ ██║██████╔╝██║██╔██╗ ██║██║ ███╗███████╗██║ ██║ ██║██████╔╝█████╗ ██║ ██║ ██║██╔████╔██║ ██║ ██║ ██║██╔══██╗██║██║╚██╗██║██║ ██║╚════██║██║ ██║ ██║██╔══██╗██╔══╝ ██║ ██║ ██║██║╚██╔╝██║ ██║ ╚██████╔╝██║ ██║██║██║ ╚████║╚██████╔╝███████║╚██████╗╚██████╔╝██║ ██║███████╗██╗╚██████╗╚██████╔╝██║ ╚═╝ ██║ ╚═╝ ╚═════╝ ╚═╝ ╚═╝╚═╝╚═╝ ╚═══╝ ╚═════╝ ╚══════╝ ╚═════╝ ╚═════╝ ╚═╝ ╚═╝╚══════╝╚═╝ ╚═════╝ ╚═════╝ ╚═╝ ╚═╝
Figure out which LLM is "best". Data from LMSYS's Chatbot Arena. Click here to learn more.
Model | Elo | MMLU | Licensing |
---|---|---|---|
Gemini-1.5-Pro-Exp-0801 | 1299 | Proprietary | |
GPT-4o-2024-05-13 | 1286 | 88.7 | Proprietary |
GPT-4o-mini-2024-07-18 | 1277 | 82 | Proprietary |
Claude 3.5 Sonnet | 1271 | 88.7 | Proprietary |
Gemini Advanced App (2024-05-14) | 1266 | Proprietary | |
Meta-Llama-3.1-405b-Instruct | 1264 | 88.6 | Llama 3.1 Community |
Gemini-1.5-Pro-001 | 1260 | 85.9 | Proprietary |
GPT-4-Turbo-2024-04-09 | 1257 | Proprietary | |
Gemini-1.5-Pro-Preview-0409 | 1257 | 81.9 | Proprietary |
GPT-4-1106-preview | 1251 | Proprietary | |
Athene-70b | 1249 | CC-BY-NC-4.0 | |
Mistral-Large-2407 | 1249 | Mistral Research | |
Claude 3 Opus | 1248 | 86.8 | Proprietary |
GPT-4-0125-preview | 1245 | Proprietary | |
Meta-Llama-3.1-70b-Instruct | 1242 | 86 | Llama 3.1 Community |
Yi-Large-preview | 1240 | Proprietary | |
Gemini-1.5-Flash-001 | 1228 | 78.9 | Proprietary |
Reka-Core-20240722 | 1227 | Proprietary | |
Deepseek-v2-API-0628 | 1221 | DeepSeek | |
Gemma-2-27b-it | 1218 | Gemma license | |
Deepseek-Coder-v2-0724 | 1213 | Proprietary | |
Yi-Large | 1212 | Proprietary | |
Gemini App (2024-01-24) | 1209 | Proprietary | |
Nemotron-4-340B-Instruct | 1209 | NVIDIA Open Model | |
GLM-4-0520 | 1207 | Proprietary | |
Llama-3-70b-Instruct | 1206 | 82 | Llama 3 Community |
Reka-Flash-20240722 | 1203 | Proprietary | |
Claude 3 Sonnet | 1201 | 79 | Proprietary |
Reka-Core-20240501 | 1199 | 83.2 | Proprietary |
Command R+ | 1190 | CC-BY-NC-4.0 | |
Qwen2-72B-Instruct | 1187 | 84.2 | Qianwen LICENSE |
Gemma-2-9b-it | 1187 | Gemma license | |
GPT-4-0314 | 1186 | 86.4 | Proprietary |
Qwen-Max-0428 | 1183 | Proprietary | |
GLM-4-0116 | 1183 | Proprietary | |
Claude 3 Haiku | 1178 | 75.2 | Proprietary |
DeepSeek-Coder-V2-Instruct | 1178 | DeepSeek License | |
Reka-Flash-Preview-20240611 | 1165 | Proprietary | |
GPT-4-0613 | 1162 | Proprietary | |
Meta-Llama-3.1-8b-Instruct | 1162 | 73 | Llama 3.1 Community |
Qwen1.5-110B-Chat | 1161 | 80.4 | Qianwen LICENSE |
Mistral-Large-2402 | 1157 | 81.2 | Proprietary |
Yi-1.5-34B-Chat | 1157 | 76.8 | Apache-2.0 |
Reka-Flash-21B-online | 1156 | Proprietary | |
Llama-3-8b-Instruct | 1152 | 68.4 | Llama 3 Community |
Claude-1 | 1149 | 77 | Proprietary |
Command R | 1149 | CC-BY-NC-4.0 | |
Mistral Medium | 1148 | 75.3 | Proprietary |
Qwen1.5-72B-Chat | 1147 | 77.5 | Qianwen LICENSE |
Reka-Flash-21B | 1147 | 73.5 | Proprietary |
Mixtral-8x22b-Instruct-v0.1 | 1146 | 77.8 | Apache 2.0 |
Claude-2.0 | 1132 | 78.5 | Proprietary |
Gemini-1.0-Pro-001 | 1131 | 71.8 | Proprietary |
Zephyr-ORPO-141b-A35b-v0.1 | 1127 | Apache 2.0 | |
Gemma-2-2b-it | 1127 | 51.3 | Gemma license |
Qwen1.5-32B-Chat | 1125 | 73.4 | Qianwen LICENSE |
Mistral-Next | 1124 | Proprietary | |
Phi-3-Medium-4k-Instruct | 1123 | 78 | MIT |
Starling-LM-7B-beta | 1119 | Apache-2.0 | |
Claude-2.1 | 1118 | Proprietary | |
GPT-3.5-Turbo-0613 | 1117 | Proprietary | |
Mixtral-8x7b-Instruct-v0.1 | 1114 | 70.6 | Apache 2.0 |
Claude-Instant-1 | 1111 | 73.4 | Proprietary |
Yi-34B-Chat | 1111 | 73.5 | Yi License |
Gemini Pro | 1111 | 71.8 | Proprietary |
Qwen1.5-14B-Chat | 1109 | 67.6 | Qianwen LICENSE |
GPT-3.5-Turbo-0314 | 1106 | 70 | Proprietary |
WizardLM-70B-v1.0 | 1106 | 63.7 | Llama 2 Community |
GPT-3.5-Turbo-0125 | 1106 | Proprietary | |
DBRX-Instruct-Preview | 1103 | 73.7 | DBRX LICENSE |
Phi-3-Small-8k-Instruct | 1101 | 75.7 | MIT |
Tulu-2-DPO-70B | 1099 | AI2 ImpACT Low-risk | |
Llama-2-70b-chat | 1093 | 63 | Llama 2 Community |
Vicuna-33B | 1091 | 59.2 | Non-commercial |
OpenChat-3.5-0106 | 1091 | 65.8 | Apache-2.0 |
Snowflake Arctic Instruct | 1090 | 67.3 | Apache 2.0 |
Starling-LM-7B-alpha | 1088 | 63.9 | CC-BY-NC-4.0 |
Nous-Hermes-2-Mixtral-8x7B-DPO | 1084 | Apache-2.0 | |
Gemma-1.1-7b-it | 1083 | 64.3 | Gemma license |
NV-Llama2-70B-SteerLM-Chat | 1081 | 68.5 | Llama 2 Community |
pplx-70b-online | 1078 | Proprietary | |
DeepSeek-LLM-67B-Chat | 1077 | 71.3 | DeepSeek License |
OpenChat-3.5 | 1076 | 64.3 | Apache-2.0 |
OpenHermes-2.5-Mistral-7b | 1074 | Apache-2.0 | |
Mistral-7B-Instruct-v0.2 | 1072 | Apache-2.0 | |
Qwen1.5-7B-Chat | 1070 | 61 | Qianwen LICENSE |
GPT-3.5-Turbo-1106 | 1068 | Proprietary | |
Phi-3-Mini-4k-Instruct | 1066 | 68.8 | MIT |
Phi-3-Mini-4k-Instruct-June-24 | 1064 | 70.9 | MIT |
Llama-2-13b-chat | 1063 | 53.6 | Llama 2 Community |
SOLAR-10.7B-Instruct-v1.0 | 1062 | 66.2 | CC-BY-NC-4.0 |
Dolphin-2.2.1-Mistral-7B | 1062 | Apache-2.0 | |
WizardLM-13b-v1.2 | 1058 | 52.7 | Llama 2 Community |
Zephyr-7b-beta | 1053 | 61.4 | MIT |
MPT-30B-chat | 1045 | 50.4 | CC-BY-NC-SA-4.0 |
pplx-7b-online | 1045 | Proprietary | |
Vicuna-13B | 1042 | 55.8 | Llama 2 Community |
CodeLlama-34B-instruct | 1042 | 53.7 | Llama 2 Community |
CodeLlama-70B-instruct | 1042 | Llama 2 Community | |
Zephyr-7b-alpha | 1041 | MIT | |
Llama-2-7b-chat | 1037 | 45.8 | Llama 2 Community |
Gemma-7b-it | 1037 | 64.3 | Gemma license |
Phi-3-Mini-128k-Instruct | 1036 | 68.1 | MIT |
Qwen-14B-Chat | 1035 | 66.5 | Qianwen LICENSE |
falcon-180b-chat | 1034 | 68 | Falcon-180B TII License |
Guanaco-33B | 1033 | 57.6 | Non-commercial |
Gemma-1.1-2b-it | 1020 | 64.3 | Gemma license |
StripedHyena-Nous-7B | 1017 | Apache 2.0 | |
OLMo-7B-instruct | 1015 | Apache-2.0 | |
Mistral-7B-Instruct-v0.1 | 1008 | 55.4 | Apache 2.0 |
Vicuna-7B | 1005 | 49.8 | Llama 2 Community |
PaLM-Chat-Bison-001 | 1003 | Proprietary | |
Qwen1.5-4B-Chat | 989 | 56.1 | Qianwen LICENSE |
Gemma-2b-it | 989 | 42.3 | Gemma license |
Koala-13B | 964 | 44.7 | Non-commercial |
ChatGLM3-6B | 956 | Apache-2.0 | |
GPT4All-13B-Snoozy | 932 | 43 | Non-commercial |
MPT-7B-Chat | 927 | 32 | CC-BY-NC-SA-4.0 |
ChatGLM2-6B | 924 | 45.5 | Apache-2.0 |
RWKV-4-Raven-14B | 922 | 25.6 | Apache 2.0 |
Alpaca-13B | 902 | 48.1 | Non-commercial |
OpenAssistant-Pythia-12B | 893 | 27 | Apache 2.0 |
ChatGLM-6B | 879 | 36.1 | Non-commercial |
FastChat-T5-3B | 868 | 47.7 | Apache 2.0 |
StableLM-Tuned-Alpha-7B | 840 | 24.4 | CC-BY-NC-SA-4.0 |
Dolly-V2-12B | 822 | 25.7 | MIT |
LLaMA-13B | 798 | 47 | Non-commercial |
WizardLM-30B | 58.7 | Non-commercial | |
Vicuna-13B-16k | 54.5 | Llama 2 Community | |
WizardLM-13B-v1.1 | 50 | Non-commercial | |
Tulu-30B | 58.1 | Non-commercial | |
Guanaco-65B | 62.1 | Non-commercial | |
OpenAssistant-LLaMA-30B | 56 | Non-commercial | |
WizardLM-13B-v1.0 | 52.3 | Non-commercial | |
Vicuna-7B-16k | 48.5 | Llama 2 Community | |
Baize-v2-13B | 48.9 | Non-commercial | |
XGen-7B-8K-Inst | 42.1 | Non-commercial | |
Nous-Hermes-13B | 49.3 | Non-commercial | |
MPT-30B-Instruct | 47.8 | CC-BY-SA 3.0 | |
Falcon-40B-Instruct | 54.7 | Apache 2.0 | |
H2O-Oasst-OpenLLaMA-13B | 42.8 | Apache 2.0 | |
LLaVA-v1.6-34B | Apache 2.0 | ||
CogVLM2-llama3-chat-19b | CogVLM2 | ||
InternVL2-26b | MIT | ||
GLM-4-AIR | Proprietary | ||
Snorkel-Mistral-PairRM-DPO | Apache 2.0 |