rating,variance,rating_q975,rating_q025,num_battles,final_ranking,key,Model,License,Organization,Knowledge cutoff date,Link,MT-bench (score),MMLU 1007.7329590351438,169.818293680913,1031.849773785732,982.4558698804362,540,11,Google: Gemini Pro 1.5,Google: Gemini Pro 1.5,Proprietary,Google,-,https://gemini.google.com/,-,- 1042.4520144977232,117.8434909383862,1061.0014189899887,1023.9434827595032,1065,6,claude-3-5-sonnet-20241022,claude-3-5-sonnet-20241022,Proprietary,Anthropic,06-2024,https://docs.anthropic.com/en/docs/intro-to-claude#claude-3-5-family,-,- 902.3880187987183,113.01456535455019,919.2594239926264,882.875693560186,974,37,gpt-4o-2024-05-13,gpt-4o-2024-05-13,Proprietary,OpenAI,10-2023,https://openai.com/api/,70.0,50.0 964.7054114008947,246.77221113388686,993.1417845835723,935.4956211758649,332,18,claude-3-5-sonnet-20240620,claude-3-5-sonnet-20240620,Proprietary,Anthropic,06-2024,https://docs.anthropic.com/en/docs/intro-to-claude#claude-3-5-family,-,- 977.6965503563338,230.60818555408642,1007.0446828059678,947.5964599455415,434,12,Google: Gemini Flash 1.5,Google: Gemini Flash 1.5,Proprietary,Google,-,https://gemini.google.com/,-,- 932.8986111701167,126.36940738505085,953.0306970352567,913.2617787203508,872,26,gpt-4-turbo-2024-04-09,gpt-4-turbo-2024-04-09,Proprietary,OpenAI,04-2023,https://openai.com/api/,70.0,50.0 1040.0660491971114,339.61436371063905,1074.9411942112229,1003.8258755933595,228,5,Qwen2.5 72B Instruct,Qwen2.5 72B Instruct,Open Source,Qwen,-,https://huggingface.co/Qwen/Qwen2.5-72B-Instruct,-,- 889.5039077488458,114.80940986649166,906.848549345477,870.9588281543212,984,40,Llama 3.1 405B Instruct Turbo,Llama 3.1 405B Instruct Turbo,Proprietary,Meta,-,https://ai.meta.com/blog/meta-llama-3-1/,-,- 964.6329250768428,201.594408178038,990.2170161271264,938.8186977476747,444,19,gpt-4o-mini-2024-07-18,gpt-4o-mini-2024-07-18,Proprietary,OpenAI,07-2024,https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/,70.0,50.0 928.2161511401265,119.1716825191117,946.5917242130985,908.8721029423501,959,27,gpt-4-0613,gpt-4-0613,Proprietary,OpenAI,04-2023,https://openai.com/api/,70.0,50.0 991.3649857326623,296.467557146047,1023.805995701806,957.5266989665441,276,12,Gemma 2 27B,Gemma 2 27B,Proprietary,Google,-,https://blog.google/technology/developers/google-gemma-2/,-,- 1036.027497975026,427.32120912744136,1078.7718965580088,995.6185819438455,208,4,GigaChat-Max-preview 4.0.26.20,GigaChat-Max-preview 4.0.26.20,Proprietary,Sber,In training,https://www.sber-bank.by/new/gigachat-29102024,-,- 947.3075239370963,115.03010047381805,965.5521058226973,928.9449922034324,908,25,GigaChat-Pro 4.0.26.20,GigaChat-Pro 4.0.26.20,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,- 1093.453359234941,179.01655493695912,1117.575853919296,1067.7918094100419,593,1,Llama 3.1 70B Instruct Turbo,Llama 3.1 70B Instruct Turbo,Proprietary,Meta,-,https://ai.meta.com/blog/meta-llama-3-1/,-,- 1131.5716539832576,192.20222956350986,1157.120074585947,1106.9520258258947,532,1,saiga_llama3_70b,saiga_llama3_70b,Open Source,Ilya Gusev,In training,https://huggingface.co/IlyaGusev/saiga_llama3_70b_sft_m1_d5_abliterated_awq_4bit,-,- 1002.2027132811115,115.33138700604408,1020.1828564529311,983.0242717796616,917,12,YandexGPT Experimental,YandexGPT Experimental,Proprietary,Yandex,In training,https://ya.ru/ai/gpt-3,45.2,35.2 947.6719018175467,108.12765061032577,965.3144218988184,928.5608934367314,955,25,Qwen 2 Instruct (72B),Qwen 2 Instruct (72B),Open Source,Qwen,12-2023,https://llama.meta.com/llama3/,-,- 1057.253551112602,106.40008504549796,1074.1692966485289,1039.605804327147,1084,5,claude-3-haiku-20240307,claude-3-haiku-20240307,Proprietary,Anthropic,03-2024,https://docs.anthropic.com/en/docs/intro-to-claude#claude-3-family,-,- 1021.6234837090112,108.14270051331808,1038.2328464795467,1002.7820523715413,1110,8,Cohere: Command R+ (08-2024),Cohere: Command R+ (08-2024),Open Source,Cohere,-,https://docs.cohere.com/v2/docs/command-r-plus,-,- 945.822917136063,103.86903183702671,961.8660341166047,927.5255054039449,1142,25,YandexGPT 4 Pro,YandexGPT 4 Pro,Proprietary,Yandex,In training,https://ya.ru/ai/gpt-4,45.2,35.2 961.9741525927969,186.57933675295138,984.9464026845426,938.825633687611,516,20,LLaMA-3 Chat (70B),LLaMA-3 Chat (70B),Proprietary,Meta,12-2023,https://llama.meta.com/llama3/,-,- 1019.0840901421483,115.79266640931613,1037.525656259409,1001.1393167965639,935,8,Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24,Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24,Open Source,Vikhrmodels,In training,https://huggingface.co/Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24,-,- 1060.7369238504189,188.65249164009626,1087.3669349341656,1035.6116770844812,500,3,GigaChat-Pro 4.0.26.15,GigaChat-Pro 4.0.26.15,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,- 934.0368409509991,189.71746082231152,959.4870225345246,905.4648969694932,467,25,gpt-3.5-turbo-0125,gpt-3.5-turbo-0125,Proprietary,OpenAI,09-2021,https://openai.com/api/,65.2,45.2 949.7355141836102,123.70997370279837,968.0426294913323,930.4882893208475,1025,25,YandexGPT 3 Pro,YandexGPT 3 Pro,Proprietary,Yandex,In training,https://ya.ru/ai/gpt-3,65.2,45.2 999.3043303904495,182.76505492099727,1022.8249219768784,974.3117890807498,525,12,GigaChat 4.0.26.20,GigaChat 4.0.26.20,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,- 937.3829533535418,166.82510963462383,959.7235522504662,912.4300217204257,532,25,GigaChat 4.0.26.15,GigaChat 4.0.26.15,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,- 941.0038427118799,98.63414440661494,956.2475627612499,925.5758870749552,1214,26,GigaChat-Plus 4.0.26.15,GigaChat-Plus 4.0.26.15,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,- 984.5956178867688,92.84306419976683,1000.5310112015856,969.0051472939118,1202,17,saiga_llama3_8b_v7,saiga_llama3_8b_v7,Open Source,Ilya Gusev,In training,https://huggingface.co/IlyaGusev/saiga_llama3_8b,-,- 1005.7933396558727,108.21124661486498,1022.8098773874418,988.3075334839211,988,12,Llama 3.2 11B Instruct,Llama 3.2 11B Instruct,Open Source,Meta,-,https://www.llama.com/,-,- 1020.2098914368069,92.74331702637394,1035.3807830117264,1003.5589821287602,978,11,saiga_phi3_medium,saiga_phi3_medium,Open Source,Ilya Gusev,In training,https://huggingface.co/IlyaGusev/saiga_phi3_medium_sft_m1_d2_kto_m5_d7,-,- 1096.7173990761198,121.79821650718813,1115.8459758528036,1078.0301611965513,1003,1,T-lite-instruct-0.1,T-lite-instruct-0.1,Open Source,t-bank-ai,In training,https://huggingface.co/AnatoliiPotapov/T-lite-instruct-0.1,-,- 1115.8994565405237,381.5497905443842,1155.7700574830399,1081.3294967762026,255,1,LLaMA-3 Chat (8B),LLaMA-3 Chat (8B),Proprietary,Meta,03-2023,https://llama.meta.com/llama3/,-,- 1011.4192357559319,118.92668361863409,1029.549449811906,991.0551547511924,918,11,GigaChat-Pro 4.0.26.8,GigaChat-Pro 4.0.26.8,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,- 986.6132177312799,90.30295433317978,1001.2109824911548,970.5147891389424,1311,15,Llama 3.1 8B Instruct Turbo,Llama 3.1 8B Instruct Turbo,Proprietary,Meta,-,https://ai.meta.com/blog/meta-llama-3-1/,-,- 1053.082035539947,85.21569987086933,1067.257148071178,1036.9122424485238,1339,6,YandexGPT 3 Lite,YandexGPT 3 Lite,Proprietary,Yandex,In training,https://ya.ru/ai/gpt-3,45.2,35.2 1073.4348715448846,85.0210331634979,1088.0361883745857,1057.6897152839829,1386,3,Vikhrmodels/it-5.2-fp16-cp,Vikhrmodels/it-5.2-fp16-cp,Open Source,Vikhrmodels,In training,https://huggingface.co/Vikhrmodels/it-5.2-fp16-cp,-,- 1106.6449361128784,92.96079840608918,1121.7855392810814,1089.7870222687427,1416,1,RefalMachine/ruadapt_llama3_instruct_lep_saiga_kto_ablitirated,RefalMachine/ruadapt_llama3_instruct_lep_saiga_kto_ablitirated,Open Source,RefalMachine,-,https://huggingface.co/RefalMachine/ruadapt_llama3_instruct_lep_saiga_kto_ablitirated,-,- 1053.7836308174083,92.60585712791098,1068.5774426073835,1037.4974182854107,1187,5,GigaChat 4.0.26.8,GigaChat 4.0.26.8,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,- 1021.4043201272381,131.44855610168824,1040.1130552179018,1001.1066777955125,673,7,GigaChat-Pro 2.2.25.3,GigaChat-Pro 2.2.25.3,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,- 916.9528773222481,291.1619529870915,947.1007536671236,882.5743996887952,261,27,saiga_llama3_8b_v6,saiga_llama3_8b_v6,Open Source,Ilya Gusev,In training,https://huggingface.co/IlyaGusev/saiga_llama3_8b,-,- 962.1392717905012,107.61472994509985,978.6271622855089,944.0131614314163,910,22,GigaChat 3.1.25.3,GigaChat 3.1.25.3,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,- 950.2160106600422,173.19293674584503,972.1396708220341,924.9718965030927,494,23,GigaChat-Plus 3.1.25.3,GigaChat-Plus 3.1.25.3,Proprietary,Sber,In training,https://developers.sber.ru/portal/products/gigachat,-,-