| 01 | mistralai/Mistral-Large-Instruct-2407 | verified | 78.07 | 2026 | Source ↗ | Looks wrong? |
| 02 | mistralai/Mistral-Large-Instruct-2411 | verified | 77.29 | 2026 | Source ↗ | Looks wrong? |
| 03 | Meta-Llama-3.1-405B-Instruct-FP8 | verified | 77.23 | 2026 | Source ↗ | Looks wrong? |
| 04 | GPT-4o-2024-08-06 | verified | 75.15 | 2026 | Source ↗ | Looks wrong? |
| 05 | gpt-4-turbo-2024-04-09 | verified | 74.586433 | 2026 | Source ↗ | Looks wrong? |
| 06 | speakleash/Bielik-11B-v2.6-Instruct | verified | 73.696491 | 2026 | Source ↗ | Looks wrong? |
| 07 | deepseek-ai/DeepSeek-V3-0324 (API) | verified | 73.46 | 2026 | Source ↗ | Looks wrong? |
| 08 | Mistral-Small-Instruct-2409 | verified | 72.85 | 2026 | Source ↗ | Looks wrong? |
| 09 | CYFRAGOVPL/Llama-PLLuM-70B-chat | verified | 72.563158 | 2026 | Source ↗ | Looks wrong? |
| 10 | meta-llama/Meta-Llama-3.1-70B-Instruct | verified | 72.53 | 2026 | Source ↗ | Looks wrong? |
| 11 | speakleash/Bielik-11B-v2.5-Instruct | verified | 71.996491 | 2026 | Source ↗ | Looks wrong? |
| 12 | Qwen/Qwen2-72B-Instruct | verified | 71.227076 | 2026 | Source ↗ | Looks wrong? |
| 13 | meta-llama/Meta-Llama-3-70B-Instruct | verified | 71.21 | 2026 | Source ↗ | Looks wrong? |
| 14 | speakleash/Bielik-11B-v3.0-Instruct | verified | 71.2 | 2026 | Source ↗ | Looks wrong? |
| 15 | GPT-4o-mini-2024-07-18 | verified | 71.15 | 2026 | Source ↗ | Looks wrong? |
| 16 | Qwen/Qwen2.5-32B-Instruct | verified | 71.15 | 2026 | Source ↗ | Looks wrong? |
| 17 | speakleash/Bielik-11B-v2.3-Instruct | verified | 70.86 | 2026 | Source ↗ | Looks wrong? |
| 18 | meta-llama/Llama-3.3-70B-Instruct | verified | 70.729591 | 2026 | Source ↗ | Looks wrong? |
| 19 | mistralai/Mistral-Small-24B-Instruct-2501 | verified | 70.52 | 2026 | Source ↗ | Looks wrong? |
| 20 | CYFRAGOVPL/Llama-PLLuM-70B-instruct | verified | 69.99 | 2026 | Source ↗ | Looks wrong? |
| 21 | alpindale/WizardLM-2-8x22B (API) | verified | 69.56 | 2026 | Source ↗ | Looks wrong? |
| 22 | Qwen/Qwen2.5-14B-Instruct | verified | 69.173099 | 2026 | Source ↗ | Looks wrong? |
| 23 | speakleash/Bielik-11B-v2.2-Instruct | verified | 69.05 | 2026 | Source ↗ | Looks wrong? |
| 24 | Qwen2-72B | verified | 68.934211 | 2026 | Source ↗ | Looks wrong? |
| 25 | Qwen/Qwen2.5-72B-Instruct | verified | 68.487135 | 2026 | Source ↗ | Looks wrong? |
| 26 | speakleash/Bielik-11B-v2.0-Instruct | verified | 68.24 | 2026 | Source ↗ | Looks wrong? |
| 27 | Qwen/Qwen1.5-72B-Chat | verified | 68.03 | 2026 | Source ↗ | Looks wrong? |
| 28 | mistralai/Mixtral-8x22B-Instruct-v0.1 (API) | verified | 67.63 | 2026 | Source ↗ | Looks wrong? |
| 29 | THUDM/glm-4-9b-chat | verified | 61.79 | 2026 | Source ↗ | Looks wrong? |
| 30 | mistralai/Mistral-Nemo-Instruct-2407 | verified | 61.76 | 2026 | Source ↗ | Looks wrong? |
| 31 | speakleash/Bielik-11B-v2.1-Instruct | verified | 60.069298 | 2026 | Source ↗ | Looks wrong? |
| 32 | Qwen1.5-32B-Chat | verified | 59.625263 | 2026 | Source ↗ | Looks wrong? |
| 33 | openchat/openchat-3.5-0106-gemma | verified | 59.579532 | 2026 | Source ↗ | Looks wrong? |
| 34 | microsoft/phi-4 | verified | 59.099942 | 2026 | Source ↗ | Looks wrong? |
| 35 | Qwen/Qwen2.5-7B-Instruct | verified | 58.58 | 2026 | Source ↗ | Looks wrong? |
| 36 | aya-23-35B | verified | 58.41 | 2026 | Source ↗ | Looks wrong? |
| 37 | GPT-3.5-turbo | verified | 57.7 | 2026 | Source ↗ | Looks wrong? |
| 38 | Qwen2-57B-A14B-Instruct | verified | 57.64 | 2026 | Source ↗ | Looks wrong? |
| 39 | mistralai/Mixtral-8x7B-Instruct-v0.1 | verified | 57.611228 | 2026 | Source ↗ | Looks wrong? |
| 40 | c4ai-command-r-v01 | verified | 56.43 | 2026 | Source ↗ | Looks wrong? |
| 41 | Phi-3-medium-4k-instruct | verified | 56.402515 | 2026 | Source ↗ | Looks wrong? |
| 42 | upstage/SOLAR-10.7B-Instruct-v1.0 | verified | 55.213333 | 2026 | Source ↗ | Looks wrong? |
| 43 | CYFRAGOVPL/pllum-12b-nc-chat-250715 | verified | 55.165263 | 2026 | Source ↗ | Looks wrong? |
| 44 | Hermes-2-Theta-Llama-3-8B | verified | 54.88 | 2026 | Source ↗ | Looks wrong? |
| 45 | NeuralDaredevil-8B-abliterated | verified | 54.74 | 2026 | Source ↗ | Looks wrong? |
| 46 | Hermes-2-Pro-Llama-3-8B | verified | 54.57 | 2026 | Source ↗ | Looks wrong? |
| 47 | utter-project/EuroLLM-9B-Instruct | verified | 54.109649 | 2026 | Source ↗ | Looks wrong? |
| 48 | Qwen1.5-32B | verified | 54.032164 | 2026 | Source ↗ | Looks wrong? |
| 49 | Qwen2-7B-Instruct | verified | 53.74 | 2026 | Source ↗ | Looks wrong? |
| 50 | speakleash/Bielik-4.5B-v3.0-Instruct | verified | 53.580292 | 2026 | Source ↗ | Looks wrong? |
| 51 | recurrentgemma-9b-it | verified | 52.82 | 2026 | Source ↗ | Looks wrong? |
| 52 | CYFRAGOVPL/PLLuM-12B-chat | verified | 52.264561 | 2026 | Source ↗ | Looks wrong? |
| 53 | Qwen1.5-72B | verified | 51.435556 | 2026 | Source ↗ | Looks wrong? |
| 54 | microsoft/Phi-4-mini-instruct | verified | 50.522807 | 2026 | Source ↗ | Looks wrong? |
| 55 | berkeley-nest/Starling-LM-7B-alpha | verified | 49.63 | 2026 | Source ↗ | Looks wrong? |
| 56 | Nous-Hermes-2-SOLAR-10.7B | verified | 49.266959 | 2026 | Source ↗ | Looks wrong? |
| 57 | openchat-3.5-1210 | verified | 49.04 | 2026 | Source ↗ | Looks wrong? |
| 58 | Delexa-7b | verified | 48.45655 | 2026 | Source ↗ | Looks wrong? |
| 59 | Qwen1.5-14B-Chat | verified | 47.962573 | 2026 | Source ↗ | Looks wrong? |
| 60 | CYFRAGOVPL/PLLuM-8x7B-nc-chat | verified | 47.29 | 2026 | Source ↗ | Looks wrong? |
| 61 | Mistral-7B-Instruct-v0.2 | verified | 47.02193 | 2026 | Source ↗ | Looks wrong? |
| 62 | meta-llama/Meta-Llama-3-8B-Instruct | verified | 46.53 | 2026 | Source ↗ | Looks wrong? |
| 63 | Yi-1.5-9B-Chat | verified | 46.497895 | 2026 | Source ↗ | Looks wrong? |
| 64 | 01-ai/Yi-1.5-34B-Chat | verified | 46.32 | 2026 | Source ↗ | Looks wrong? |
| 65 | CYFRAGOVPL/Llama-PLLuM-8B-chat | verified | 46.200877 | 2026 | Source ↗ | Looks wrong? |
| 66 | meta-llama/Llama-3.2-3B-Instruct | verified | 46.188304 | 2026 | Source ↗ | Looks wrong? |
| 67 | aya-23-8B | verified | 45.43 | 2026 | Source ↗ | Looks wrong? |
| 68 | openchat/openchat-3.5-0106 | verified | 45.422807 | 2026 | Source ↗ | Looks wrong? |
| 69 | nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 | verified | 45.284094 | 2026 | Source ↗ | Looks wrong? |
| 70 | CYFRAGOVPL/PLLuM-8x7B-chat | verified | 45.22 | 2026 | Source ↗ | Looks wrong? |
| 71 | mistralai/Mistral-7B-Instruct-v0.3 | verified | 45.21 | 2026 | Source ↗ | Looks wrong? |
| 72 | Kruk-7B-SP-001 | verified | 44.44 | 2026 | Source ↗ | Looks wrong? |
| 73 | Starling-LM-7B-beta | verified | 43.781287 | 2026 | Source ↗ | Looks wrong? |
| 74 | OpenChat3.5-0106-Spichlerz-Bocian | verified | 42.839649 | 2026 | Source ↗ | Looks wrong? |
| 75 | falcon-11B | verified | 42.41 | 2026 | Source ↗ | Looks wrong? |
| 76 | CYFRAGOVPL/PLLuM-8x7B-nc-instruct | verified | 41.75 | 2026 | Source ↗ | Looks wrong? |
| 77 | OpenChat3.5-0106-Spichlerz-Inst-001 | verified | 41.6 | 2026 | Source ↗ | Looks wrong? |
| 78 | internlm2-chat-7b-sft | verified | 41.376608 | 2026 | Source ↗ | Looks wrong? |
| 79 | CYFRAGOVPL/PLLuM-8x7B-instruct | verified | 39.55 | 2026 | Source ↗ | Looks wrong? |
| 80 | internlm2-chat-7b | verified | 39.532164 | 2026 | Source ↗ | Looks wrong? |
| 81 | Llama3-ChatQA-1.5-8B | verified | 39.364327 | 2026 | Source ↗ | Looks wrong? |
| 82 | Meta-Llama-3-70B | verified | 39.090643 | 2026 | Source ↗ | Looks wrong? |
| 83 | OpenHermes-2.5-Mistral-7B | verified | 37.48 | 2026 | Source ↗ | Looks wrong? |
| 84 | internlm/internlm2-chat-20b | verified | 36.306433 | 2026 | Source ↗ | Looks wrong? |
| 85 | CYFRAGOVPL/PLLuM-12B-instruct | verified | 36.212515 | 2026 | Source ↗ | Looks wrong? |
| 86 | Qwen/Qwen2.5-3B-Instruct | verified | 35.869006 | 2026 | Source ↗ | Looks wrong? |
| 87 | Qwen2-7B | verified | 35.510409 | 2026 | Source ↗ | Looks wrong? |
| 88 | OpenHermes-13B | verified | 34.910526 | 2026 | Source ↗ | Looks wrong? |
| 89 | Bielik-SOLAR-LIKE-10.7B-Instruct-v0.1 | verified | 34.171462 | 2026 | Source ↗ | Looks wrong? |
| 90 | speakleash/Bielik-7B-Instruct-v0.1 | verified | 31.26386 | 2026 | Source ↗ | Looks wrong? |
| 91 | Qwen/Qwen2.5-1.5B-Instruct | verified | 27.627485 | 2026 | Source ↗ | Looks wrong? |
| 92 | Llama-3-8B-Omnibus-1-PL-v01-INSTRUCT | verified | 26.63 | 2026 | Source ↗ | Looks wrong? |
| 93 | Phi-3-mini-4k-instruct | verified | 26.081579 | 2026 | Source ↗ | Looks wrong? |
| 94 | Voicelab/trurl-2-13b-academic | verified | 24.555789 | 2026 | Source ↗ | Looks wrong? |
| 95 | Qwen1.5-7B-Chat | verified | 23.976608 | 2026 | Source ↗ | Looks wrong? |
| 96 | Qwen1.5-7B | verified | 20.947661 | 2026 | Source ↗ | Looks wrong? |
| 97 | meta-llama/Llama-3.2-1B-Instruct | verified | 17.820585 | 2026 | Source ↗ | Looks wrong? |
| 98 | gemma-1.1-2b-it | verified | 16.47 | 2026 | Source ↗ | Looks wrong? |
| 99 | Qwen2-1.5B-Instruct | verified | 14.792105 | 2026 | Source ↗ | Looks wrong? |
| 100 | internlm2-chat-1_8b | verified | 12.131579 | 2026 | Source ↗ | Looks wrong? |