Polish Cultural Competency2025en
Polish Linguistic and Cultural Competency Benchmark
Evaluates LLMs on Polish linguistic and cultural knowledge across 6 categories: art & entertainment, culture & tradition, geography, grammar, history, and vocabulary. Accuracy (0-100) per category. Created by Dadas et al. (2025).
Samples:165
Metrics:average, art-and-entertainment, culture-and-tradition, geography, grammar, history, vocabulary
Paper / WebsiteDownloadCurrent State of the Art
Gemini-3.1-Pro-Preview
97
average
PLCC — average
165 results · 1 SOTA advances · higher is better
All results
SOTA frontier
Model Size vs Score — Pareto Frontier
5 models · log scale · Pareto frontier shown
Global
Bielik
PLLuM
Pareto
Top Models Performance Comparison
Top 10 models ranked by average
Best Score
97.0
Top Model
Gemini-3.1-Pro-Pr...
Models Compared
10
Score Range
7.5
art-and-entertainment
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Gemini-3.0-Pro-PreviewOpen Source Google | 95 | Apr 2026 | |
| 2 | Gemini-3.1-Pro-PreviewOpen Source Google | 95 | Apr 2026 | |
| 3 | Gemini-3-Flash-PreviewOpen Source Google | 91 | Apr 2026 | |
| 4 | GPT-5.4-2026-03-05 (high reasoning)Open Source OpenAI | 91 | Apr 2026 | |
| 5 | Gemini-2.5-Pro-Preview-06-05Open Source Google | 91 | Apr 2026 | |
| 6 | GPT-4.5-preview-2025-02-27Open Source OpenAI | 90 | Apr 2026 | |
| 7 | GPT-5-Pro-2025-10-06 (high reasoning)Open Source OpenAI | 88 | Apr 2026 | |
| 8 | Gemini-2.5-Pro-Exp-03-25Open Source Google | 88 | Apr 2026 | |
| 9 | GPT-5.4-2026-03-05 (low reasoning)Open Source OpenAI | 87 | Apr 2026 | |
| 10 | Grok-4API xAI | 86 | Apr 2026 | |
| 11 | O1-2024-12-17Open Source OpenAI | 86 | Apr 2026 | |
| 12 | GPT-5-2025-08-07Open Source OpenAI | 85 | Apr 2026 | |
| 13 | GPT-5.1-2025-11-13 (high reasoning)Open Source OpenAI | 85 | Apr 2026 | |
| 14 | GPT-4o-2024-05-13Open Source OpenAI | 83 | Apr 2026 | |
| 15 | Gemini-Exp-1206Open Source Google | 83 | Apr 2026 | |
| 16 | O3-2025-04-16Open Source OpenAI | 83 | Apr 2026 | |
| 17 | GPT-4o-2024-11-20Open Source OpenAI | 82 | Apr 2026 | |
| 18 | GPT-4o-2024-08-06Open Source OpenAI | 82 | Apr 2026 | |
| 19 | Claude-3.7-SonnetOpen Source Anthropic | 80 | Apr 2026 | |
| 20 | GPT-5.2-2025-12-11 (xhigh reasoning)Open Source OpenAI | 79 | Apr 2026 | |
| 21 | GPT-5.4-2026-03-05 (no reasoning)Open Source OpenAI | 79 | Apr 2026 | |
| 22 | GPT-5.2-2025-12-11 (high reasoning)Open Source OpenAI | 78 | Apr 2026 | |
| 23 | Gemini-2.5-Flash-Preview-04-17Open Source Google | 78 | Apr 2026 | |
| 24 | GPT-4.1-2025-04-14Open Source OpenAI | 77 | Apr 2026 | |
| 25 | Claude-3.7-Sonnet-ThinkingOpen Source Anthropic | 77 | Apr 2026 | |
| 26 | Claude-3.5-Sonnet-20241022Open Source Anthropic | 77 | Apr 2026 | |
| 27 | GPT-5.4-mini-2026-03-17 (high reasoning)Open Source OpenAI | 76 | Apr 2026 | |
| 28 | Claude-Opus-4.6Open Source Anthropic | 75 | Apr 2026 | |
| 29 | GPT-5.2-2025-12-11 (medium reasoning)Open Source OpenAI | 74 | Apr 2026 | |
| 30 | Claude-Opus-4.5Open Source Anthropic | 74 | Apr 2026 | |
| 31 | Claude-3.5-Sonnet-20240620Open Source Anthropic | 73 | Apr 2026 | |
| 32 | Claude-3-OpusAPI Anthropic | 73 | Apr 2026 | |
| 33 | Claude-Opus-4API Anthropic | 72 | Apr 2026 | |
| 34 | PLLuM-8x7B-nc-chatOpen Source PLLuM | 72 | Apr 2026 | |
| 35 | GPT-5.1-2025-11-13 (default reasoning)Open Source OpenAI | 72 | Apr 2026 | |
| 36 | Gemini-2.0-Flash-Thinking-Exp-01-21Open Source Google | 72 | Apr 2026 | |
| 37 | PLLuM-12B-nc-chat-250715Open Source PLLuM | 72 | Apr 2026 | |
| 38 | DeepSeek-V3.2-SpecialeOpen Source DeepSeek | 71 | Apr 2026 | |
| 39 | Grok-3-BetaOpen Source xAI | 71 | Apr 2026 | |
| 40 | GPT-5.2-2025-12-11 (no reasoning)Open Source OpenAI | 70 | Apr 2026 | |
| 41 | Kimi-K2.5Open Source Moonshot.AI | 69 | Apr 2026 | |
| 42 | Bielik-11B-v3.0-InstructOpen Source SpeakLeash | 69 | Apr 2026 | |
| 43 | DeepSeek-v3.1 (thinking)Open Source DeepSeek | 69 | Apr 2026 | |
| 44 | Gemini-2.0-Flash-ExperimentalOpen Source Google | 68 | Apr 2026 | |
| 45 | Claude-Sonnet-4.6Open Source Anthropic | 67 | Apr 2026 | |
| 46 | Claude-Opus-4.1Open Source Anthropic | 67 | Apr 2026 | |
| 47 | DeepSeek-R1Open Source DeepSeek | 66 | Apr 2026 | |
| 48 | GLM-5API Zhipu AI | 66 | Apr 2026 | |
| 49 | DeepSeek-R1-0528Open Source DeepSeek | 65 | Apr 2026 | |
| 50 | Llama-3.1-Tulu-3-405BOpen Source Meta | 64 | Apr 2026 | |
| 51 | DeepSeek-v3-0324Open Source DeepSeek | 64 | Apr 2026 | |
| 52 | MiMo-V2-ProOpen Source Xiaomi | 64 | Apr 2026 | |
| 53 | GLM-4.7Open Source Zhipu AI | 64 | Apr 2026 | |
| 54 | DeepSeek-v3.1 (no thinking)Open Source DeepSeek | 63 | Apr 2026 | |
| 55 | Mistral-Large-2512Open Source Mistral | 63 | Apr 2026 | |
| 56 | Kimi-K2-ThinkingOpen Source Moonshot.AI | 63 | Apr 2026 | |
| 57 | Qwen3.5-397B-A17BOpen Source Alibaba | 63 | Apr 2026 | |
| 58 | O4-Mini-2025-04-16Open Source OpenAI | 62 | Apr 2026 | |
| 59 | Gemini-Pro-1.5Open Source Google | 62 | Apr 2026 | |
| 60 | GPT-5-mini-2025-08-07Open Source OpenAI | 62 | Apr 2026 | |
| 61 | Claude-Sonnet-4.5Open Source Anthropic | 61 | Apr 2026 | |
| 62 | DeepSeek-V3.2Open Source DeepSeek | 61 | Apr 2026 | |
| 63 | GPT-5.4-mini-2026-03-17 (no reasoning)Open Source OpenAI | 61 | Apr 2026 | |
| 64 | Grok-3-Mini-BetaOpen Source xAI | 61 | Apr 2026 | |
| 65 | DeepSeek-v3Open Source DeepSeek | 61 | Apr 2026 | |
| 66 | Bielik-2.6Open Source SpeakLeash | 61 | Apr 2026 | |
| 67 | GPT-4-turboAPI OpenAI | 61 | Apr 2026 | |
| 68 | PLLuM-12B-nc-chatOpen Source PLLuM | 59 | Apr 2026 | |
| 69 | DeepSeek-v3.2-ExpOpen Source DeepSeek | 59 | Apr 2026 | |
| 70 | Grok-4-FastOpen Source xAI | 59 | Apr 2026 | |
| 71 | GLM-4.6Open Source Zhipu AI | 59 | Apr 2026 | |
| 72 | Bielik-2.3Open Source SpeakLeash | 58 | Apr 2026 | |
| 73 | Grok-2-1212Open Source xAI | 57 | Apr 2026 | |
| 74 | Mistral-Medium-3Open Source Mistral | 56 | Apr 2026 | |
| 75 | Llama-3.1-405bOpen Source Meta | 56 | Apr 2026 | |
| 76 | GLM-4.5Open Source Zhipu AI | 56 | Apr 2026 | |
| 77 | Bielik-2.1Open Source SpeakLeash | 55 | Apr 2026 | |
| 78 | Claude-Sonnet-4API Anthropic | 55 | Apr 2026 | |
| 79 | Grok-4.20Open Source xAI | 55 | Apr 2026 | |
| 80 | Llama-PLLuM-70B-chat-250801Open Source PLLuM | 54 | Apr 2026 | |
| 81 | Grok-4.1-FastOpen Source xAI | 54 | Apr 2026 | |
| 82 | Bielik-2.2Open Source SpeakLeash | 54 | Apr 2026 | |
| 83 | Kimi-K2-0905Open Source Moonshot.AI | 54 | Apr 2026 | |
| 84 | Qwen3.5-122B-A10BOpen Source Alibaba | 53 | Apr 2026 | |
| 85 | Mistral-Small-4Open Source Mistral | 53 | Apr 2026 | |
| 86 | Bielik-2.5Open Source SpeakLeash | 52 | Apr 2026 | |
| 87 | GPT-4.1-mini-2025-04-14Open Source OpenAI | 51 | Apr 2026 | |
| 88 | Qwen3-MaxOpen Source Alibaba | 50 | Apr 2026 | |
| 89 | Kimi-K2Open Source Moonshot.AI | 50 | Apr 2026 | |
| 90 | GPT-5.4-nano-2026-03-17 (high reasoning)Open Source OpenAI | 50 | Apr 2026 | |
| 91 | GPT-4 OpenAI | 49 | Apr 2026 | |
| 92 | Llama-PLLuM-70B-chatOpen Source PLLuM | 49 | Apr 2026 | |
| 93 | Mistral-Large-2407Open Source Mistral | 48 | Apr 2026 | |
| 94 | PLLuM-12B-chatOpen Source PLLuM | 48 | Apr 2026 | |
| 95 | GLM-4.5-AirOpen Source Zhipu AI | 48 | Apr 2026 | |
| 96 | GPT-5-nano-2025-08-07Open Source OpenAI | 47 | Apr 2026 | |
| 97 | Llama-4-MaverickOpen Source Meta | 46 | Apr 2026 | |
| 98 | O3-mini-2025-01-31Open Source OpenAI | 46 | Apr 2026 | |
| 99 | Claude-3.0-SonnetOpen Source Anthropic | 46 | Apr 2026 | |
| 100 | WizardLM-2-8x22bOpen Source Microsoft | 45 | Apr 2026 | |
| 101 | PLLuM-8x7B-chatOpen Source PLLuM | 45 | Apr 2026 | |
| 102 | Mixtral-8x22bOpen Source Mistral | 45 | Apr 2026 | |
| 103 | Command-A-03-2025Open Source Cohere | 44 | Apr 2026 | |
| 104 | Qwen3.5-35B-A3BOpen Source Alibaba | 44 | Apr 2026 | |
| 105 | Command-R-Plus-08-2024Open Source Cohere | 44 | Apr 2026 | |
| 106 | Gemma-3-27b Google | 43 | Apr 2026 | |
| 107 | Qwen3-Next-80B-A3B-ThinkingOpen Source Alibaba | 43 | Apr 2026 | |
| 108 | Llama-3.3-70BOpen Source Meta | 43 | Apr 2026 | |
| 109 | Bielik-0.1Open Source SpeakLeash | 43 | Apr 2026 | |
| 110 | MiniMax-M2.7Open Source MiniMaxAI | 43 | Apr 2026 | |
| 111 | Qwen-MaxOpen Source Alibaba | 43 | Apr 2026 | |
| 112 | Claude-3.5-Haiku-20241022Open Source Anthropic | 43 | Apr 2026 | |
| 113 | GPT-OSS-120bOpen Source OpenAI | 42 | Apr 2026 | |
| 114 | Llama-3.1-70BOpen Source Meta | 42 | Apr 2026 | |
| 115 | GPT-4o-mini-2024-07-18Open Source OpenAI | 42 | Apr 2026 | |
| 116 | Llama-3.0-70BOpen Source Meta | 40 | Apr 2026 | |
| 117 | Command-R-Plus-04-2024Open Source Cohere | 39 | Apr 2026 | |
| 118 | GPT-3.5-turboOpen Source OpenAI | 39 | Apr 2026 | |
| 119 | Bielik-Minitron-7B-v3.0-InstructOpen Source SpeakLeash | 39 | Apr 2026 | |
| 120 | Mistral-Large-2411Open Source Mistral | 39 | Apr 2026 | |
| 121 | MiniMax-M2.5Open Source MiniMaxAI | 39 | Apr 2026 | |
| 122 | Mistral-Small-3.2-24B-2506Open Source Mistral | 38 | Apr 2026 | |
| 123 | O1-mini-2024-09-12Open Source OpenAI | 38 | Apr 2026 | |
| 124 | Qwen3.5-27BOpen Source Alibaba | 37 | Apr 2026 | |
| 125 | Qwen3-235B-A22B Alibaba | 37 | Apr 2026 | |
| 126 | Claude-Haiku-4.5Open Source Anthropic | 36 | Apr 2026 | |
| 127 | Mistral-Small-3.1-24B-2503Open Source Mistral | 35 | Apr 2026 | |
| 128 | Qwen3-Next-80B-A3B-InstructOpen Source Alibaba | 34 | Apr 2026 | |
| 129 | Mistral-Small-24B-2501Open Source Mistral | 33 | Apr 2026 | |
| 130 | Llama-PLLuM-8B-chatOpen Source PLLuM | 33 | Apr 2026 | |
| 131 | Gemini-Flash-1.5Open Source Google | 33 | Apr 2026 | |
| 132 | Gemma-2-27bOpen Source Google | 32 | Apr 2026 | |
| 133 | Mixtral-8x7bOpen Source Mistral | 31 | Apr 2026 | |
| 134 | GLM-4.7-FlashOpen Source Zhipu AI | 31 | Apr 2026 | |
| 135 | GPT-4.1-nano-2025-04-14Open Source OpenAI | 30 | Apr 2026 | |
| 136 | Magistral-Small-2506Open Source Mistral | 30 | Apr 2026 | |
| 137 | EuroLLM-9BOpen Source UTTER | 30 | Apr 2026 | |
| 138 | Bielik-4.5B-v3.0-InstructOpen Source SpeakLeash | 28 | Apr 2026 | |
| 139 | Bielik-1.5B-v3.0-InstructOpen Source SpeakLeash | 27 | Apr 2026 | |
| 140 | Qwen-PlusOpen Source Alibaba | 26 | Apr 2026 | |
| 141 | GPT-5.4-nano-2026-03-17 (no reasoning)Open Source OpenAI | 26 | Apr 2026 | |
| 142 | Qwen-2.5-72bOpen Source Alibaba | 25 | Apr 2026 | |
| 143 | Ministral-14b-2512Open Source Mistral | 25 | Apr 2026 | |
| 144 | Llama-4-ScoutOpen Source Meta | 23 | Apr 2026 | |
| 145 | Phi-4 Microsoft | 23 | Apr 2026 | |
| 146 | Qwen3.5-9BOpen Source Alibaba | 22 | Apr 2026 | |
| 147 | Mistral-7b-v0.3Open Source Mistral | 22 | Apr 2026 | |
| 148 | Qwen-2.5-14bOpen Source Alibaba | 21 | Apr 2026 | |
| 149 | Qwen3-32BOpen Source Alibaba | 21 | Apr 2026 | |
| 150 | Mistral-NemoOpen Source Mistral | 20 | Apr 2026 | |
| 151 | Ministral-8b-2512Open Source Mistral | 20 | Apr 2026 | |
| 152 | Qwen3-30B-A3BOpen Source Alibaba | 19 | Apr 2026 | |
| 153 | Llama-3.1-8BOpen Source Meta | 19 | Apr 2026 | |
| 154 | Gemma-2-9bOpen Source Google | 19 | Apr 2026 | |
| 155 | GPT-OSS-20bOpen Source OpenAI | 19 | Apr 2026 | |
| 156 | Qwen-2.5-32bOpen Source Alibaba | 17 | Apr 2026 | |
| 157 | Qwen-Turbo-2024-11-01Open Source Alibaba | 15 | Apr 2026 | |
| 158 | Command-R7BOpen Source Cohere | 14 | Apr 2026 | |
| 159 | Qwen3-14BOpen Source Alibaba | 14 | Apr 2026 | |
| 160 | Ministral-8bOpen Source Mistral | 14 | Apr 2026 | |
| 161 | Qwen3-8BOpen Source Alibaba | 12 | Apr 2026 | |
| 162 | Qwen3.5-4BOpen Source Alibaba | 12 | Apr 2026 | |
| 163 | Ministral-3b-2512Open Source Mistral | 11 | Apr 2026 | |
| 164 | Qwen3.5-2BOpen Source Alibaba | 5 | Apr 2026 | |
| 165 | Qwen-2.5-7bOpen Source Alibaba | 5 | Apr 2026 |
averagePrimary
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Gemini-3.1-Pro-PreviewOpen Source Google | 97 | Apr 2026 | |
| 2 | Gemini-3.0-Pro-PreviewOpen Source Google | 95.833333 | Apr 2026 | |
| 3 | GPT-5.4-2026-03-05 (high reasoning)Open Source OpenAI | 92.166667 | Apr 2026 | |
| 4 | Gemini-2.5-Pro-Preview-06-05Open Source Google | 92.166667 | Apr 2026 | |
| 5 | Gemini-3-Flash-PreviewOpen Source Google | 91.666667 | Apr 2026 | |
| 6 | GPT-5-Pro-2025-10-06 (high reasoning)Open Source OpenAI | 91 | Apr 2026 | |
| 7 | GPT-5.4-2026-03-05 (low reasoning)Open Source OpenAI | 90.5 | Apr 2026 | |
| 8 | Grok-4API xAI | 90.5 | Apr 2026 | |
| 9 | GPT-5-2025-08-07Open Source OpenAI | 89.5 | Apr 2026 | |
| 10 | Gemini-2.5-Pro-Exp-03-25Open Source Google | 89.5 | Apr 2026 | |
| 11 | GPT-5.2-2025-12-11 (xhigh reasoning)Open Source OpenAI | 89.333333 | Apr 2026 | |
| 12 | O3-2025-04-16Open Source OpenAI | 89.166667 | Apr 2026 | |
| 13 | O1-2024-12-17Open Source OpenAI | 89.166667 | Apr 2026 | |
| 14 | GPT-5.1-2025-11-13 (high reasoning)Open Source OpenAI | 88.833333 | Apr 2026 | |
| 15 | GPT-5.2-2025-12-11 (high reasoning)Open Source OpenAI | 87.166667 | Apr 2026 | |
| 16 | GPT-4.5-preview-2025-02-27Open Source OpenAI | 86.5 | Apr 2026 | |
| 17 | GPT-5.4-mini-2026-03-17 (high reasoning)Open Source OpenAI | 85.166667 | Apr 2026 | |
| 18 | GPT-5.2-2025-12-11 (medium reasoning)Open Source OpenAI | 85 | Apr 2026 | |
| 19 | GPT-5.4-2026-03-05 (no reasoning)Open Source OpenAI | 84.333333 | Apr 2026 | |
| 20 | Gemini-2.5-Flash-Preview-04-17Open Source Google | 83.5 | Apr 2026 | |
| 21 | Gemini-Exp-1206Open Source Google | 83 | Apr 2026 | |
| 22 | Claude-3.5-Sonnet-20241022Open Source Anthropic | 82.666667 | Apr 2026 | |
| 23 | GPT-4o-2024-05-13Open Source OpenAI | 82.333333 | Apr 2026 | |
| 24 | Claude-3.7-Sonnet-ThinkingOpen Source Anthropic | 82.166667 | Apr 2026 | |
| 25 | Claude-Opus-4.6Open Source Anthropic | 81.833333 | Apr 2026 | |
| 26 | Claude-3.7-SonnetOpen Source Anthropic | 81.5 | Apr 2026 | |
| 27 | GPT-4o-2024-08-06Open Source OpenAI | 81.333333 | Apr 2026 | |
| 28 | GPT-4o-2024-11-20Open Source OpenAI | 81.333333 | Apr 2026 | |
| 29 | DeepSeek-V3.2-SpecialeOpen Source DeepSeek | 81 | Apr 2026 | |
| 30 | Claude-3.5-Sonnet-20240620Open Source Anthropic | 80.666667 | Apr 2026 | |
| 31 | GPT-4.1-2025-04-14Open Source OpenAI | 80.333333 | Apr 2026 | |
| 32 | Claude-Opus-4.5Open Source Anthropic | 80.333333 | Apr 2026 | |
| 33 | GLM-5API Zhipu AI | 80 | Apr 2026 | |
| 34 | Claude-Opus-4.1Open Source Anthropic | 79 | Apr 2026 | |
| 35 | GPT-5.2-2025-12-11 (no reasoning)Open Source OpenAI | 78.833333 | Apr 2026 | |
| 36 | DeepSeek-v3.1 (thinking)Open Source DeepSeek | 78.666667 | Apr 2026 | |
| 37 | Claude-Opus-4API Anthropic | 78.666667 | Apr 2026 | |
| 38 | MiMo-V2-ProOpen Source Xiaomi | 78.5 | Apr 2026 | |
| 39 | Kimi-K2.5Open Source Moonshot.AI | 77.833333 | Apr 2026 | |
| 40 | GPT-5.1-2025-11-13 (default reasoning)Open Source OpenAI | 77.833333 | Apr 2026 | |
| 41 | Claude-Sonnet-4.6Open Source Anthropic | 77.666667 | Apr 2026 | |
| 42 | GPT-5-mini-2025-08-07Open Source OpenAI | 77.5 | Apr 2026 | |
| 43 | Grok-3-BetaOpen Source xAI | 77.166667 | Apr 2026 | |
| 44 | DeepSeek-R1-0528Open Source DeepSeek | 76.166667 | Apr 2026 | |
| 45 | DeepSeek-R1Open Source DeepSeek | 76 | Apr 2026 | |
| 46 | Qwen3.5-397B-A17BOpen Source Alibaba | 75 | Apr 2026 | |
| 47 | Gemini-2.0-Flash-Thinking-Exp-01-21Open Source Google | 74.833333 | Apr 2026 | |
| 48 | Gemini-2.0-Flash-ExperimentalOpen Source Google | 74.166667 | Apr 2026 | |
| 49 | Claude-3-OpusAPI Anthropic | 73.833333 | Apr 2026 | |
| 50 | GLM-4.7Open Source Zhipu AI | 73.5 | Apr 2026 | |
| 51 | GPT-5.4-mini-2026-03-17 (no reasoning)Open Source OpenAI | 73 | Apr 2026 | |
| 52 | O4-Mini-2025-04-16Open Source OpenAI | 72.833333 | Apr 2026 | |
| 53 | Grok-4.1-FastOpen Source xAI | 72.333333 | Apr 2026 | |
| 54 | DeepSeek-V3.2Open Source DeepSeek | 71.666667 | Apr 2026 | |
| 55 | Kimi-K2-ThinkingOpen Source Moonshot.AI | 71.666667 | Apr 2026 | |
| 56 | Grok-3-Mini-BetaOpen Source xAI | 71.333333 | Apr 2026 | |
| 57 | DeepSeek-v3-0324Open Source DeepSeek | 71 | Apr 2026 | |
| 58 | Claude-Sonnet-4.5Open Source Anthropic | 71 | Apr 2026 | |
| 59 | DeepSeek-v3.1 (no thinking)Open Source DeepSeek | 71 | Apr 2026 | |
| 60 | Bielik-11B-v3.0-InstructOpen Source SpeakLeash | 70.666667 | Apr 2026 | |
| 61 | Mistral-Large-2512Open Source Mistral | 70.666667 | Apr 2026 | |
| 62 | GLM-4.6Open Source Zhipu AI | 70.666667 | Apr 2026 | |
| 63 | Grok-4-FastOpen Source xAI | 70.166667 | Apr 2026 | |
| 64 | DeepSeek-v3.2-ExpOpen Source DeepSeek | 70 | Apr 2026 | |
| 65 | PLLuM-12B-nc-chat-250715Open Source PLLuM | 69.666667 | Apr 2026 | |
| 66 | Gemini-Pro-1.5Open Source Google | 69.666667 | Apr 2026 | |
| 67 | DeepSeek-v3Open Source DeepSeek | 69.166667 | Apr 2026 | |
| 68 | Qwen3.5-122B-A10BOpen Source Alibaba | 68.333333 | Apr 2026 | |
| 69 | Claude-Sonnet-4API Anthropic | 68.166667 | Apr 2026 | |
| 70 | PLLuM-8x7B-nc-chatOpen Source PLLuM | 68.166667 | Apr 2026 | |
| 71 | Grok-4.20Open Source xAI | 67.833333 | Apr 2026 | |
| 72 | GPT-4-turboAPI OpenAI | 67 | Apr 2026 | |
| 73 | Mistral-Medium-3Open Source Mistral | 66.833333 | Apr 2026 | |
| 74 | GLM-4.5Open Source Zhipu AI | 66.5 | Apr 2026 | |
| 75 | Grok-2-1212Open Source xAI | 66 | Apr 2026 | |
| 76 | GPT-5.4-nano-2026-03-17 (high reasoning)Open Source OpenAI | 65.833333 | Apr 2026 | |
| 77 | Bielik-2.6Open Source SpeakLeash | 65.5 | Apr 2026 | |
| 78 | Llama-3.1-Tulu-3-405BOpen Source Meta | 63.833333 | Apr 2026 | |
| 79 | MiniMax-M2.7Open Source MiniMaxAI | 63.333333 | Apr 2026 | |
| 80 | Bielik-2.2Open Source SpeakLeash | 63 | Apr 2026 | |
| 81 | GPT-5-nano-2025-08-07Open Source OpenAI | 62.5 | Apr 2026 | |
| 82 | GPT-4.1-mini-2025-04-14Open Source OpenAI | 62.166667 | Apr 2026 | |
| 83 | Bielik-2.3Open Source SpeakLeash | 62.166667 | Apr 2026 | |
| 84 | Kimi-K2Open Source Moonshot.AI | 62 | Apr 2026 | |
| 85 | Bielik-2.5Open Source SpeakLeash | 62 | Apr 2026 | |
| 86 | Qwen3-MaxOpen Source Alibaba | 61.333333 | Apr 2026 | |
| 87 | Kimi-K2-0905Open Source Moonshot.AI | 61 | Apr 2026 | |
| 88 | Bielik-2.1Open Source SpeakLeash | 61 | Apr 2026 | |
| 89 | Llama-3.1-405bOpen Source Meta | 60 | Apr 2026 | |
| 90 | MiniMax-M2.5Open Source MiniMaxAI | 59.666667 | Apr 2026 | |
| 91 | GPT-4 OpenAI | 59.5 | Apr 2026 | |
| 92 | PLLuM-12B-nc-chatOpen Source PLLuM | 59.5 | Apr 2026 | |
| 93 | O3-mini-2025-01-31Open Source OpenAI | 59.333333 | Apr 2026 | |
| 94 | Llama-PLLuM-70B-chatOpen Source PLLuM | 58.5 | Apr 2026 | |
| 95 | Llama-4-MaverickOpen Source Meta | 58.166667 | Apr 2026 | |
| 96 | Llama-PLLuM-70B-chat-250801Open Source PLLuM | 58 | Apr 2026 | |
| 97 | Claude-3.5-Haiku-20241022Open Source Anthropic | 57.833333 | Apr 2026 | |
| 98 | Qwen3.5-35B-A3BOpen Source Alibaba | 57 | Apr 2026 | |
| 99 | GPT-4o-mini-2024-07-18Open Source OpenAI | 56.833333 | Apr 2026 | |
| 100 | Claude-3.0-SonnetOpen Source Anthropic | 56.5 | Apr 2026 | |
| 101 | Mistral-Small-4Open Source Mistral | 56.333333 | Apr 2026 | |
| 102 | Command-A-03-2025Open Source Cohere | 56.166667 | Apr 2026 | |
| 103 | Qwen3-235B-A22B Alibaba | 55 | Apr 2026 | |
| 104 | GLM-4.5-AirOpen Source Zhipu AI | 54.666667 | Apr 2026 | |
| 105 | Qwen3-Next-80B-A3B-ThinkingOpen Source Alibaba | 54.333333 | Apr 2026 | |
| 106 | GPT-OSS-120bOpen Source OpenAI | 54.333333 | Apr 2026 | |
| 107 | Qwen3.5-27BOpen Source Alibaba | 54.333333 | Apr 2026 | |
| 108 | Mistral-Large-2407Open Source Mistral | 54.166667 | Apr 2026 | |
| 109 | PLLuM-8x7B-chatOpen Source PLLuM | 54.166667 | Apr 2026 | |
| 110 | Bielik-Minitron-7B-v3.0-InstructOpen Source SpeakLeash | 53 | Apr 2026 | |
| 111 | Mistral-Large-2411Open Source Mistral | 52 | Apr 2026 | |
| 112 | O1-mini-2024-09-12Open Source OpenAI | 51.666667 | Apr 2026 | |
| 113 | WizardLM-2-8x22bOpen Source Microsoft | 51.5 | Apr 2026 | |
| 114 | Qwen-MaxOpen Source Alibaba | 50.833333 | Apr 2026 | |
| 115 | Claude-Haiku-4.5Open Source Anthropic | 50.666667 | Apr 2026 | |
| 116 | Command-R-Plus-08-2024Open Source Cohere | 50.166667 | Apr 2026 | |
| 117 | Mixtral-8x22bOpen Source Mistral | 49.833333 | Apr 2026 | |
| 118 | Command-R-Plus-04-2024Open Source Cohere | 49.333333 | Apr 2026 | |
| 119 | Llama-3.3-70BOpen Source Meta | 48.833333 | Apr 2026 | |
| 120 | Llama-3.1-70BOpen Source Meta | 47.833333 | Apr 2026 | |
| 121 | Gemma-3-27b Google | 47.333333 | Apr 2026 | |
| 122 | PLLuM-12B-chatOpen Source PLLuM | 47 | Apr 2026 | |
| 123 | Bielik-0.1Open Source SpeakLeash | 46.666667 | Apr 2026 | |
| 124 | Gemini-Flash-1.5Open Source Google | 46.5 | Apr 2026 | |
| 125 | Mistral-Small-3.2-24B-2506Open Source Mistral | 46.166667 | Apr 2026 | |
| 126 | GPT-5.4-nano-2026-03-17 (no reasoning)Open Source OpenAI | 44.166667 | Apr 2026 | |
| 127 | GPT-4.1-nano-2025-04-14Open Source OpenAI | 43.666667 | Apr 2026 | |
| 128 | GPT-3.5-turboOpen Source OpenAI | 43.333333 | Apr 2026 | |
| 129 | Mistral-Small-3.1-24B-2503Open Source Mistral | 43.333333 | Apr 2026 | |
| 130 | Qwen3-Next-80B-A3B-InstructOpen Source Alibaba | 43 | Apr 2026 | |
| 131 | Llama-3.0-70BOpen Source Meta | 43 | Apr 2026 | |
| 132 | Gemma-2-27bOpen Source Google | 42.666667 | Apr 2026 | |
| 133 | GLM-4.7-FlashOpen Source Zhipu AI | 42.333333 | Apr 2026 | |
| 134 | Bielik-4.5B-v3.0-InstructOpen Source SpeakLeash | 42.333333 | Apr 2026 | |
| 135 | Llama-4-ScoutOpen Source Meta | 41.5 | Apr 2026 | |
| 136 | EuroLLM-9BOpen Source UTTER | 41 | Apr 2026 | |
| 137 | Qwen3.5-9BOpen Source Alibaba | 40.333333 | Apr 2026 | |
| 138 | Magistral-Small-2506Open Source Mistral | 39.333333 | Apr 2026 | |
| 139 | Qwen-2.5-72bOpen Source Alibaba | 39.166667 | Apr 2026 | |
| 140 | Ministral-14b-2512Open Source Mistral | 39 | Apr 2026 | |
| 141 | Mistral-Small-24B-2501Open Source Mistral | 39 | Apr 2026 | |
| 142 | Llama-PLLuM-8B-chatOpen Source PLLuM | 38.5 | Apr 2026 | |
| 143 | Qwen-PlusOpen Source Alibaba | 38.5 | Apr 2026 | |
| 144 | Qwen3-32BOpen Source Alibaba | 37.666667 | Apr 2026 | |
| 145 | Mixtral-8x7bOpen Source Mistral | 35.333333 | Apr 2026 | |
| 146 | Ministral-8b-2512Open Source Mistral | 35.166667 | Apr 2026 | |
| 147 | Qwen3-30B-A3BOpen Source Alibaba | 33 | Apr 2026 | |
| 148 | GPT-OSS-20bOpen Source OpenAI | 32.333333 | Apr 2026 | |
| 149 | Qwen-2.5-32bOpen Source Alibaba | 30.5 | Apr 2026 | |
| 150 | Qwen3-14BOpen Source Alibaba | 30.333333 | Apr 2026 | |
| 151 | Qwen3.5-4BOpen Source Alibaba | 29.666667 | Apr 2026 | |
| 152 | Phi-4 Microsoft | 29.166667 | Apr 2026 | |
| 153 | Gemma-2-9bOpen Source Google | 29.166667 | Apr 2026 | |
| 154 | Qwen-Turbo-2024-11-01Open Source Alibaba | 28.5 | Apr 2026 | |
| 155 | Bielik-1.5B-v3.0-InstructOpen Source SpeakLeash | 27.5 | Apr 2026 | |
| 156 | Qwen-2.5-14bOpen Source Alibaba | 26.666667 | Apr 2026 | |
| 157 | Qwen3-8BOpen Source Alibaba | 26 | Apr 2026 | |
| 158 | Mistral-NemoOpen Source Mistral | 23 | Apr 2026 | |
| 159 | Command-R7BOpen Source Cohere | 22.833333 | Apr 2026 | |
| 160 | Llama-3.1-8BOpen Source Meta | 22.666667 | Apr 2026 | |
| 161 | Ministral-3b-2512Open Source Mistral | 22.333333 | Apr 2026 | |
| 162 | Mistral-7b-v0.3Open Source Mistral | 21.833333 | Apr 2026 | |
| 163 | Ministral-8bOpen Source Mistral | 20.666667 | Apr 2026 | |
| 164 | Qwen-2.5-7bOpen Source Alibaba | 17.666667 | Apr 2026 | |
| 165 | Qwen3.5-2BOpen Source Alibaba | 13.833333 | Apr 2026 |
culture-and-tradition
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Gemini-3.1-Pro-PreviewOpen Source Google | 100 | Apr 2026 | |
| 2 | Gemini-3.0-Pro-PreviewOpen Source Google | 99 | Apr 2026 | |
| 3 | Gemini-3-Flash-PreviewOpen Source Google | 98 | Apr 2026 | |
| 4 | Gemini-2.5-Pro-Preview-06-05Open Source Google | 96 | Apr 2026 | |
| 5 | Grok-4API xAI | 95 | Apr 2026 | |
| 6 | GPT-5-Pro-2025-10-06 (high reasoning)Open Source OpenAI | 94 | Apr 2026 | |
| 7 | GPT-5.4-2026-03-05 (high reasoning)Open Source OpenAI | 93 | Apr 2026 | |
| 8 | GPT-5.2-2025-12-11 (xhigh reasoning)Open Source OpenAI | 93 | Apr 2026 | |
| 9 | GPT-5.4-2026-03-05 (low reasoning)Open Source OpenAI | 93 | Apr 2026 | |
| 10 | O1-2024-12-17Open Source OpenAI | 92 | Apr 2026 | |
| 11 | GPT-4o-2024-05-13Open Source OpenAI | 92 | Apr 2026 | |
| 12 | GPT-4.5-preview-2025-02-27Open Source OpenAI | 92 | Apr 2026 | |
| 13 | O3-2025-04-16Open Source OpenAI | 91 | Apr 2026 | |
| 14 | Gemini-2.5-Pro-Exp-03-25Open Source Google | 91 | Apr 2026 | |
| 15 | GPT-5.1-2025-11-13 (high reasoning)Open Source OpenAI | 90 | Apr 2026 | |
| 16 | Grok-3-BetaOpen Source xAI | 90 | Apr 2026 | |
| 17 | Gemini-Exp-1206Open Source Google | 90 | Apr 2026 | |
| 18 | GPT-4o-2024-08-06Open Source OpenAI | 89 | Apr 2026 | |
| 19 | GPT-5-2025-08-07Open Source OpenAI | 89 | Apr 2026 | |
| 20 | GPT-4o-2024-11-20Open Source OpenAI | 89 | Apr 2026 | |
| 21 | GPT-5.4-2026-03-05 (no reasoning)Open Source OpenAI | 88 | Apr 2026 | |
| 22 | Claude-3.5-Sonnet-20241022Open Source Anthropic | 87 | Apr 2026 | |
| 23 | GPT-5.2-2025-12-11 (high reasoning)Open Source OpenAI | 87 | Apr 2026 | |
| 24 | Claude-Opus-4.6Open Source Anthropic | 86 | Apr 2026 | |
| 25 | GPT-5.2-2025-12-11 (no reasoning)Open Source OpenAI | 86 | Apr 2026 | |
| 26 | Gemini-2.5-Flash-Preview-04-17Open Source Google | 85 | Apr 2026 | |
| 27 | Claude-3.5-Sonnet-20240620Open Source Anthropic | 85 | Apr 2026 | |
| 28 | GPT-4.1-2025-04-14Open Source OpenAI | 84 | Apr 2026 | |
| 29 | GPT-5.2-2025-12-11 (medium reasoning)Open Source OpenAI | 84 | Apr 2026 | |
| 30 | Claude-3.7-SonnetOpen Source Anthropic | 83 | Apr 2026 | |
| 31 | GPT-5.4-mini-2026-03-17 (high reasoning)Open Source OpenAI | 83 | Apr 2026 | |
| 32 | Claude-Opus-4.1Open Source Anthropic | 83 | Apr 2026 | |
| 33 | GPT-5.1-2025-11-13 (default reasoning)Open Source OpenAI | 82 | Apr 2026 | |
| 34 | Claude-Opus-4.5Open Source Anthropic | 82 | Apr 2026 | |
| 35 | Claude-Sonnet-4.6Open Source Anthropic | 82 | Apr 2026 | |
| 36 | Claude-3.7-Sonnet-ThinkingOpen Source Anthropic | 82 | Apr 2026 | |
| 37 | Claude-Opus-4API Anthropic | 81 | Apr 2026 | |
| 38 | GLM-5API Zhipu AI | 81 | Apr 2026 | |
| 39 | MiMo-V2-ProOpen Source Xiaomi | 79 | Apr 2026 | |
| 40 | GLM-4.7Open Source Zhipu AI | 79 | Apr 2026 | |
| 41 | Kimi-K2.5Open Source Moonshot.AI | 78 | Apr 2026 | |
| 42 | DeepSeek-V3.2Open Source DeepSeek | 78 | Apr 2026 | |
| 43 | Bielik-11B-v3.0-InstructOpen Source SpeakLeash | 78 | Apr 2026 | |
| 44 | Gemini-2.0-Flash-ExperimentalOpen Source Google | 78 | Apr 2026 | |
| 45 | Gemini-Pro-1.5Open Source Google | 77 | Apr 2026 | |
| 46 | DeepSeek-v3.1 (thinking)Open Source DeepSeek | 76 | Apr 2026 | |
| 47 | PLLuM-8x7B-nc-chatOpen Source PLLuM | 76 | Apr 2026 | |
| 48 | Gemini-2.0-Flash-Thinking-Exp-01-21Open Source Google | 76 | Apr 2026 | |
| 49 | GLM-4.6Open Source Zhipu AI | 76 | Apr 2026 | |
| 50 | DeepSeek-V3.2-SpecialeOpen Source DeepSeek | 76 | Apr 2026 | |
| 51 | Claude-3-OpusAPI Anthropic | 76 | Apr 2026 | |
| 52 | DeepSeek-v3-0324Open Source DeepSeek | 76 | Apr 2026 | |
| 53 | DeepSeek-R1Open Source DeepSeek | 75 | Apr 2026 | |
| 54 | PLLuM-12B-nc-chat-250715Open Source PLLuM | 75 | Apr 2026 | |
| 55 | DeepSeek-R1-0528Open Source DeepSeek | 75 | Apr 2026 | |
| 56 | Mistral-Large-2512Open Source Mistral | 75 | Apr 2026 | |
| 57 | Grok-4.1-FastOpen Source xAI | 74 | Apr 2026 | |
| 58 | GPT-5-mini-2025-08-07Open Source OpenAI | 74 | Apr 2026 | |
| 59 | GPT-4-turboAPI OpenAI | 74 | Apr 2026 | |
| 60 | DeepSeek-v3Open Source DeepSeek | 73 | Apr 2026 | |
| 61 | O4-Mini-2025-04-16Open Source OpenAI | 73 | Apr 2026 | |
| 62 | Qwen3.5-397B-A17BOpen Source Alibaba | 73 | Apr 2026 | |
| 63 | GPT-5.4-mini-2026-03-17 (no reasoning)Open Source OpenAI | 73 | Apr 2026 | |
| 64 | Claude-Sonnet-4.5Open Source Anthropic | 72 | Apr 2026 | |
| 65 | Claude-Sonnet-4API Anthropic | 72 | Apr 2026 | |
| 66 | Kimi-K2-ThinkingOpen Source Moonshot.AI | 71 | Apr 2026 | |
| 67 | Grok-4-FastOpen Source xAI | 71 | Apr 2026 | |
| 68 | DeepSeek-v3.2-ExpOpen Source DeepSeek | 71 | Apr 2026 | |
| 69 | DeepSeek-v3.1 (no thinking)Open Source DeepSeek | 69 | Apr 2026 | |
| 70 | Bielik-2.6Open Source SpeakLeash | 68 | Apr 2026 | |
| 71 | GLM-4.5Open Source Zhipu AI | 68 | Apr 2026 | |
| 72 | Mistral-Medium-3Open Source Mistral | 67 | Apr 2026 | |
| 73 | Grok-2-1212Open Source xAI | 67 | Apr 2026 | |
| 74 | Grok-3-Mini-BetaOpen Source xAI | 67 | Apr 2026 | |
| 75 | Kimi-K2Open Source Moonshot.AI | 67 | Apr 2026 | |
| 76 | Grok-4.20Open Source xAI | 65 | Apr 2026 | |
| 77 | PLLuM-12B-nc-chatOpen Source PLLuM | 65 | Apr 2026 | |
| 78 | Bielik-2.1Open Source SpeakLeash | 64 | Apr 2026 | |
| 79 | Llama-PLLuM-70B-chatOpen Source PLLuM | 64 | Apr 2026 | |
| 80 | Llama-3.1-Tulu-3-405BOpen Source Meta | 64 | Apr 2026 | |
| 81 | Kimi-K2-0905Open Source Moonshot.AI | 63 | Apr 2026 | |
| 82 | GPT-4 OpenAI | 63 | Apr 2026 | |
| 83 | GPT-4.1-mini-2025-04-14Open Source OpenAI | 62 | Apr 2026 | |
| 84 | Qwen3.5-122B-A10BOpen Source Alibaba | 62 | Apr 2026 | |
| 85 | Llama-PLLuM-70B-chat-250801Open Source PLLuM | 62 | Apr 2026 | |
| 86 | Claude-3.5-Haiku-20241022Open Source Anthropic | 62 | Apr 2026 | |
| 87 | Bielik-2.5Open Source SpeakLeash | 61 | Apr 2026 | |
| 88 | Bielik-2.3Open Source SpeakLeash | 61 | Apr 2026 | |
| 89 | Bielik-2.2Open Source SpeakLeash | 60 | Apr 2026 | |
| 90 | PLLuM-8x7B-chatOpen Source PLLuM | 60 | Apr 2026 | |
| 91 | MiniMax-M2.7Open Source MiniMaxAI | 59 | Apr 2026 | |
| 92 | MiniMax-M2.5Open Source MiniMaxAI | 59 | Apr 2026 | |
| 93 | GPT-5-nano-2025-08-07Open Source OpenAI | 59 | Apr 2026 | |
| 94 | GPT-4o-mini-2024-07-18Open Source OpenAI | 57 | Apr 2026 | |
| 95 | GPT-5.4-nano-2026-03-17 (high reasoning)Open Source OpenAI | 57 | Apr 2026 | |
| 96 | Llama-3.1-405bOpen Source Meta | 57 | Apr 2026 | |
| 97 | Bielik-Minitron-7B-v3.0-InstructOpen Source SpeakLeash | 57 | Apr 2026 | |
| 98 | Qwen3-MaxOpen Source Alibaba | 57 | Apr 2026 | |
| 99 | Command-A-03-2025Open Source Cohere | 55 | Apr 2026 | |
| 100 | Gemma-3-27b Google | 55 | Apr 2026 | |
| 101 | Claude-3.0-SonnetOpen Source Anthropic | 53 | Apr 2026 | |
| 102 | Mistral-Large-2407Open Source Mistral | 52 | Apr 2026 | |
| 103 | Bielik-0.1Open Source SpeakLeash | 52 | Apr 2026 | |
| 104 | Mistral-Large-2411Open Source Mistral | 52 | Apr 2026 | |
| 105 | Claude-Haiku-4.5Open Source Anthropic | 52 | Apr 2026 | |
| 106 | Command-R-Plus-04-2024Open Source Cohere | 52 | Apr 2026 | |
| 107 | Llama-4-MaverickOpen Source Meta | 52 | Apr 2026 | |
| 108 | GLM-4.5-AirOpen Source Zhipu AI | 51 | Apr 2026 | |
| 109 | O3-mini-2025-01-31Open Source OpenAI | 51 | Apr 2026 | |
| 110 | WizardLM-2-8x22bOpen Source Microsoft | 50 | Apr 2026 | |
| 111 | Qwen-MaxOpen Source Alibaba | 50 | Apr 2026 | |
| 112 | PLLuM-12B-chatOpen Source PLLuM | 49 | Apr 2026 | |
| 113 | Command-R-Plus-08-2024Open Source Cohere | 49 | Apr 2026 | |
| 114 | Mistral-Small-4Open Source Mistral | 49 | Apr 2026 | |
| 115 | Qwen3.5-35B-A3BOpen Source Alibaba | 46 | Apr 2026 | |
| 116 | Qwen3.5-27BOpen Source Alibaba | 46 | Apr 2026 | |
| 117 | GPT-OSS-120bOpen Source OpenAI | 46 | Apr 2026 | |
| 118 | Qwen3-235B-A22B Alibaba | 45 | Apr 2026 | |
| 119 | Qwen3-Next-80B-A3B-ThinkingOpen Source Alibaba | 45 | Apr 2026 | |
| 120 | GPT-5.4-nano-2026-03-17 (no reasoning)Open Source OpenAI | 44 | Apr 2026 | |
| 121 | Bielik-4.5B-v3.0-InstructOpen Source SpeakLeash | 44 | Apr 2026 | |
| 122 | O1-mini-2024-09-12Open Source OpenAI | 44 | Apr 2026 | |
| 123 | Mixtral-8x22bOpen Source Mistral | 41 | Apr 2026 | |
| 124 | Gemini-Flash-1.5Open Source Google | 41 | Apr 2026 | |
| 125 | Llama-3.1-70BOpen Source Meta | 41 | Apr 2026 | |
| 126 | Gemma-2-27bOpen Source Google | 41 | Apr 2026 | |
| 127 | GPT-4.1-nano-2025-04-14Open Source OpenAI | 40 | Apr 2026 | |
| 128 | EuroLLM-9BOpen Source UTTER | 40 | Apr 2026 | |
| 129 | Llama-3.3-70BOpen Source Meta | 40 | Apr 2026 | |
| 130 | GLM-4.7-FlashOpen Source Zhipu AI | 40 | Apr 2026 | |
| 131 | Mistral-Small-3.2-24B-2506Open Source Mistral | 39 | Apr 2026 | |
| 132 | Mistral-Small-3.1-24B-2503Open Source Mistral | 39 | Apr 2026 | |
| 133 | GPT-3.5-turboOpen Source OpenAI | 38 | Apr 2026 | |
| 134 | Llama-3.0-70BOpen Source Meta | 38 | Apr 2026 | |
| 135 | Qwen3-Next-80B-A3B-InstructOpen Source Alibaba | 36 | Apr 2026 | |
| 136 | Qwen3.5-9BOpen Source Alibaba | 36 | Apr 2026 | |
| 137 | Llama-4-ScoutOpen Source Meta | 35 | Apr 2026 | |
| 138 | Llama-PLLuM-8B-chatOpen Source PLLuM | 34 | Apr 2026 | |
| 139 | Qwen-PlusOpen Source Alibaba | 32 | Apr 2026 | |
| 140 | Ministral-8b-2512Open Source Mistral | 30 | Apr 2026 | |
| 141 | Qwen-2.5-72bOpen Source Alibaba | 30 | Apr 2026 | |
| 142 | Qwen3-30B-A3BOpen Source Alibaba | 30 | Apr 2026 | |
| 143 | Mistral-Small-24B-2501Open Source Mistral | 29 | Apr 2026 | |
| 144 | Ministral-14b-2512Open Source Mistral | 29 | Apr 2026 | |
| 145 | Magistral-Small-2506Open Source Mistral | 29 | Apr 2026 | |
| 146 | Qwen3-32BOpen Source Alibaba | 28 | Apr 2026 | |
| 147 | Mixtral-8x7bOpen Source Mistral | 27 | Apr 2026 | |
| 148 | GPT-OSS-20bOpen Source OpenAI | 26 | Apr 2026 | |
| 149 | Bielik-1.5B-v3.0-InstructOpen Source SpeakLeash | 25 | Apr 2026 | |
| 150 | Qwen3.5-4BOpen Source Alibaba | 24 | Apr 2026 | |
| 151 | Gemma-2-9bOpen Source Google | 23 | Apr 2026 | |
| 152 | Qwen-2.5-32bOpen Source Alibaba | 21 | Apr 2026 | |
| 153 | Qwen-Turbo-2024-11-01Open Source Alibaba | 20 | Apr 2026 | |
| 154 | Command-R7BOpen Source Cohere | 18 | Apr 2026 | |
| 155 | Qwen-2.5-14bOpen Source Alibaba | 17 | Apr 2026 | |
| 156 | Ministral-3b-2512Open Source Mistral | 17 | Apr 2026 | |
| 157 | Phi-4 Microsoft | 17 | Apr 2026 | |
| 158 | Qwen3-14BOpen Source Alibaba | 16 | Apr 2026 | |
| 159 | Qwen3.5-2BOpen Source Alibaba | 13 | Apr 2026 | |
| 160 | Qwen3-8BOpen Source Alibaba | 13 | Apr 2026 | |
| 161 | Llama-3.1-8BOpen Source Meta | 13 | Apr 2026 | |
| 162 | Mistral-NemoOpen Source Mistral | 13 | Apr 2026 | |
| 163 | Ministral-8bOpen Source Mistral | 12 | Apr 2026 | |
| 164 | Qwen-2.5-7bOpen Source Alibaba | 11 | Apr 2026 | |
| 165 | Mistral-7b-v0.3Open Source Mistral | 9 | Apr 2026 |
geography
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Gemini-3.1-Pro-PreviewOpen Source Google | 100 | Apr 2026 | |
| 2 | Gemini-3.0-Pro-PreviewOpen Source Google | 100 | Apr 2026 | |
| 3 | Gemini-2.5-Pro-Preview-06-05Open Source Google | 98 | Apr 2026 | |
| 4 | O3-2025-04-16Open Source OpenAI | 97 | Apr 2026 | |
| 5 | GPT-5.1-2025-11-13 (high reasoning)Open Source OpenAI | 97 | Apr 2026 | |
| 6 | GPT-5-2025-08-07Open Source OpenAI | 97 | Apr 2026 | |
| 7 | GPT-5.4-2026-03-05 (low reasoning)Open Source OpenAI | 97 | Apr 2026 | |
| 8 | Gemini-2.5-Pro-Exp-03-25Open Source Google | 97 | Apr 2026 | |
| 9 | GPT-5-Pro-2025-10-06 (high reasoning)Open Source OpenAI | 96 | Apr 2026 | |
| 10 | GPT-5.4-2026-03-05 (high reasoning)Open Source OpenAI | 96 | Apr 2026 | |
| 11 | Gemini-3-Flash-PreviewOpen Source Google | 96 | Apr 2026 | |
| 12 | O1-2024-12-17Open Source OpenAI | 95 | Apr 2026 | |
| 13 | GPT-5.2-2025-12-11 (high reasoning)Open Source OpenAI | 95 | Apr 2026 | |
| 14 | GPT-5.2-2025-12-11 (xhigh reasoning)Open Source OpenAI | 94 | Apr 2026 | |
| 15 | GPT-5-mini-2025-08-07Open Source OpenAI | 94 | Apr 2026 | |
| 16 | DeepSeek-V3.2-SpecialeOpen Source DeepSeek | 94 | Apr 2026 | |
| 17 | Grok-4API xAI | 94 | Apr 2026 | |
| 18 | GPT-5.2-2025-12-11 (medium reasoning)Open Source OpenAI | 94 | Apr 2026 | |
| 19 | Gemini-2.5-Flash-Preview-04-17Open Source Google | 94 | Apr 2026 | |
| 20 | GPT-5.4-mini-2026-03-17 (high reasoning)Open Source OpenAI | 92 | Apr 2026 | |
| 21 | GLM-5API Zhipu AI | 91 | Apr 2026 | |
| 22 | GPT-4.5-preview-2025-02-27Open Source OpenAI | 90 | Apr 2026 | |
| 23 | GPT-4o-2024-05-13Open Source OpenAI | 89 | Apr 2026 | |
| 24 | DeepSeek-v3.1 (thinking)Open Source DeepSeek | 89 | Apr 2026 | |
| 25 | GPT-4.1-2025-04-14Open Source OpenAI | 89 | Apr 2026 | |
| 26 | MiMo-V2-ProOpen Source Xiaomi | 89 | Apr 2026 | |
| 27 | Claude-Opus-4.6Open Source Anthropic | 88 | Apr 2026 | |
| 28 | GPT-5.4-2026-03-05 (no reasoning)Open Source OpenAI | 88 | Apr 2026 | |
| 29 | GPT-4o-2024-08-06Open Source OpenAI | 88 | Apr 2026 | |
| 30 | GLM-4.7Open Source Zhipu AI | 88 | Apr 2026 | |
| 31 | O4-Mini-2025-04-16Open Source OpenAI | 88 | Apr 2026 | |
| 32 | Claude-3.7-Sonnet-ThinkingOpen Source Anthropic | 87 | Apr 2026 | |
| 33 | Claude-3.7-SonnetOpen Source Anthropic | 87 | Apr 2026 | |
| 34 | Claude-3.5-Sonnet-20240620Open Source Anthropic | 86 | Apr 2026 | |
| 35 | Claude-Opus-4.1Open Source Anthropic | 86 | Apr 2026 | |
| 36 | GPT-4o-2024-11-20Open Source OpenAI | 86 | Apr 2026 | |
| 37 | Kimi-K2.5Open Source Moonshot.AI | 86 | Apr 2026 | |
| 38 | Gemini-Exp-1206Open Source Google | 86 | Apr 2026 | |
| 39 | GPT-5.2-2025-12-11 (no reasoning)Open Source OpenAI | 86 | Apr 2026 | |
| 40 | GPT-5.1-2025-11-13 (default reasoning)Open Source OpenAI | 86 | Apr 2026 | |
| 41 | Grok-4.1-FastOpen Source xAI | 85 | Apr 2026 | |
| 42 | DeepSeek-R1-0528Open Source DeepSeek | 85 | Apr 2026 | |
| 43 | Qwen3.5-397B-A17BOpen Source Alibaba | 85 | Apr 2026 | |
| 44 | Claude-3.5-Sonnet-20241022Open Source Anthropic | 85 | Apr 2026 | |
| 45 | Kimi-K2-ThinkingOpen Source Moonshot.AI | 84 | Apr 2026 | |
| 46 | DeepSeek-R1Open Source DeepSeek | 84 | Apr 2026 | |
| 47 | Gemini-2.0-Flash-Thinking-Exp-01-21Open Source Google | 84 | Apr 2026 | |
| 48 | Claude-Opus-4.5Open Source Anthropic | 84 | Apr 2026 | |
| 49 | Grok-3-Mini-BetaOpen Source xAI | 84 | Apr 2026 | |
| 50 | Claude-Opus-4API Anthropic | 83 | Apr 2026 | |
| 51 | Grok-3-BetaOpen Source xAI | 83 | Apr 2026 | |
| 52 | Qwen3.5-122B-A10BOpen Source Alibaba | 83 | Apr 2026 | |
| 53 | GLM-4.6Open Source Zhipu AI | 82 | Apr 2026 | |
| 54 | GPT-5.4-mini-2026-03-17 (no reasoning)Open Source OpenAI | 82 | Apr 2026 | |
| 55 | MiniMax-M2.7Open Source MiniMaxAI | 82 | Apr 2026 | |
| 56 | DeepSeek-v3.1 (no thinking)Open Source DeepSeek | 82 | Apr 2026 | |
| 57 | Claude-Sonnet-4.6Open Source Anthropic | 81 | Apr 2026 | |
| 58 | DeepSeek-v3.2-ExpOpen Source DeepSeek | 80 | Apr 2026 | |
| 59 | GPT-5-nano-2025-08-07Open Source OpenAI | 80 | Apr 2026 | |
| 60 | Claude-3-OpusAPI Anthropic | 80 | Apr 2026 | |
| 61 | PLLuM-12B-nc-chat-250715Open Source PLLuM | 79 | Apr 2026 | |
| 62 | GPT-4-turboAPI OpenAI | 79 | Apr 2026 | |
| 63 | Gemini-2.0-Flash-ExperimentalOpen Source Google | 79 | Apr 2026 | |
| 64 | DeepSeek-v3Open Source DeepSeek | 79 | Apr 2026 | |
| 65 | GLM-4.5Open Source Zhipu AI | 79 | Apr 2026 | |
| 66 | Grok-4-FastOpen Source xAI | 79 | Apr 2026 | |
| 67 | Claude-Sonnet-4.5Open Source Anthropic | 79 | Apr 2026 | |
| 68 | DeepSeek-v3-0324Open Source DeepSeek | 78 | Apr 2026 | |
| 69 | O3-mini-2025-01-31Open Source OpenAI | 78 | Apr 2026 | |
| 70 | DeepSeek-V3.2Open Source DeepSeek | 78 | Apr 2026 | |
| 71 | Mistral-Medium-3Open Source Mistral | 77 | Apr 2026 | |
| 72 | Claude-Sonnet-4API Anthropic | 77 | Apr 2026 | |
| 73 | Grok-2-1212Open Source xAI | 77 | Apr 2026 | |
| 74 | GPT-5.4-nano-2026-03-17 (high reasoning)Open Source OpenAI | 77 | Apr 2026 | |
| 75 | Mistral-Large-2512Open Source Mistral | 76 | Apr 2026 | |
| 76 | Bielik-11B-v3.0-InstructOpen Source SpeakLeash | 75 | Apr 2026 | |
| 77 | Qwen3-MaxOpen Source Alibaba | 75 | Apr 2026 | |
| 78 | GPT-4.1-mini-2025-04-14Open Source OpenAI | 75 | Apr 2026 | |
| 79 | Bielik-2.6Open Source SpeakLeash | 75 | Apr 2026 | |
| 80 | Llama-3.1-405bOpen Source Meta | 74 | Apr 2026 | |
| 81 | Grok-4.20Open Source xAI | 74 | Apr 2026 | |
| 82 | Gemini-Pro-1.5Open Source Google | 74 | Apr 2026 | |
| 83 | Qwen3.5-35B-A3BOpen Source Alibaba | 73 | Apr 2026 | |
| 84 | PLLuM-8x7B-nc-chatOpen Source PLLuM | 73 | Apr 2026 | |
| 85 | Bielik-2.5Open Source SpeakLeash | 72 | Apr 2026 | |
| 86 | Claude-3.5-Haiku-20241022Open Source Anthropic | 72 | Apr 2026 | |
| 87 | Bielik-2.2Open Source SpeakLeash | 72 | Apr 2026 | |
| 88 | Llama-4-MaverickOpen Source Meta | 71 | Apr 2026 | |
| 89 | GPT-OSS-120bOpen Source OpenAI | 71 | Apr 2026 | |
| 90 | Llama-3.1-Tulu-3-405BOpen Source Meta | 71 | Apr 2026 | |
| 91 | Kimi-K2Open Source Moonshot.AI | 70 | Apr 2026 | |
| 92 | PLLuM-12B-nc-chatOpen Source PLLuM | 70 | Apr 2026 | |
| 93 | GPT-4o-mini-2024-07-18Open Source OpenAI | 69 | Apr 2026 | |
| 94 | Qwen3-235B-A22B Alibaba | 69 | Apr 2026 | |
| 95 | MiniMax-M2.5Open Source MiniMaxAI | 68 | Apr 2026 | |
| 96 | Bielik-2.3Open Source SpeakLeash | 68 | Apr 2026 | |
| 97 | Llama-PLLuM-70B-chatOpen Source PLLuM | 68 | Apr 2026 | |
| 98 | Bielik-2.1Open Source SpeakLeash | 68 | Apr 2026 | |
| 99 | GPT-4 OpenAI | 67 | Apr 2026 | |
| 100 | Kimi-K2-0905Open Source Moonshot.AI | 67 | Apr 2026 | |
| 101 | Command-A-03-2025Open Source Cohere | 67 | Apr 2026 | |
| 102 | O1-mini-2024-09-12Open Source OpenAI | 66 | Apr 2026 | |
| 103 | PLLuM-8x7B-chatOpen Source PLLuM | 66 | Apr 2026 | |
| 104 | Claude-3.0-SonnetOpen Source Anthropic | 65 | Apr 2026 | |
| 105 | Mistral-Small-4Open Source Mistral | 64 | Apr 2026 | |
| 106 | GLM-4.5-AirOpen Source Zhipu AI | 64 | Apr 2026 | |
| 107 | Qwen3-Next-80B-A3B-ThinkingOpen Source Alibaba | 64 | Apr 2026 | |
| 108 | Qwen3.5-27BOpen Source Alibaba | 64 | Apr 2026 | |
| 109 | Llama-PLLuM-70B-chat-250801Open Source PLLuM | 63 | Apr 2026 | |
| 110 | Mistral-Large-2407Open Source Mistral | 63 | Apr 2026 | |
| 111 | Bielik-Minitron-7B-v3.0-InstructOpen Source SpeakLeash | 62 | Apr 2026 | |
| 112 | Command-R-Plus-08-2024Open Source Cohere | 61 | Apr 2026 | |
| 113 | Bielik-0.1Open Source SpeakLeash | 61 | Apr 2026 | |
| 114 | Gemini-Flash-1.5Open Source Google | 61 | Apr 2026 | |
| 115 | Mistral-Large-2411Open Source Mistral | 61 | Apr 2026 | |
| 116 | WizardLM-2-8x22bOpen Source Microsoft | 60 | Apr 2026 | |
| 117 | Mixtral-8x22bOpen Source Mistral | 59 | Apr 2026 | |
| 118 | GPT-4.1-nano-2025-04-14Open Source OpenAI | 59 | Apr 2026 | |
| 119 | Llama-3.3-70BOpen Source Meta | 59 | Apr 2026 | |
| 120 | Llama-3.1-70BOpen Source Meta | 58 | Apr 2026 | |
| 121 | GPT-3.5-turboOpen Source OpenAI | 55 | Apr 2026 | |
| 122 | GLM-4.7-FlashOpen Source Zhipu AI | 55 | Apr 2026 | |
| 123 | PLLuM-12B-chatOpen Source PLLuM | 54 | Apr 2026 | |
| 124 | EuroLLM-9BOpen Source UTTER | 54 | Apr 2026 | |
| 125 | Qwen-MaxOpen Source Alibaba | 53 | Apr 2026 | |
| 126 | Command-R-Plus-04-2024Open Source Cohere | 53 | Apr 2026 | |
| 127 | Bielik-4.5B-v3.0-InstructOpen Source SpeakLeash | 53 | Apr 2026 | |
| 128 | Claude-Haiku-4.5Open Source Anthropic | 52 | Apr 2026 | |
| 129 | GPT-5.4-nano-2026-03-17 (no reasoning)Open Source OpenAI | 52 | Apr 2026 | |
| 130 | Mistral-Small-3.2-24B-2506Open Source Mistral | 51 | Apr 2026 | |
| 131 | Gemma-3-27b Google | 51 | Apr 2026 | |
| 132 | Llama-4-ScoutOpen Source Meta | 51 | Apr 2026 | |
| 133 | Llama-3.0-70BOpen Source Meta | 49 | Apr 2026 | |
| 134 | Gemma-2-27bOpen Source Google | 47 | Apr 2026 | |
| 135 | Qwen3-Next-80B-A3B-InstructOpen Source Alibaba | 46 | Apr 2026 | |
| 136 | Llama-PLLuM-8B-chatOpen Source PLLuM | 46 | Apr 2026 | |
| 137 | Mistral-Small-3.1-24B-2503Open Source Mistral | 45 | Apr 2026 | |
| 138 | Qwen-2.5-72bOpen Source Alibaba | 45 | Apr 2026 | |
| 139 | Ministral-14b-2512Open Source Mistral | 45 | Apr 2026 | |
| 140 | Magistral-Small-2506Open Source Mistral | 45 | Apr 2026 | |
| 141 | Qwen3.5-9BOpen Source Alibaba | 44 | Apr 2026 | |
| 142 | Mixtral-8x7bOpen Source Mistral | 44 | Apr 2026 | |
| 143 | Mistral-Small-24B-2501Open Source Mistral | 42 | Apr 2026 | |
| 144 | Qwen-PlusOpen Source Alibaba | 42 | Apr 2026 | |
| 145 | Ministral-8b-2512Open Source Mistral | 39 | Apr 2026 | |
| 146 | Qwen3-32BOpen Source Alibaba | 37 | Apr 2026 | |
| 147 | Bielik-1.5B-v3.0-InstructOpen Source SpeakLeash | 35 | Apr 2026 | |
| 148 | GPT-OSS-20bOpen Source OpenAI | 35 | Apr 2026 | |
| 149 | Phi-4 Microsoft | 35 | Apr 2026 | |
| 150 | Command-R7BOpen Source Cohere | 33 | Apr 2026 | |
| 151 | Llama-3.1-8BOpen Source Meta | 31 | Apr 2026 | |
| 152 | Qwen3-30B-A3BOpen Source Alibaba | 31 | Apr 2026 | |
| 153 | Qwen3-14BOpen Source Alibaba | 30 | Apr 2026 | |
| 154 | Gemma-2-9bOpen Source Google | 30 | Apr 2026 | |
| 155 | Qwen-Turbo-2024-11-01Open Source Alibaba | 30 | Apr 2026 | |
| 156 | Qwen3-8BOpen Source Alibaba | 27 | Apr 2026 | |
| 157 | Mistral-7b-v0.3Open Source Mistral | 27 | Apr 2026 | |
| 158 | Qwen3.5-4BOpen Source Alibaba | 27 | Apr 2026 | |
| 159 | Mistral-NemoOpen Source Mistral | 26 | Apr 2026 | |
| 160 | Qwen-2.5-32bOpen Source Alibaba | 25 | Apr 2026 | |
| 161 | Ministral-3b-2512Open Source Mistral | 24 | Apr 2026 | |
| 162 | Qwen-2.5-14bOpen Source Alibaba | 23 | Apr 2026 | |
| 163 | Ministral-8bOpen Source Mistral | 19 | Apr 2026 | |
| 164 | Qwen-2.5-7bOpen Source Alibaba | 17 | Apr 2026 | |
| 165 | Qwen3.5-2BOpen Source Alibaba | 12 | Apr 2026 |
grammar
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Gemini-3.1-Pro-PreviewOpen Source Google | 93 | Apr 2026 | |
| 2 | Gemini-3.0-Pro-PreviewOpen Source Google | 91 | Apr 2026 | |
| 3 | GPT-5.4-2026-03-05 (high reasoning)Open Source OpenAI | 90 | Apr 2026 | |
| 4 | Grok-4API xAI | 90 | Apr 2026 | |
| 5 | GPT-5.2-2025-12-11 (xhigh reasoning)Open Source OpenAI | 89 | Apr 2026 | |
| 6 | GPT-5.4-2026-03-05 (low reasoning)Open Source OpenAI | 88 | Apr 2026 | |
| 7 | GPT-5.2-2025-12-11 (high reasoning)Open Source OpenAI | 87 | Apr 2026 | |
| 8 | Gemini-2.5-Pro-Preview-06-05Open Source Google | 86 | Apr 2026 | |
| 9 | GPT-5.4-mini-2026-03-17 (high reasoning)Open Source OpenAI | 85 | Apr 2026 | |
| 10 | GPT-5-Pro-2025-10-06 (high reasoning)Open Source OpenAI | 85 | Apr 2026 | |
| 11 | O3-2025-04-16Open Source OpenAI | 85 | Apr 2026 | |
| 12 | Gemini-3-Flash-PreviewOpen Source Google | 85 | Apr 2026 | |
| 13 | O1-2024-12-17Open Source OpenAI | 84 | Apr 2026 | |
| 14 | DeepSeek-V3.2-SpecialeOpen Source DeepSeek | 84 | Apr 2026 | |
| 15 | GPT-5-2025-08-07Open Source OpenAI | 84 | Apr 2026 | |
| 16 | GLM-5API Zhipu AI | 82 | Apr 2026 | |
| 17 | GPT-5-mini-2025-08-07Open Source OpenAI | 82 | Apr 2026 | |
| 18 | GPT-5.1-2025-11-13 (high reasoning)Open Source OpenAI | 82 | Apr 2026 | |
| 19 | GPT-5.2-2025-12-11 (medium reasoning)Open Source OpenAI | 82 | Apr 2026 | |
| 20 | Claude-Sonnet-4.6Open Source Anthropic | 80 | Apr 2026 | |
| 21 | Kimi-K2.5Open Source Moonshot.AI | 80 | Apr 2026 | |
| 22 | Claude-3.7-Sonnet-ThinkingOpen Source Anthropic | 80 | Apr 2026 | |
| 23 | Claude-Opus-4.5Open Source Anthropic | 79 | Apr 2026 | |
| 24 | GPT-5.4-2026-03-05 (no reasoning)Open Source OpenAI | 79 | Apr 2026 | |
| 25 | Gemini-2.5-Pro-Exp-03-25Open Source Google | 79 | Apr 2026 | |
| 26 | MiMo-V2-ProOpen Source Xiaomi | 79 | Apr 2026 | |
| 27 | Claude-3.5-Sonnet-20241022Open Source Anthropic | 79 | Apr 2026 | |
| 28 | Claude-Opus-4.6Open Source Anthropic | 77 | Apr 2026 | |
| 29 | Gemini-2.5-Flash-Preview-04-17Open Source Google | 77 | Apr 2026 | |
| 30 | Claude-Opus-4API Anthropic | 76 | Apr 2026 | |
| 31 | Qwen3.5-397B-A17BOpen Source Alibaba | 76 | Apr 2026 | |
| 32 | Claude-3.5-Sonnet-20240620Open Source Anthropic | 75 | Apr 2026 | |
| 33 | DeepSeek-v3.1 (thinking)Open Source DeepSeek | 75 | Apr 2026 | |
| 34 | DeepSeek-R1Open Source DeepSeek | 74 | Apr 2026 | |
| 35 | Claude-Opus-4.1Open Source Anthropic | 74 | Apr 2026 | |
| 36 | Claude-3.7-SonnetOpen Source Anthropic | 74 | Apr 2026 | |
| 37 | GPT-4.5-preview-2025-02-27Open Source OpenAI | 74 | Apr 2026 | |
| 38 | GPT-5.4-nano-2026-03-17 (high reasoning)Open Source OpenAI | 74 | Apr 2026 | |
| 39 | DeepSeek-R1-0528Open Source DeepSeek | 73 | Apr 2026 | |
| 40 | Qwen3.5-122B-A10BOpen Source Alibaba | 73 | Apr 2026 | |
| 41 | Kimi-K2-ThinkingOpen Source Moonshot.AI | 73 | Apr 2026 | |
| 42 | O4-Mini-2025-04-16Open Source OpenAI | 72 | Apr 2026 | |
| 43 | Grok-4.1-FastOpen Source xAI | 72 | Apr 2026 | |
| 44 | Grok-4-FastOpen Source xAI | 72 | Apr 2026 | |
| 45 | Grok-4.20Open Source xAI | 72 | Apr 2026 | |
| 46 | MiniMax-M2.7Open Source MiniMaxAI | 72 | Apr 2026 | |
| 47 | MiniMax-M2.5Open Source MiniMaxAI | 71 | Apr 2026 | |
| 48 | Grok-3-Mini-BetaOpen Source xAI | 71 | Apr 2026 | |
| 49 | GPT-5.4-mini-2026-03-17 (no reasoning)Open Source OpenAI | 70 | Apr 2026 | |
| 50 | GPT-5.1-2025-11-13 (default reasoning)Open Source OpenAI | 70 | Apr 2026 | |
| 51 | GPT-4o-2024-05-13Open Source OpenAI | 70 | Apr 2026 | |
| 52 | GPT-5-nano-2025-08-07Open Source OpenAI | 69 | Apr 2026 | |
| 53 | Gemini-Exp-1206Open Source Google | 69 | Apr 2026 | |
| 54 | GPT-5.2-2025-12-11 (no reasoning)Open Source OpenAI | 69 | Apr 2026 | |
| 55 | Gemini-2.0-Flash-Thinking-Exp-01-21Open Source Google | 68 | Apr 2026 | |
| 56 | Claude-Sonnet-4.5Open Source Anthropic | 68 | Apr 2026 | |
| 57 | O3-mini-2025-01-31Open Source OpenAI | 67 | Apr 2026 | |
| 58 | GPT-4o-2024-11-20Open Source OpenAI | 67 | Apr 2026 | |
| 59 | GPT-4.1-2025-04-14Open Source OpenAI | 67 | Apr 2026 | |
| 60 | Mistral-Large-2512Open Source Mistral | 67 | Apr 2026 | |
| 61 | Claude-3-OpusAPI Anthropic | 66 | Apr 2026 | |
| 62 | GPT-4o-2024-08-06Open Source OpenAI | 66 | Apr 2026 | |
| 63 | DeepSeek-V3.2Open Source DeepSeek | 66 | Apr 2026 | |
| 64 | GLM-4.7Open Source Zhipu AI | 66 | Apr 2026 | |
| 65 | Qwen3-235B-A22B Alibaba | 66 | Apr 2026 | |
| 66 | Qwen3.5-35B-A3BOpen Source Alibaba | 66 | Apr 2026 | |
| 67 | Qwen3-Next-80B-A3B-ThinkingOpen Source Alibaba | 65 | Apr 2026 | |
| 68 | Gemini-2.0-Flash-ExperimentalOpen Source Google | 65 | Apr 2026 | |
| 69 | Grok-3-BetaOpen Source xAI | 65 | Apr 2026 | |
| 70 | GPT-OSS-120bOpen Source OpenAI | 64 | Apr 2026 | |
| 71 | DeepSeek-v3.1 (no thinking)Open Source DeepSeek | 64 | Apr 2026 | |
| 72 | Grok-2-1212Open Source xAI | 64 | Apr 2026 | |
| 73 | DeepSeek-v3-0324Open Source DeepSeek | 64 | Apr 2026 | |
| 74 | GLM-4.6Open Source Zhipu AI | 63 | Apr 2026 | |
| 75 | DeepSeek-v3.2-ExpOpen Source DeepSeek | 63 | Apr 2026 | |
| 76 | Claude-Sonnet-4API Anthropic | 63 | Apr 2026 | |
| 77 | GPT-4.1-mini-2025-04-14Open Source OpenAI | 62 | Apr 2026 | |
| 78 | Qwen3.5-27BOpen Source Alibaba | 62 | Apr 2026 | |
| 79 | DeepSeek-v3Open Source DeepSeek | 62 | Apr 2026 | |
| 80 | O1-mini-2024-09-12Open Source OpenAI | 61 | Apr 2026 | |
| 81 | Mistral-Medium-3Open Source Mistral | 61 | Apr 2026 | |
| 82 | Llama-4-MaverickOpen Source Meta | 59 | Apr 2026 | |
| 83 | GLM-4.5Open Source Zhipu AI | 59 | Apr 2026 | |
| 84 | Kimi-K2-0905Open Source Moonshot.AI | 59 | Apr 2026 | |
| 85 | Claude-Haiku-4.5Open Source Anthropic | 59 | Apr 2026 | |
| 86 | GPT-4 OpenAI | 58 | Apr 2026 | |
| 87 | Qwen3-MaxOpen Source Alibaba | 58 | Apr 2026 | |
| 88 | Gemini-Pro-1.5Open Source Google | 58 | Apr 2026 | |
| 89 | Kimi-K2Open Source Moonshot.AI | 58 | Apr 2026 | |
| 90 | Llama-3.1-405bOpen Source Meta | 57 | Apr 2026 | |
| 91 | Claude-3.5-Haiku-20241022Open Source Anthropic | 57 | Apr 2026 | |
| 92 | Bielik-11B-v3.0-InstructOpen Source SpeakLeash | 57 | Apr 2026 | |
| 93 | Claude-3.0-SonnetOpen Source Anthropic | 56 | Apr 2026 | |
| 94 | Mistral-Small-4Open Source Mistral | 56 | Apr 2026 | |
| 95 | GPT-4-turboAPI OpenAI | 56 | Apr 2026 | |
| 96 | Llama-3.1-Tulu-3-405BOpen Source Meta | 56 | Apr 2026 | |
| 97 | Bielik-2.6Open Source SpeakLeash | 55 | Apr 2026 | |
| 98 | GPT-4o-mini-2024-07-18Open Source OpenAI | 55 | Apr 2026 | |
| 99 | GPT-OSS-20bOpen Source OpenAI | 54 | Apr 2026 | |
| 100 | Llama-PLLuM-70B-chat-250801Open Source PLLuM | 54 | Apr 2026 | |
| 101 | Qwen3.5-9BOpen Source Alibaba | 54 | Apr 2026 | |
| 102 | Mistral-Large-2411Open Source Mistral | 54 | Apr 2026 | |
| 103 | Bielik-2.2Open Source SpeakLeash | 53 | Apr 2026 | |
| 104 | Mistral-Small-3.2-24B-2506Open Source Mistral | 53 | Apr 2026 | |
| 105 | PLLuM-12B-nc-chat-250715Open Source PLLuM | 52 | Apr 2026 | |
| 106 | Qwen3-Next-80B-A3B-InstructOpen Source Alibaba | 52 | Apr 2026 | |
| 107 | GLM-4.5-AirOpen Source Zhipu AI | 52 | Apr 2026 | |
| 108 | Mistral-Large-2407Open Source Mistral | 51 | Apr 2026 | |
| 109 | Llama-4-ScoutOpen Source Meta | 51 | Apr 2026 | |
| 110 | Qwen-MaxOpen Source Alibaba | 51 | Apr 2026 | |
| 111 | Bielik-2.5Open Source SpeakLeash | 51 | Apr 2026 | |
| 112 | Llama-PLLuM-70B-chatOpen Source PLLuM | 50 | Apr 2026 | |
| 113 | Mixtral-8x22bOpen Source Mistral | 50 | Apr 2026 | |
| 114 | Mistral-Small-3.1-24B-2503Open Source Mistral | 50 | Apr 2026 | |
| 115 | Bielik-Minitron-7B-v3.0-InstructOpen Source SpeakLeash | 50 | Apr 2026 | |
| 116 | Bielik-2.1Open Source SpeakLeash | 50 | Apr 2026 | |
| 117 | Qwen3-30B-A3BOpen Source Alibaba | 49 | Apr 2026 | |
| 118 | Command-A-03-2025Open Source Cohere | 49 | Apr 2026 | |
| 119 | Bielik-2.3Open Source SpeakLeash | 49 | Apr 2026 | |
| 120 | WizardLM-2-8x22bOpen Source Microsoft | 49 | Apr 2026 | |
| 121 | Llama-3.3-70BOpen Source Meta | 49 | Apr 2026 | |
| 122 | Qwen3-32BOpen Source Alibaba | 48 | Apr 2026 | |
| 123 | Magistral-Small-2506Open Source Mistral | 47 | Apr 2026 | |
| 124 | PLLuM-8x7B-nc-chatOpen Source PLLuM | 47 | Apr 2026 | |
| 125 | Qwen-PlusOpen Source Alibaba | 47 | Apr 2026 | |
| 126 | Gemma-3-27b Google | 46 | Apr 2026 | |
| 127 | Gemma-2-27bOpen Source Google | 46 | Apr 2026 | |
| 128 | Gemini-Flash-1.5Open Source Google | 46 | Apr 2026 | |
| 129 | Qwen3-14BOpen Source Alibaba | 46 | Apr 2026 | |
| 130 | Mistral-Small-24B-2501Open Source Mistral | 45 | Apr 2026 | |
| 131 | Qwen3.5-4BOpen Source Alibaba | 45 | Apr 2026 | |
| 132 | GPT-5.4-nano-2026-03-17 (no reasoning)Open Source OpenAI | 45 | Apr 2026 | |
| 133 | Llama-3.0-70BOpen Source Meta | 45 | Apr 2026 | |
| 134 | GPT-4.1-nano-2025-04-14Open Source OpenAI | 45 | Apr 2026 | |
| 135 | Command-R-Plus-04-2024Open Source Cohere | 45 | Apr 2026 | |
| 136 | Qwen-2.5-72bOpen Source Alibaba | 45 | Apr 2026 | |
| 137 | Llama-3.1-70BOpen Source Meta | 44 | Apr 2026 | |
| 138 | Ministral-8b-2512Open Source Mistral | 44 | Apr 2026 | |
| 139 | Ministral-14b-2512Open Source Mistral | 44 | Apr 2026 | |
| 140 | GLM-4.7-FlashOpen Source Zhipu AI | 44 | Apr 2026 | |
| 141 | Qwen-2.5-32bOpen Source Alibaba | 43 | Apr 2026 | |
| 142 | Command-R-Plus-08-2024Open Source Cohere | 43 | Apr 2026 | |
| 143 | PLLuM-8x7B-chatOpen Source PLLuM | 42 | Apr 2026 | |
| 144 | GPT-3.5-turboOpen Source OpenAI | 41 | Apr 2026 | |
| 145 | PLLuM-12B-nc-chatOpen Source PLLuM | 41 | Apr 2026 | |
| 146 | EuroLLM-9BOpen Source UTTER | 39 | Apr 2026 | |
| 147 | Qwen3-8BOpen Source Alibaba | 38 | Apr 2026 | |
| 148 | Gemma-2-9bOpen Source Google | 38 | Apr 2026 | |
| 149 | PLLuM-12B-chatOpen Source PLLuM | 37 | Apr 2026 | |
| 150 | Bielik-4.5B-v3.0-InstructOpen Source SpeakLeash | 35 | Apr 2026 | |
| 151 | Phi-4 Microsoft | 34 | Apr 2026 | |
| 152 | Mixtral-8x7bOpen Source Mistral | 34 | Apr 2026 | |
| 153 | Qwen-2.5-14bOpen Source Alibaba | 34 | Apr 2026 | |
| 154 | Llama-PLLuM-8B-chatOpen Source PLLuM | 33 | Apr 2026 | |
| 155 | Qwen-Turbo-2024-11-01Open Source Alibaba | 33 | Apr 2026 | |
| 156 | Mistral-NemoOpen Source Mistral | 31 | Apr 2026 | |
| 157 | Ministral-3b-2512Open Source Mistral | 30 | Apr 2026 | |
| 158 | Llama-3.1-8BOpen Source Meta | 29 | Apr 2026 | |
| 159 | Bielik-0.1Open Source SpeakLeash | 29 | Apr 2026 | |
| 160 | Qwen-2.5-7bOpen Source Alibaba | 29 | Apr 2026 | |
| 161 | Mistral-7b-v0.3Open Source Mistral | 27 | Apr 2026 | |
| 162 | Ministral-8bOpen Source Mistral | 24 | Apr 2026 | |
| 163 | Bielik-1.5B-v3.0-InstructOpen Source SpeakLeash | 23 | Apr 2026 | |
| 164 | Command-R7BOpen Source Cohere | 23 | Apr 2026 | |
| 165 | Qwen3.5-2BOpen Source Alibaba | 19 | Apr 2026 |
history
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Gemini-3.1-Pro-PreviewOpen Source Google | 98 | Apr 2026 | |
| 2 | Gemini-3.0-Pro-PreviewOpen Source Google | 95 | Apr 2026 | |
| 3 | GPT-5.2-2025-12-11 (xhigh reasoning)Open Source OpenAI | 94 | Apr 2026 | |
| 4 | Grok-4API xAI | 94 | Apr 2026 | |
| 5 | GPT-5.4-2026-03-05 (low reasoning)Open Source OpenAI | 93 | Apr 2026 | |
| 6 | GPT-5.4-2026-03-05 (high reasoning)Open Source OpenAI | 92 | Apr 2026 | |
| 7 | Gemini-2.5-Pro-Exp-03-25Open Source Google | 92 | Apr 2026 | |
| 8 | Gemini-2.5-Pro-Preview-06-05Open Source Google | 92 | Apr 2026 | |
| 9 | Gemini-3-Flash-PreviewOpen Source Google | 92 | Apr 2026 | |
| 10 | Claude-3.7-Sonnet-ThinkingOpen Source Anthropic | 92 | Apr 2026 | |
| 11 | DeepSeek-R1-0528Open Source DeepSeek | 91 | Apr 2026 | |
| 12 | GPT-5-2025-08-07Open Source OpenAI | 91 | Apr 2026 | |
| 13 | GPT-5-Pro-2025-10-06 (high reasoning)Open Source OpenAI | 91 | Apr 2026 | |
| 14 | Claude-Opus-4.1Open Source Anthropic | 91 | Apr 2026 | |
| 15 | Claude-3.5-Sonnet-20241022Open Source Anthropic | 91 | Apr 2026 | |
| 16 | Claude-3.7-SonnetOpen Source Anthropic | 90 | Apr 2026 | |
| 17 | O1-2024-12-17Open Source OpenAI | 90 | Apr 2026 | |
| 18 | GPT-5.2-2025-12-11 (medium reasoning)Open Source OpenAI | 90 | Apr 2026 | |
| 19 | DeepSeek-V3.2-SpecialeOpen Source DeepSeek | 90 | Apr 2026 | |
| 20 | GPT-5.2-2025-12-11 (high reasoning)Open Source OpenAI | 90 | Apr 2026 | |
| 21 | GPT-4.5-preview-2025-02-27Open Source OpenAI | 90 | Apr 2026 | |
| 22 | O3-2025-04-16Open Source OpenAI | 89 | Apr 2026 | |
| 23 | Kimi-K2.5Open Source Moonshot.AI | 89 | Apr 2026 | |
| 24 | Claude-3.5-Sonnet-20240620Open Source Anthropic | 89 | Apr 2026 | |
| 25 | GPT-5.1-2025-11-13 (high reasoning)Open Source OpenAI | 89 | Apr 2026 | |
| 26 | DeepSeek-v3.1 (thinking)Open Source DeepSeek | 89 | Apr 2026 | |
| 27 | GPT-5.4-mini-2026-03-17 (high reasoning)Open Source OpenAI | 89 | Apr 2026 | |
| 28 | GLM-5API Zhipu AI | 88 | Apr 2026 | |
| 29 | Gemini-Exp-1206Open Source Google | 88 | Apr 2026 | |
| 30 | GLM-4.6Open Source Zhipu AI | 87 | Apr 2026 | |
| 31 | MiMo-V2-ProOpen Source Xiaomi | 87 | Apr 2026 | |
| 32 | Claude-Opus-4.6Open Source Anthropic | 87 | Apr 2026 | |
| 33 | GPT-5.4-2026-03-05 (no reasoning)Open Source OpenAI | 87 | Apr 2026 | |
| 34 | Claude-Opus-4.5Open Source Anthropic | 87 | Apr 2026 | |
| 35 | Claude-Opus-4API Anthropic | 87 | Apr 2026 | |
| 36 | Claude-3-OpusAPI Anthropic | 86 | Apr 2026 | |
| 37 | DeepSeek-v3.1 (no thinking)Open Source DeepSeek | 86 | Apr 2026 | |
| 38 | GPT-4o-2024-08-06Open Source OpenAI | 86 | Apr 2026 | |
| 39 | Gemini-2.5-Flash-Preview-04-17Open Source Google | 86 | Apr 2026 | |
| 40 | GLM-4.7Open Source Zhipu AI | 85 | Apr 2026 | |
| 41 | Grok-3-BetaOpen Source xAI | 85 | Apr 2026 | |
| 42 | GPT-5.2-2025-12-11 (no reasoning)Open Source OpenAI | 85 | Apr 2026 | |
| 43 | Claude-Sonnet-4.5Open Source Anthropic | 85 | Apr 2026 | |
| 44 | DeepSeek-R1Open Source DeepSeek | 85 | Apr 2026 | |
| 45 | GPT-4.1-2025-04-14Open Source OpenAI | 85 | Apr 2026 | |
| 46 | GPT-4o-2024-11-20Open Source OpenAI | 84 | Apr 2026 | |
| 47 | Grok-3-Mini-BetaOpen Source xAI | 84 | Apr 2026 | |
| 48 | Grok-4.1-FastOpen Source xAI | 84 | Apr 2026 | |
| 49 | DeepSeek-v3.2-ExpOpen Source DeepSeek | 83 | Apr 2026 | |
| 50 | GPT-5-mini-2025-08-07Open Source OpenAI | 83 | Apr 2026 | |
| 51 | Gemini-2.0-Flash-ExperimentalOpen Source Google | 83 | Apr 2026 | |
| 52 | Qwen3.5-397B-A17BOpen Source Alibaba | 83 | Apr 2026 | |
| 53 | DeepSeek-V3.2Open Source DeepSeek | 82 | Apr 2026 | |
| 54 | GPT-4o-2024-05-13Open Source OpenAI | 82 | Apr 2026 | |
| 55 | DeepSeek-v3-0324Open Source DeepSeek | 82 | Apr 2026 | |
| 56 | Claude-Sonnet-4.6Open Source Anthropic | 82 | Apr 2026 | |
| 57 | GPT-5.1-2025-11-13 (default reasoning)Open Source OpenAI | 82 | Apr 2026 | |
| 58 | GPT-5.4-mini-2026-03-17 (no reasoning)Open Source OpenAI | 82 | Apr 2026 | |
| 59 | Grok-4.20Open Source xAI | 82 | Apr 2026 | |
| 60 | Grok-4-FastOpen Source xAI | 81 | Apr 2026 | |
| 61 | Claude-Sonnet-4API Anthropic | 81 | Apr 2026 | |
| 62 | Kimi-K2-ThinkingOpen Source Moonshot.AI | 80 | Apr 2026 | |
| 63 | Gemini-2.0-Flash-Thinking-Exp-01-21Open Source Google | 80 | Apr 2026 | |
| 64 | Gemini-Pro-1.5Open Source Google | 79 | Apr 2026 | |
| 65 | Mistral-Large-2512Open Source Mistral | 79 | Apr 2026 | |
| 66 | Qwen3.5-122B-A10BOpen Source Alibaba | 78 | Apr 2026 | |
| 67 | Mistral-Medium-3Open Source Mistral | 78 | Apr 2026 | |
| 68 | Bielik-11B-v3.0-InstructOpen Source SpeakLeash | 78 | Apr 2026 | |
| 69 | DeepSeek-v3Open Source DeepSeek | 77 | Apr 2026 | |
| 70 | Bielik-2.2Open Source SpeakLeash | 77 | Apr 2026 | |
| 71 | O4-Mini-2025-04-16Open Source OpenAI | 77 | Apr 2026 | |
| 72 | GLM-4.5Open Source Zhipu AI | 77 | Apr 2026 | |
| 73 | Llama-4-MaverickOpen Source Meta | 76 | Apr 2026 | |
| 74 | Bielik-2.3Open Source SpeakLeash | 76 | Apr 2026 | |
| 75 | GPT-4-turboAPI OpenAI | 76 | Apr 2026 | |
| 76 | GPT-5.4-nano-2026-03-17 (high reasoning)Open Source OpenAI | 76 | Apr 2026 | |
| 77 | Llama-3.1-Tulu-3-405BOpen Source Meta | 75 | Apr 2026 | |
| 78 | Bielik-2.5Open Source SpeakLeash | 75 | Apr 2026 | |
| 79 | Grok-2-1212Open Source xAI | 74 | Apr 2026 | |
| 80 | Llama-PLLuM-70B-chatOpen Source PLLuM | 74 | Apr 2026 | |
| 81 | Qwen3-MaxOpen Source Alibaba | 74 | Apr 2026 | |
| 82 | Command-A-03-2025Open Source Cohere | 73 | Apr 2026 | |
| 83 | Kimi-K2Open Source Moonshot.AI | 73 | Apr 2026 | |
| 84 | PLLuM-12B-nc-chat-250715Open Source PLLuM | 73 | Apr 2026 | |
| 85 | PLLuM-8x7B-nc-chatOpen Source PLLuM | 73 | Apr 2026 | |
| 86 | GPT-5-nano-2025-08-07Open Source OpenAI | 73 | Apr 2026 | |
| 87 | Claude-3.0-SonnetOpen Source Anthropic | 73 | Apr 2026 | |
| 88 | Llama-3.1-405bOpen Source Meta | 73 | Apr 2026 | |
| 89 | Bielik-2.1Open Source SpeakLeash | 73 | Apr 2026 | |
| 90 | Qwen3-Next-80B-A3B-ThinkingOpen Source Alibaba | 72 | Apr 2026 | |
| 91 | GPT-4 OpenAI | 72 | Apr 2026 | |
| 92 | Bielik-2.6Open Source SpeakLeash | 72 | Apr 2026 | |
| 93 | Mistral-Large-2407Open Source Mistral | 71 | Apr 2026 | |
| 94 | Qwen3-235B-A22B Alibaba | 70 | Apr 2026 | |
| 95 | Kimi-K2-0905Open Source Moonshot.AI | 70 | Apr 2026 | |
| 96 | PLLuM-12B-nc-chatOpen Source PLLuM | 70 | Apr 2026 | |
| 97 | MiniMax-M2.5Open Source MiniMaxAI | 69 | Apr 2026 | |
| 98 | Llama-PLLuM-70B-chat-250801Open Source PLLuM | 69 | Apr 2026 | |
| 99 | Mixtral-8x22bOpen Source Mistral | 69 | Apr 2026 | |
| 100 | Llama-3.1-70BOpen Source Meta | 68 | Apr 2026 | |
| 101 | PLLuM-8x7B-chatOpen Source PLLuM | 68 | Apr 2026 | |
| 102 | Qwen3.5-35B-A3BOpen Source Alibaba | 68 | Apr 2026 | |
| 103 | WizardLM-2-8x22bOpen Source Microsoft | 67 | Apr 2026 | |
| 104 | O3-mini-2025-01-31Open Source OpenAI | 67 | Apr 2026 | |
| 105 | GPT-4o-mini-2024-07-18Open Source OpenAI | 67 | Apr 2026 | |
| 106 | GPT-4.1-mini-2025-04-14Open Source OpenAI | 67 | Apr 2026 | |
| 107 | GLM-4.5-AirOpen Source Zhipu AI | 66 | Apr 2026 | |
| 108 | Llama-3.3-70BOpen Source Meta | 65 | Apr 2026 | |
| 109 | GPT-OSS-120bOpen Source OpenAI | 65 | Apr 2026 | |
| 110 | MiniMax-M2.7Open Source MiniMaxAI | 64 | Apr 2026 | |
| 111 | Llama-3.0-70BOpen Source Meta | 64 | Apr 2026 | |
| 112 | Bielik-Minitron-7B-v3.0-InstructOpen Source SpeakLeash | 64 | Apr 2026 | |
| 113 | Mistral-Small-4Open Source Mistral | 64 | Apr 2026 | |
| 114 | Mistral-Large-2411Open Source Mistral | 64 | Apr 2026 | |
| 115 | Qwen3.5-27BOpen Source Alibaba | 63 | Apr 2026 | |
| 116 | Qwen-MaxOpen Source Alibaba | 63 | Apr 2026 | |
| 117 | PLLuM-12B-chatOpen Source PLLuM | 61 | Apr 2026 | |
| 118 | O1-mini-2024-09-12Open Source OpenAI | 61 | Apr 2026 | |
| 119 | Claude-3.5-Haiku-20241022Open Source Anthropic | 61 | Apr 2026 | |
| 120 | Command-R-Plus-04-2024Open Source Cohere | 61 | Apr 2026 | |
| 121 | Command-R-Plus-08-2024Open Source Cohere | 61 | Apr 2026 | |
| 122 | Mistral-Small-3.2-24B-2506Open Source Mistral | 61 | Apr 2026 | |
| 123 | Claude-Haiku-4.5Open Source Anthropic | 60 | Apr 2026 | |
| 124 | Bielik-0.1Open Source SpeakLeash | 58 | Apr 2026 | |
| 125 | Qwen3-Next-80B-A3B-InstructOpen Source Alibaba | 58 | Apr 2026 | |
| 126 | GPT-5.4-nano-2026-03-17 (no reasoning)Open Source OpenAI | 57 | Apr 2026 | |
| 127 | Mixtral-8x7bOpen Source Mistral | 56 | Apr 2026 | |
| 128 | Qwen3-32BOpen Source Alibaba | 55 | Apr 2026 | |
| 129 | Bielik-4.5B-v3.0-InstructOpen Source SpeakLeash | 55 | Apr 2026 | |
| 130 | Magistral-Small-2506Open Source Mistral | 54 | Apr 2026 | |
| 131 | GLM-4.7-FlashOpen Source Zhipu AI | 54 | Apr 2026 | |
| 132 | Mistral-Small-3.1-24B-2503Open Source Mistral | 54 | Apr 2026 | |
| 133 | Qwen-2.5-72bOpen Source Alibaba | 54 | Apr 2026 | |
| 134 | Gemma-2-27bOpen Source Google | 53 | Apr 2026 | |
| 135 | Gemma-3-27b Google | 52 | Apr 2026 | |
| 136 | Ministral-14b-2512Open Source Mistral | 52 | Apr 2026 | |
| 137 | Gemini-Flash-1.5Open Source Google | 51 | Apr 2026 | |
| 138 | GPT-3.5-turboOpen Source OpenAI | 51 | Apr 2026 | |
| 139 | GPT-4.1-nano-2025-04-14Open Source OpenAI | 50 | Apr 2026 | |
| 140 | Llama-PLLuM-8B-chatOpen Source PLLuM | 50 | Apr 2026 | |
| 141 | Mistral-Small-24B-2501Open Source Mistral | 49 | Apr 2026 | |
| 142 | EuroLLM-9BOpen Source UTTER | 49 | Apr 2026 | |
| 143 | Qwen3.5-9BOpen Source Alibaba | 48 | Apr 2026 | |
| 144 | Llama-4-ScoutOpen Source Meta | 47 | Apr 2026 | |
| 145 | Qwen-PlusOpen Source Alibaba | 46 | Apr 2026 | |
| 146 | Qwen-2.5-32bOpen Source Alibaba | 44 | Apr 2026 | |
| 147 | Ministral-8b-2512Open Source Mistral | 43 | Apr 2026 | |
| 148 | Qwen-Turbo-2024-11-01Open Source Alibaba | 42 | Apr 2026 | |
| 149 | Qwen3-30B-A3BOpen Source Alibaba | 42 | Apr 2026 | |
| 150 | Qwen3-14BOpen Source Alibaba | 42 | Apr 2026 | |
| 151 | Qwen3-8BOpen Source Alibaba | 41 | Apr 2026 | |
| 152 | Phi-4 Microsoft | 40 | Apr 2026 | |
| 153 | Qwen-2.5-14bOpen Source Alibaba | 37 | Apr 2026 | |
| 154 | GPT-OSS-20bOpen Source OpenAI | 37 | Apr 2026 | |
| 155 | Qwen3.5-4BOpen Source Alibaba | 36 | Apr 2026 | |
| 156 | Gemma-2-9bOpen Source Google | 35 | Apr 2026 | |
| 157 | Ministral-8bOpen Source Mistral | 33 | Apr 2026 | |
| 158 | Bielik-1.5B-v3.0-InstructOpen Source SpeakLeash | 32 | Apr 2026 | |
| 159 | Mistral-7b-v0.3Open Source Mistral | 30 | Apr 2026 | |
| 160 | Ministral-3b-2512Open Source Mistral | 30 | Apr 2026 | |
| 161 | Mistral-NemoOpen Source Mistral | 28 | Apr 2026 | |
| 162 | Command-R7BOpen Source Cohere | 27 | Apr 2026 | |
| 163 | Llama-3.1-8BOpen Source Meta | 25 | Apr 2026 | |
| 164 | Qwen-2.5-7bOpen Source Alibaba | 23 | Apr 2026 | |
| 165 | Qwen3.5-2BOpen Source Alibaba | 14 | Apr 2026 |
vocabulary
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Gemini-3.1-Pro-PreviewOpen Source Google | 96 | Apr 2026 | |
| 2 | Gemini-3.0-Pro-PreviewOpen Source Google | 95 | Apr 2026 | |
| 3 | GPT-5-Pro-2025-10-06 (high reasoning)Open Source OpenAI | 92 | Apr 2026 | |
| 4 | GPT-5.4-2026-03-05 (high reasoning)Open Source OpenAI | 91 | Apr 2026 | |
| 5 | GPT-5-2025-08-07Open Source OpenAI | 91 | Apr 2026 | |
| 6 | Gemini-2.5-Pro-Preview-06-05Open Source Google | 90 | Apr 2026 | |
| 7 | Gemini-2.5-Pro-Exp-03-25Open Source Google | 90 | Apr 2026 | |
| 8 | O3-2025-04-16Open Source OpenAI | 90 | Apr 2026 | |
| 9 | GPT-5.1-2025-11-13 (high reasoning)Open Source OpenAI | 90 | Apr 2026 | |
| 10 | O1-2024-12-17Open Source OpenAI | 88 | Apr 2026 | |
| 11 | Gemini-3-Flash-PreviewOpen Source Google | 88 | Apr 2026 | |
| 12 | GPT-5.2-2025-12-11 (xhigh reasoning)Open Source OpenAI | 87 | Apr 2026 | |
| 13 | GPT-5.2-2025-12-11 (high reasoning)Open Source OpenAI | 86 | Apr 2026 | |
| 14 | GPT-5.4-mini-2026-03-17 (high reasoning)Open Source OpenAI | 86 | Apr 2026 | |
| 15 | GPT-5.2-2025-12-11 (medium reasoning)Open Source OpenAI | 86 | Apr 2026 | |
| 16 | GPT-5.4-2026-03-05 (low reasoning)Open Source OpenAI | 85 | Apr 2026 | |
| 17 | GPT-5.4-2026-03-05 (no reasoning)Open Source OpenAI | 85 | Apr 2026 | |
| 18 | Grok-4API xAI | 84 | Apr 2026 | |
| 19 | GPT-4.5-preview-2025-02-27Open Source OpenAI | 83 | Apr 2026 | |
| 20 | Gemini-Exp-1206Open Source Google | 82 | Apr 2026 | |
| 21 | Gemini-2.5-Flash-Preview-04-17Open Source Google | 81 | Apr 2026 | |
| 22 | GPT-4o-2024-11-20Open Source OpenAI | 80 | Apr 2026 | |
| 23 | GPT-4.1-2025-04-14Open Source OpenAI | 80 | Apr 2026 | |
| 24 | GPT-4o-2024-05-13Open Source OpenAI | 78 | Apr 2026 | |
| 25 | Claude-Opus-4.6Open Source Anthropic | 78 | Apr 2026 | |
| 26 | Claude-3.5-Sonnet-20241022Open Source Anthropic | 77 | Apr 2026 | |
| 27 | GPT-5.2-2025-12-11 (no reasoning)Open Source OpenAI | 77 | Apr 2026 | |
| 28 | GPT-4o-2024-08-06Open Source OpenAI | 77 | Apr 2026 | |
| 29 | Claude-Opus-4.5Open Source Anthropic | 76 | Apr 2026 | |
| 30 | Claude-3.5-Sonnet-20240620Open Source Anthropic | 76 | Apr 2026 | |
| 31 | GPT-5.1-2025-11-13 (default reasoning)Open Source OpenAI | 75 | Apr 2026 | |
| 32 | Claude-3.7-SonnetOpen Source Anthropic | 75 | Apr 2026 | |
| 33 | Claude-3.7-Sonnet-ThinkingOpen Source Anthropic | 75 | Apr 2026 | |
| 34 | DeepSeek-v3.1 (thinking)Open Source DeepSeek | 74 | Apr 2026 | |
| 35 | Claude-Sonnet-4.6Open Source Anthropic | 74 | Apr 2026 | |
| 36 | Claude-Opus-4.1Open Source Anthropic | 73 | Apr 2026 | |
| 37 | MiMo-V2-ProOpen Source Xiaomi | 73 | Apr 2026 | |
| 38 | Claude-Opus-4API Anthropic | 73 | Apr 2026 | |
| 39 | DeepSeek-R1Open Source DeepSeek | 72 | Apr 2026 | |
| 40 | Gemini-2.0-Flash-ExperimentalOpen Source Google | 72 | Apr 2026 | |
| 41 | GLM-5API Zhipu AI | 72 | Apr 2026 | |
| 42 | DeepSeek-V3.2-SpecialeOpen Source DeepSeek | 71 | Apr 2026 | |
| 43 | Qwen3.5-397B-A17BOpen Source Alibaba | 70 | Apr 2026 | |
| 44 | GPT-5.4-mini-2026-03-17 (no reasoning)Open Source OpenAI | 70 | Apr 2026 | |
| 45 | GPT-5-mini-2025-08-07Open Source OpenAI | 70 | Apr 2026 | |
| 46 | Gemini-2.0-Flash-Thinking-Exp-01-21Open Source Google | 69 | Apr 2026 | |
| 47 | Grok-3-BetaOpen Source xAI | 69 | Apr 2026 | |
| 48 | DeepSeek-R1-0528Open Source DeepSeek | 68 | Apr 2026 | |
| 49 | PLLuM-8x7B-nc-chatOpen Source PLLuM | 68 | Apr 2026 | |
| 50 | Gemini-Pro-1.5Open Source Google | 68 | Apr 2026 | |
| 51 | Bielik-11B-v3.0-InstructOpen Source SpeakLeash | 67 | Apr 2026 | |
| 52 | PLLuM-12B-nc-chat-250715Open Source PLLuM | 67 | Apr 2026 | |
| 53 | Kimi-K2.5Open Source Moonshot.AI | 65 | Apr 2026 | |
| 54 | Grok-4.1-FastOpen Source xAI | 65 | Apr 2026 | |
| 55 | DeepSeek-V3.2Open Source DeepSeek | 65 | Apr 2026 | |
| 56 | O4-Mini-2025-04-16Open Source OpenAI | 65 | Apr 2026 | |
| 57 | DeepSeek-v3.2-ExpOpen Source DeepSeek | 64 | Apr 2026 | |
| 58 | Mistral-Large-2512Open Source Mistral | 64 | Apr 2026 | |
| 59 | DeepSeek-v3Open Source DeepSeek | 63 | Apr 2026 | |
| 60 | Bielik-2.6Open Source SpeakLeash | 62 | Apr 2026 | |
| 61 | Mistral-Medium-3Open Source Mistral | 62 | Apr 2026 | |
| 62 | Bielik-2.2Open Source SpeakLeash | 62 | Apr 2026 | |
| 63 | DeepSeek-v3-0324Open Source DeepSeek | 62 | Apr 2026 | |
| 64 | Claude-3-OpusAPI Anthropic | 62 | Apr 2026 | |
| 65 | DeepSeek-v3.1 (no thinking)Open Source DeepSeek | 62 | Apr 2026 | |
| 66 | Qwen3.5-122B-A10BOpen Source Alibaba | 61 | Apr 2026 | |
| 67 | Grok-3-Mini-BetaOpen Source xAI | 61 | Apr 2026 | |
| 68 | GPT-5.4-nano-2026-03-17 (high reasoning)Open Source OpenAI | 61 | Apr 2026 | |
| 69 | Claude-Sonnet-4.5Open Source Anthropic | 61 | Apr 2026 | |
| 70 | Bielik-2.3Open Source SpeakLeash | 61 | Apr 2026 | |
| 71 | Bielik-2.5Open Source SpeakLeash | 61 | Apr 2026 | |
| 72 | Claude-Sonnet-4API Anthropic | 61 | Apr 2026 | |
| 73 | MiniMax-M2.7Open Source MiniMaxAI | 60 | Apr 2026 | |
| 74 | GLM-4.5Open Source Zhipu AI | 60 | Apr 2026 | |
| 75 | Kimi-K2-ThinkingOpen Source Moonshot.AI | 59 | Apr 2026 | |
| 76 | GLM-4.7Open Source Zhipu AI | 59 | Apr 2026 | |
| 77 | Grok-4-FastOpen Source xAI | 59 | Apr 2026 | |
| 78 | Grok-4.20Open Source xAI | 59 | Apr 2026 | |
| 79 | Grok-2-1212Open Source xAI | 57 | Apr 2026 | |
| 80 | GLM-4.6Open Source Zhipu AI | 57 | Apr 2026 | |
| 81 | Bielik-2.1Open Source SpeakLeash | 56 | Apr 2026 | |
| 82 | GPT-4.1-mini-2025-04-14Open Source OpenAI | 56 | Apr 2026 | |
| 83 | GPT-4-turboAPI OpenAI | 56 | Apr 2026 | |
| 84 | Kimi-K2Open Source Moonshot.AI | 54 | Apr 2026 | |
| 85 | Qwen3-MaxOpen Source Alibaba | 54 | Apr 2026 | |
| 86 | Qwen3.5-27BOpen Source Alibaba | 54 | Apr 2026 | |
| 87 | Llama-3.1-Tulu-3-405BOpen Source Meta | 53 | Apr 2026 | |
| 88 | Kimi-K2-0905Open Source Moonshot.AI | 53 | Apr 2026 | |
| 89 | Claude-3.5-Haiku-20241022Open Source Anthropic | 52 | Apr 2026 | |
| 90 | Mistral-Small-4Open Source Mistral | 52 | Apr 2026 | |
| 91 | MiniMax-M2.5Open Source MiniMaxAI | 52 | Apr 2026 | |
| 92 | PLLuM-12B-nc-chatOpen Source PLLuM | 52 | Apr 2026 | |
| 93 | GPT-4o-mini-2024-07-18Open Source OpenAI | 51 | Apr 2026 | |
| 94 | Command-A-03-2025Open Source Cohere | 49 | Apr 2026 | |
| 95 | GPT-4 OpenAI | 48 | Apr 2026 | |
| 96 | GPT-5-nano-2025-08-07Open Source OpenAI | 47 | Apr 2026 | |
| 97 | Gemini-Flash-1.5Open Source Google | 47 | Apr 2026 | |
| 98 | GLM-4.5-AirOpen Source Zhipu AI | 47 | Apr 2026 | |
| 99 | O3-mini-2025-01-31Open Source OpenAI | 47 | Apr 2026 | |
| 100 | Command-R-Plus-04-2024Open Source Cohere | 46 | Apr 2026 | |
| 101 | Llama-PLLuM-70B-chatOpen Source PLLuM | 46 | Apr 2026 | |
| 102 | Bielik-Minitron-7B-v3.0-InstructOpen Source SpeakLeash | 46 | Apr 2026 | |
| 103 | Llama-PLLuM-70B-chat-250801Open Source PLLuM | 46 | Apr 2026 | |
| 104 | Claude-3.0-SonnetOpen Source Anthropic | 46 | Apr 2026 | |
| 105 | Claude-Haiku-4.5Open Source Anthropic | 45 | Apr 2026 | |
| 106 | Qwen3.5-35B-A3BOpen Source Alibaba | 45 | Apr 2026 | |
| 107 | Llama-4-MaverickOpen Source Meta | 45 | Apr 2026 | |
| 108 | Qwen-MaxOpen Source Alibaba | 45 | Apr 2026 | |
| 109 | PLLuM-8x7B-chatOpen Source PLLuM | 44 | Apr 2026 | |
| 110 | Llama-3.1-405bOpen Source Meta | 43 | Apr 2026 | |
| 111 | Qwen3-235B-A22B Alibaba | 43 | Apr 2026 | |
| 112 | Command-R-Plus-08-2024Open Source Cohere | 43 | Apr 2026 | |
| 113 | Mistral-Large-2411Open Source Mistral | 42 | Apr 2026 | |
| 114 | Llama-4-ScoutOpen Source Meta | 42 | Apr 2026 | |
| 115 | GPT-5.4-nano-2026-03-17 (no reasoning)Open Source OpenAI | 41 | Apr 2026 | |
| 116 | Mistral-Large-2407Open Source Mistral | 40 | Apr 2026 | |
| 117 | O1-mini-2024-09-12Open Source OpenAI | 40 | Apr 2026 | |
| 118 | Bielik-4.5B-v3.0-InstructOpen Source SpeakLeash | 39 | Apr 2026 | |
| 119 | Ministral-14b-2512Open Source Mistral | 39 | Apr 2026 | |
| 120 | GPT-4.1-nano-2025-04-14Open Source OpenAI | 38 | Apr 2026 | |
| 121 | Qwen3.5-9BOpen Source Alibaba | 38 | Apr 2026 | |
| 122 | WizardLM-2-8x22bOpen Source Microsoft | 38 | Apr 2026 | |
| 123 | GPT-OSS-120bOpen Source OpenAI | 38 | Apr 2026 | |
| 124 | Qwen-PlusOpen Source Alibaba | 38 | Apr 2026 | |
| 125 | Gemma-3-27b Google | 37 | Apr 2026 | |
| 126 | Mistral-Small-3.1-24B-2503Open Source Mistral | 37 | Apr 2026 | |
| 127 | Bielik-0.1Open Source SpeakLeash | 37 | Apr 2026 | |
| 128 | Qwen3-32BOpen Source Alibaba | 37 | Apr 2026 | |
| 129 | Llama-3.3-70BOpen Source Meta | 37 | Apr 2026 | |
| 130 | Qwen3-Next-80B-A3B-ThinkingOpen Source Alibaba | 37 | Apr 2026 | |
| 131 | Gemma-2-27bOpen Source Google | 37 | Apr 2026 | |
| 132 | Mistral-Small-24B-2501Open Source Mistral | 36 | Apr 2026 | |
| 133 | GPT-3.5-turboOpen Source OpenAI | 36 | Apr 2026 | |
| 134 | Qwen-2.5-72bOpen Source Alibaba | 36 | Apr 2026 | |
| 135 | Ministral-8b-2512Open Source Mistral | 35 | Apr 2026 | |
| 136 | Llama-PLLuM-8B-chatOpen Source PLLuM | 35 | Apr 2026 | |
| 137 | Mixtral-8x22bOpen Source Mistral | 35 | Apr 2026 | |
| 138 | Mistral-Small-3.2-24B-2506Open Source Mistral | 35 | Apr 2026 | |
| 139 | Llama-3.1-70BOpen Source Meta | 34 | Apr 2026 | |
| 140 | Qwen3-14BOpen Source Alibaba | 34 | Apr 2026 | |
| 141 | EuroLLM-9BOpen Source UTTER | 34 | Apr 2026 | |
| 142 | Qwen3.5-4BOpen Source Alibaba | 34 | Apr 2026 | |
| 143 | Qwen-2.5-32bOpen Source Alibaba | 33 | Apr 2026 | |
| 144 | PLLuM-12B-chatOpen Source PLLuM | 33 | Apr 2026 | |
| 145 | Qwen3-Next-80B-A3B-InstructOpen Source Alibaba | 32 | Apr 2026 | |
| 146 | Magistral-Small-2506Open Source Mistral | 31 | Apr 2026 | |
| 147 | Qwen-Turbo-2024-11-01Open Source Alibaba | 31 | Apr 2026 | |
| 148 | Gemma-2-9bOpen Source Google | 30 | Apr 2026 | |
| 149 | GLM-4.7-FlashOpen Source Zhipu AI | 30 | Apr 2026 | |
| 150 | Qwen-2.5-14bOpen Source Alibaba | 28 | Apr 2026 | |
| 151 | Qwen3-30B-A3BOpen Source Alibaba | 27 | Apr 2026 | |
| 152 | Phi-4 Microsoft | 26 | Apr 2026 | |
| 153 | Qwen3-8BOpen Source Alibaba | 25 | Apr 2026 | |
| 154 | Bielik-1.5B-v3.0-InstructOpen Source SpeakLeash | 23 | Apr 2026 | |
| 155 | GPT-OSS-20bOpen Source OpenAI | 23 | Apr 2026 | |
| 156 | Command-R7BOpen Source Cohere | 22 | Apr 2026 | |
| 157 | Ministral-3b-2512Open Source Mistral | 22 | Apr 2026 | |
| 158 | Ministral-8bOpen Source Mistral | 22 | Apr 2026 | |
| 159 | Llama-3.0-70BOpen Source Meta | 22 | Apr 2026 | |
| 160 | Qwen-2.5-7bOpen Source Alibaba | 21 | Apr 2026 | |
| 161 | Mistral-NemoOpen Source Mistral | 20 | Apr 2026 | |
| 162 | Qwen3.5-2BOpen Source Alibaba | 20 | Apr 2026 | |
| 163 | Mixtral-8x7bOpen Source Mistral | 20 | Apr 2026 | |
| 164 | Llama-3.1-8BOpen Source Meta | 19 | Apr 2026 | |
| 165 | Mistral-7b-v0.3Open Source Mistral | 16 | Apr 2026 |