Polish Cultural Competency2025en

Polish Linguistic and Cultural Competency Benchmark

Evaluates LLMs on Polish linguistic and cultural knowledge across 6 categories: art & entertainment, culture & tradition, geography, grammar, history, and vocabulary. Accuracy (0-100) per category. Created by Dadas et al. (2025).

Samples:165
Metrics:average, art-and-entertainment, culture-and-tradition, geography, grammar, history, vocabulary
Paper / WebsiteDownload
Current State of the Art

Gemini-3.1-Pro-Preview

Google

97

average

PLCC — average

165 results · 1 SOTA advances · higher is better

All results
SOTA frontier
2040608010020262027averageGemini-3.1-Pro-Preview

Model Size vs Score — Pareto Frontier

5 models · log scale · Pareto frontier shown

Global
Bielik
PLLuM
Pareto
29.029.530.030.531.031.532.032.533.033.534.034.535.035.536.036.537.037.538.038.539.039.540.040.541.041.542.042.543.043.544.044.545.045.546.046.547.047.548.048.549.049.550.050.551.051.552.052.553.053.554.054.555.055.556.056.557.057.558.011B14B24B32B70B120B235BParameters (log scale)average

Top Models Performance Comparison

Top 10 models ranked by average

average1Gemini-3.1-Pro-Preview97.0100.0%2Gemini-3.0-Pro-Preview95.898.8%3GPT-5.4-2026-03-05 (high ...92.295.0%4Gemini-2.5-Pro-Preview-06-0592.295.0%5Gemini-3-Flash-Preview91.794.5%6GPT-5-Pro-2025-10-06 (hig...91.093.8%7GPT-5.4-2026-03-05 (low r...90.593.3%8Grok-490.593.3%9GPT-5-2025-08-0789.592.3%10Gemini-2.5-Pro-Exp-03-2589.592.3%0%25%50%75%100%% of best
Best Score
97.0
Top Model
Gemini-3.1-Pro-Pr...
Models Compared
10
Score Range
7.5

art-and-entertainment

#ModelScorePaper / CodeDate
1
Gemini-3.0-Pro-PreviewOpen Source
Google
95Apr 2026
2
Gemini-3.1-Pro-PreviewOpen Source
Google
95Apr 2026
3
Gemini-3-Flash-PreviewOpen Source
Google
91Apr 2026
4
GPT-5.4-2026-03-05 (high reasoning)Open Source
OpenAI
91Apr 2026
5
Gemini-2.5-Pro-Preview-06-05Open Source
Google
91Apr 2026
6
GPT-4.5-preview-2025-02-27Open Source
OpenAI
90Apr 2026
7
GPT-5-Pro-2025-10-06 (high reasoning)Open Source
OpenAI
88Apr 2026
8
Gemini-2.5-Pro-Exp-03-25Open Source
Google
88Apr 2026
9
GPT-5.4-2026-03-05 (low reasoning)Open Source
OpenAI
87Apr 2026
10
Grok-4API
xAI
86Apr 2026
11
O1-2024-12-17Open Source
OpenAI
86Apr 2026
12
GPT-5-2025-08-07Open Source
OpenAI
85Apr 2026
13
GPT-5.1-2025-11-13 (high reasoning)Open Source
OpenAI
85Apr 2026
14
GPT-4o-2024-05-13Open Source
OpenAI
83Apr 2026
15
Gemini-Exp-1206Open Source
Google
83Apr 2026
16
O3-2025-04-16Open Source
OpenAI
83Apr 2026
17
GPT-4o-2024-11-20Open Source
OpenAI
82Apr 2026
18
GPT-4o-2024-08-06Open Source
OpenAI
82Apr 2026
19
Claude-3.7-SonnetOpen Source
Anthropic
80Apr 2026
20
GPT-5.2-2025-12-11 (xhigh reasoning)Open Source
OpenAI
79Apr 2026
21
GPT-5.4-2026-03-05 (no reasoning)Open Source
OpenAI
79Apr 2026
22
GPT-5.2-2025-12-11 (high reasoning)Open Source
OpenAI
78Apr 2026
23
Gemini-2.5-Flash-Preview-04-17Open Source
Google
78Apr 2026
24
GPT-4.1-2025-04-14Open Source
OpenAI
77Apr 2026
25
Claude-3.7-Sonnet-ThinkingOpen Source
Anthropic
77Apr 2026
26
Claude-3.5-Sonnet-20241022Open Source
Anthropic
77Apr 2026
27
GPT-5.4-mini-2026-03-17 (high reasoning)Open Source
OpenAI
76Apr 2026
28
Claude-Opus-4.6Open Source
Anthropic
75Apr 2026
29
GPT-5.2-2025-12-11 (medium reasoning)Open Source
OpenAI
74Apr 2026
30
Claude-Opus-4.5Open Source
Anthropic
74Apr 2026
31
Claude-3.5-Sonnet-20240620Open Source
Anthropic
73Apr 2026
32
Claude-3-OpusAPI
Anthropic
73Apr 2026
33
Claude-Opus-4API
Anthropic
72Apr 2026
34
PLLuM-8x7B-nc-chatOpen Source
PLLuM
72Apr 2026
35
GPT-5.1-2025-11-13 (default reasoning)Open Source
OpenAI
72Apr 2026
36
Gemini-2.0-Flash-Thinking-Exp-01-21Open Source
Google
72Apr 2026
37
PLLuM-12B-nc-chat-250715Open Source
PLLuM
72Apr 2026
38
DeepSeek-V3.2-SpecialeOpen Source
DeepSeek
71Apr 2026
39
Grok-3-BetaOpen Source
xAI
71Apr 2026
40
GPT-5.2-2025-12-11 (no reasoning)Open Source
OpenAI
70Apr 2026
41
Kimi-K2.5Open Source
Moonshot.AI
69Apr 2026
42
Bielik-11B-v3.0-InstructOpen Source
SpeakLeash
69Apr 2026
43
DeepSeek-v3.1 (thinking)Open Source
DeepSeek
69Apr 2026
44
Gemini-2.0-Flash-ExperimentalOpen Source
Google
68Apr 2026
45
Claude-Sonnet-4.6Open Source
Anthropic
67Apr 2026
46
Claude-Opus-4.1Open Source
Anthropic
67Apr 2026
47
DeepSeek-R1Open Source
DeepSeek
66Apr 2026
48
GLM-5API
Zhipu AI
66Apr 2026
49
DeepSeek-R1-0528Open Source
DeepSeek
65Apr 2026
50
Llama-3.1-Tulu-3-405BOpen Source
Meta
64Apr 2026
51
DeepSeek-v3-0324Open Source
DeepSeek
64Apr 2026
52
MiMo-V2-ProOpen Source
Xiaomi
64Apr 2026
53
GLM-4.7Open Source
Zhipu AI
64Apr 2026
54
DeepSeek-v3.1 (no thinking)Open Source
DeepSeek
63Apr 2026
55
Mistral-Large-2512Open Source
Mistral
63Apr 2026
56
Kimi-K2-ThinkingOpen Source
Moonshot.AI
63Apr 2026
57
Qwen3.5-397B-A17BOpen Source
Alibaba
63Apr 2026
58
O4-Mini-2025-04-16Open Source
OpenAI
62Apr 2026
59
Gemini-Pro-1.5Open Source
Google
62Apr 2026
60
GPT-5-mini-2025-08-07Open Source
OpenAI
62Apr 2026
61
Claude-Sonnet-4.5Open Source
Anthropic
61Apr 2026
62
DeepSeek-V3.2Open Source
DeepSeek
61Apr 2026
63
GPT-5.4-mini-2026-03-17 (no reasoning)Open Source
OpenAI
61Apr 2026
64
Grok-3-Mini-BetaOpen Source
xAI
61Apr 2026
65
DeepSeek-v3Open Source
DeepSeek
61Apr 2026
66
Bielik-2.6Open Source
SpeakLeash
61Apr 2026
67
GPT-4-turboAPI
OpenAI
61Apr 2026
68
PLLuM-12B-nc-chatOpen Source
PLLuM
59Apr 2026
69
DeepSeek-v3.2-ExpOpen Source
DeepSeek
59Apr 2026
70
Grok-4-FastOpen Source
xAI
59Apr 2026
71
GLM-4.6Open Source
Zhipu AI
59Apr 2026
72
Bielik-2.3Open Source
SpeakLeash
58Apr 2026
73
Grok-2-1212Open Source
xAI
57Apr 2026
74
Mistral-Medium-3Open Source
Mistral
56Apr 2026
75
Llama-3.1-405bOpen Source
Meta
56Apr 2026
76
GLM-4.5Open Source
Zhipu AI
56Apr 2026
77
Bielik-2.1Open Source
SpeakLeash
55Apr 2026
78
Claude-Sonnet-4API
Anthropic
55Apr 2026
79
Grok-4.20Open Source
xAI
55Apr 2026
80
Llama-PLLuM-70B-chat-250801Open Source
PLLuM
54Apr 2026
81
Grok-4.1-FastOpen Source
xAI
54Apr 2026
82
Bielik-2.2Open Source
SpeakLeash
54Apr 2026
83
Kimi-K2-0905Open Source
Moonshot.AI
54Apr 2026
84
Qwen3.5-122B-A10BOpen Source
Alibaba
53Apr 2026
85
Mistral-Small-4Open Source
Mistral
53Apr 2026
86
Bielik-2.5Open Source
SpeakLeash
52Apr 2026
87
GPT-4.1-mini-2025-04-14Open Source
OpenAI
51Apr 2026
88
Qwen3-MaxOpen Source
Alibaba
50Apr 2026
89
Kimi-K2Open Source
Moonshot.AI
50Apr 2026
90
GPT-5.4-nano-2026-03-17 (high reasoning)Open Source
OpenAI
50Apr 2026
91
GPT-4
OpenAI
49Apr 2026
92
Llama-PLLuM-70B-chatOpen Source
PLLuM
49Apr 2026
93
Mistral-Large-2407Open Source
Mistral
48Apr 2026
94
PLLuM-12B-chatOpen Source
PLLuM
48Apr 2026
95
GLM-4.5-AirOpen Source
Zhipu AI
48Apr 2026
96
GPT-5-nano-2025-08-07Open Source
OpenAI
47Apr 2026
97
Llama-4-MaverickOpen Source
Meta
46Apr 2026
98
O3-mini-2025-01-31Open Source
OpenAI
46Apr 2026
99
Claude-3.0-SonnetOpen Source
Anthropic
46Apr 2026
100
WizardLM-2-8x22bOpen Source
Microsoft
45Apr 2026
101
PLLuM-8x7B-chatOpen Source
PLLuM
45Apr 2026
102
Mixtral-8x22bOpen Source
Mistral
45Apr 2026
103
Command-A-03-2025Open Source
Cohere
44Apr 2026
104
Qwen3.5-35B-A3BOpen Source
Alibaba
44Apr 2026
105
Command-R-Plus-08-2024Open Source
Cohere
44Apr 2026
106
Gemma-3-27b
Google
43Apr 2026
107
Qwen3-Next-80B-A3B-ThinkingOpen Source
Alibaba
43Apr 2026
108
Llama-3.3-70BOpen Source
Meta
43Apr 2026
109
Bielik-0.1Open Source
SpeakLeash
43Apr 2026
110
MiniMax-M2.7Open Source
MiniMaxAI
43Apr 2026
111
Qwen-MaxOpen Source
Alibaba
43Apr 2026
112
Claude-3.5-Haiku-20241022Open Source
Anthropic
43Apr 2026
113
GPT-OSS-120bOpen Source
OpenAI
42Apr 2026
114
Llama-3.1-70BOpen Source
Meta
42Apr 2026
115
GPT-4o-mini-2024-07-18Open Source
OpenAI
42Apr 2026
116
Llama-3.0-70BOpen Source
Meta
40Apr 2026
117
Command-R-Plus-04-2024Open Source
Cohere
39Apr 2026
118
GPT-3.5-turboOpen Source
OpenAI
39Apr 2026
119
Bielik-Minitron-7B-v3.0-InstructOpen Source
SpeakLeash
39Apr 2026
120
Mistral-Large-2411Open Source
Mistral
39Apr 2026
121
MiniMax-M2.5Open Source
MiniMaxAI
39Apr 2026
122
Mistral-Small-3.2-24B-2506Open Source
Mistral
38Apr 2026
123
O1-mini-2024-09-12Open Source
OpenAI
38Apr 2026
124
Qwen3.5-27BOpen Source
Alibaba
37Apr 2026
125
Qwen3-235B-A22B
Alibaba
37Apr 2026
126
Claude-Haiku-4.5Open Source
Anthropic
36Apr 2026
127
Mistral-Small-3.1-24B-2503Open Source
Mistral
35Apr 2026
128
Qwen3-Next-80B-A3B-InstructOpen Source
Alibaba
34Apr 2026
129
Mistral-Small-24B-2501Open Source
Mistral
33Apr 2026
130
Llama-PLLuM-8B-chatOpen Source
PLLuM
33Apr 2026
131
Gemini-Flash-1.5Open Source
Google
33Apr 2026
132
Gemma-2-27bOpen Source
Google
32Apr 2026
133
Mixtral-8x7bOpen Source
Mistral
31Apr 2026
134
GLM-4.7-FlashOpen Source
Zhipu AI
31Apr 2026
135
GPT-4.1-nano-2025-04-14Open Source
OpenAI
30Apr 2026
136
Magistral-Small-2506Open Source
Mistral
30Apr 2026
137
EuroLLM-9BOpen Source
UTTER
30Apr 2026
138
Bielik-4.5B-v3.0-InstructOpen Source
SpeakLeash
28Apr 2026
139
Bielik-1.5B-v3.0-InstructOpen Source
SpeakLeash
27Apr 2026
140
Qwen-PlusOpen Source
Alibaba
26Apr 2026
141
GPT-5.4-nano-2026-03-17 (no reasoning)Open Source
OpenAI
26Apr 2026
142
Qwen-2.5-72bOpen Source
Alibaba
25Apr 2026
143
Ministral-14b-2512Open Source
Mistral
25Apr 2026
144
Llama-4-ScoutOpen Source
Meta
23Apr 2026
145
Phi-4
Microsoft
23Apr 2026
146
Qwen3.5-9BOpen Source
Alibaba
22Apr 2026
147
Mistral-7b-v0.3Open Source
Mistral
22Apr 2026
148
Qwen-2.5-14bOpen Source
Alibaba
21Apr 2026
149
Qwen3-32BOpen Source
Alibaba
21Apr 2026
150
Mistral-NemoOpen Source
Mistral
20Apr 2026
151
Ministral-8b-2512Open Source
Mistral
20Apr 2026
152
Qwen3-30B-A3BOpen Source
Alibaba
19Apr 2026
153
Llama-3.1-8BOpen Source
Meta
19Apr 2026
154
Gemma-2-9bOpen Source
Google
19Apr 2026
155
GPT-OSS-20bOpen Source
OpenAI
19Apr 2026
156
Qwen-2.5-32bOpen Source
Alibaba
17Apr 2026
157
Qwen-Turbo-2024-11-01Open Source
Alibaba
15Apr 2026
158
Command-R7BOpen Source
Cohere
14Apr 2026
159
Qwen3-14BOpen Source
Alibaba
14Apr 2026
160
Ministral-8bOpen Source
Mistral
14Apr 2026
161
Qwen3-8BOpen Source
Alibaba
12Apr 2026
162
Qwen3.5-4BOpen Source
Alibaba
12Apr 2026
163
Ministral-3b-2512Open Source
Mistral
11Apr 2026
164
Qwen3.5-2BOpen Source
Alibaba
5Apr 2026
165
Qwen-2.5-7bOpen Source
Alibaba
5Apr 2026

averagePrimary

#ModelScorePaper / CodeDate
1
Gemini-3.1-Pro-PreviewOpen Source
Google
97Apr 2026
2
Gemini-3.0-Pro-PreviewOpen Source
Google
95.833333Apr 2026
3
GPT-5.4-2026-03-05 (high reasoning)Open Source
OpenAI
92.166667Apr 2026
4
Gemini-2.5-Pro-Preview-06-05Open Source
Google
92.166667Apr 2026
5
Gemini-3-Flash-PreviewOpen Source
Google
91.666667Apr 2026
6
GPT-5-Pro-2025-10-06 (high reasoning)Open Source
OpenAI
91Apr 2026
7
GPT-5.4-2026-03-05 (low reasoning)Open Source
OpenAI
90.5Apr 2026
8
Grok-4API
xAI
90.5Apr 2026
9
GPT-5-2025-08-07Open Source
OpenAI
89.5Apr 2026
10
Gemini-2.5-Pro-Exp-03-25Open Source
Google
89.5Apr 2026
11
GPT-5.2-2025-12-11 (xhigh reasoning)Open Source
OpenAI
89.333333Apr 2026
12
O3-2025-04-16Open Source
OpenAI
89.166667Apr 2026
13
O1-2024-12-17Open Source
OpenAI
89.166667Apr 2026
14
GPT-5.1-2025-11-13 (high reasoning)Open Source
OpenAI
88.833333Apr 2026
15
GPT-5.2-2025-12-11 (high reasoning)Open Source
OpenAI
87.166667Apr 2026
16
GPT-4.5-preview-2025-02-27Open Source
OpenAI
86.5Apr 2026
17
GPT-5.4-mini-2026-03-17 (high reasoning)Open Source
OpenAI
85.166667Apr 2026
18
GPT-5.2-2025-12-11 (medium reasoning)Open Source
OpenAI
85Apr 2026
19
GPT-5.4-2026-03-05 (no reasoning)Open Source
OpenAI
84.333333Apr 2026
20
Gemini-2.5-Flash-Preview-04-17Open Source
Google
83.5Apr 2026
21
Gemini-Exp-1206Open Source
Google
83Apr 2026
22
Claude-3.5-Sonnet-20241022Open Source
Anthropic
82.666667Apr 2026
23
GPT-4o-2024-05-13Open Source
OpenAI
82.333333Apr 2026
24
Claude-3.7-Sonnet-ThinkingOpen Source
Anthropic
82.166667Apr 2026
25
Claude-Opus-4.6Open Source
Anthropic
81.833333Apr 2026
26
Claude-3.7-SonnetOpen Source
Anthropic
81.5Apr 2026
27
GPT-4o-2024-08-06Open Source
OpenAI
81.333333Apr 2026
28
GPT-4o-2024-11-20Open Source
OpenAI
81.333333Apr 2026
29
DeepSeek-V3.2-SpecialeOpen Source
DeepSeek
81Apr 2026
30
Claude-3.5-Sonnet-20240620Open Source
Anthropic
80.666667Apr 2026
31
GPT-4.1-2025-04-14Open Source
OpenAI
80.333333Apr 2026
32
Claude-Opus-4.5Open Source
Anthropic
80.333333Apr 2026
33
GLM-5API
Zhipu AI
80Apr 2026
34
Claude-Opus-4.1Open Source
Anthropic
79Apr 2026
35
GPT-5.2-2025-12-11 (no reasoning)Open Source
OpenAI
78.833333Apr 2026
36
DeepSeek-v3.1 (thinking)Open Source
DeepSeek
78.666667Apr 2026
37
Claude-Opus-4API
Anthropic
78.666667Apr 2026
38
MiMo-V2-ProOpen Source
Xiaomi
78.5Apr 2026
39
Kimi-K2.5Open Source
Moonshot.AI
77.833333Apr 2026
40
GPT-5.1-2025-11-13 (default reasoning)Open Source
OpenAI
77.833333Apr 2026
41
Claude-Sonnet-4.6Open Source
Anthropic
77.666667Apr 2026
42
GPT-5-mini-2025-08-07Open Source
OpenAI
77.5Apr 2026
43
Grok-3-BetaOpen Source
xAI
77.166667Apr 2026
44
DeepSeek-R1-0528Open Source
DeepSeek
76.166667Apr 2026
45
DeepSeek-R1Open Source
DeepSeek
76Apr 2026
46
Qwen3.5-397B-A17BOpen Source
Alibaba
75Apr 2026
47
Gemini-2.0-Flash-Thinking-Exp-01-21Open Source
Google
74.833333Apr 2026
48
Gemini-2.0-Flash-ExperimentalOpen Source
Google
74.166667Apr 2026
49
Claude-3-OpusAPI
Anthropic
73.833333Apr 2026
50
GLM-4.7Open Source
Zhipu AI
73.5Apr 2026
51
GPT-5.4-mini-2026-03-17 (no reasoning)Open Source
OpenAI
73Apr 2026
52
O4-Mini-2025-04-16Open Source
OpenAI
72.833333Apr 2026
53
Grok-4.1-FastOpen Source
xAI
72.333333Apr 2026
54
DeepSeek-V3.2Open Source
DeepSeek
71.666667Apr 2026
55
Kimi-K2-ThinkingOpen Source
Moonshot.AI
71.666667Apr 2026
56
Grok-3-Mini-BetaOpen Source
xAI
71.333333Apr 2026
57
DeepSeek-v3-0324Open Source
DeepSeek
71Apr 2026
58
Claude-Sonnet-4.5Open Source
Anthropic
71Apr 2026
59
DeepSeek-v3.1 (no thinking)Open Source
DeepSeek
71Apr 2026
60
Bielik-11B-v3.0-InstructOpen Source
SpeakLeash
70.666667Apr 2026
61
Mistral-Large-2512Open Source
Mistral
70.666667Apr 2026
62
GLM-4.6Open Source
Zhipu AI
70.666667Apr 2026
63
Grok-4-FastOpen Source
xAI
70.166667Apr 2026
64
DeepSeek-v3.2-ExpOpen Source
DeepSeek
70Apr 2026
65
PLLuM-12B-nc-chat-250715Open Source
PLLuM
69.666667Apr 2026
66
Gemini-Pro-1.5Open Source
Google
69.666667Apr 2026
67
DeepSeek-v3Open Source
DeepSeek
69.166667Apr 2026
68
Qwen3.5-122B-A10BOpen Source
Alibaba
68.333333Apr 2026
69
Claude-Sonnet-4API
Anthropic
68.166667Apr 2026
70
PLLuM-8x7B-nc-chatOpen Source
PLLuM
68.166667Apr 2026
71
Grok-4.20Open Source
xAI
67.833333Apr 2026
72
GPT-4-turboAPI
OpenAI
67Apr 2026
73
Mistral-Medium-3Open Source
Mistral
66.833333Apr 2026
74
GLM-4.5Open Source
Zhipu AI
66.5Apr 2026
75
Grok-2-1212Open Source
xAI
66Apr 2026
76
GPT-5.4-nano-2026-03-17 (high reasoning)Open Source
OpenAI
65.833333Apr 2026
77
Bielik-2.6Open Source
SpeakLeash
65.5Apr 2026
78
Llama-3.1-Tulu-3-405BOpen Source
Meta
63.833333Apr 2026
79
MiniMax-M2.7Open Source
MiniMaxAI
63.333333Apr 2026
80
Bielik-2.2Open Source
SpeakLeash
63Apr 2026
81
GPT-5-nano-2025-08-07Open Source
OpenAI
62.5Apr 2026
82
GPT-4.1-mini-2025-04-14Open Source
OpenAI
62.166667Apr 2026
83
Bielik-2.3Open Source
SpeakLeash
62.166667Apr 2026
84
Kimi-K2Open Source
Moonshot.AI
62Apr 2026
85
Bielik-2.5Open Source
SpeakLeash
62Apr 2026
86
Qwen3-MaxOpen Source
Alibaba
61.333333Apr 2026
87
Kimi-K2-0905Open Source
Moonshot.AI
61Apr 2026
88
Bielik-2.1Open Source
SpeakLeash
61Apr 2026
89
Llama-3.1-405bOpen Source
Meta
60Apr 2026
90
MiniMax-M2.5Open Source
MiniMaxAI
59.666667Apr 2026
91
GPT-4
OpenAI
59.5Apr 2026
92
PLLuM-12B-nc-chatOpen Source
PLLuM
59.5Apr 2026
93
O3-mini-2025-01-31Open Source
OpenAI
59.333333Apr 2026
94
Llama-PLLuM-70B-chatOpen Source
PLLuM
58.5Apr 2026
95
Llama-4-MaverickOpen Source
Meta
58.166667Apr 2026
96
Llama-PLLuM-70B-chat-250801Open Source
PLLuM
58Apr 2026
97
Claude-3.5-Haiku-20241022Open Source
Anthropic
57.833333Apr 2026
98
Qwen3.5-35B-A3BOpen Source
Alibaba
57Apr 2026
99
GPT-4o-mini-2024-07-18Open Source
OpenAI
56.833333Apr 2026
100
Claude-3.0-SonnetOpen Source
Anthropic
56.5Apr 2026
101
Mistral-Small-4Open Source
Mistral
56.333333Apr 2026
102
Command-A-03-2025Open Source
Cohere
56.166667Apr 2026
103
Qwen3-235B-A22B
Alibaba
55Apr 2026
104
GLM-4.5-AirOpen Source
Zhipu AI
54.666667Apr 2026
105
Qwen3-Next-80B-A3B-ThinkingOpen Source
Alibaba
54.333333Apr 2026
106
GPT-OSS-120bOpen Source
OpenAI
54.333333Apr 2026
107
Qwen3.5-27BOpen Source
Alibaba
54.333333Apr 2026
108
Mistral-Large-2407Open Source
Mistral
54.166667Apr 2026
109
PLLuM-8x7B-chatOpen Source
PLLuM
54.166667Apr 2026
110
Bielik-Minitron-7B-v3.0-InstructOpen Source
SpeakLeash
53Apr 2026
111
Mistral-Large-2411Open Source
Mistral
52Apr 2026
112
O1-mini-2024-09-12Open Source
OpenAI
51.666667Apr 2026
113
WizardLM-2-8x22bOpen Source
Microsoft
51.5Apr 2026
114
Qwen-MaxOpen Source
Alibaba
50.833333Apr 2026
115
Claude-Haiku-4.5Open Source
Anthropic
50.666667Apr 2026
116
Command-R-Plus-08-2024Open Source
Cohere
50.166667Apr 2026
117
Mixtral-8x22bOpen Source
Mistral
49.833333Apr 2026
118
Command-R-Plus-04-2024Open Source
Cohere
49.333333Apr 2026
119
Llama-3.3-70BOpen Source
Meta
48.833333Apr 2026
120
Llama-3.1-70BOpen Source
Meta
47.833333Apr 2026
121
Gemma-3-27b
Google
47.333333Apr 2026
122
PLLuM-12B-chatOpen Source
PLLuM
47Apr 2026
123
Bielik-0.1Open Source
SpeakLeash
46.666667Apr 2026
124
Gemini-Flash-1.5Open Source
Google
46.5Apr 2026
125
Mistral-Small-3.2-24B-2506Open Source
Mistral
46.166667Apr 2026
126
GPT-5.4-nano-2026-03-17 (no reasoning)Open Source
OpenAI
44.166667Apr 2026
127
GPT-4.1-nano-2025-04-14Open Source
OpenAI
43.666667Apr 2026
128
GPT-3.5-turboOpen Source
OpenAI
43.333333Apr 2026
129
Mistral-Small-3.1-24B-2503Open Source
Mistral
43.333333Apr 2026
130
Qwen3-Next-80B-A3B-InstructOpen Source
Alibaba
43Apr 2026
131
Llama-3.0-70BOpen Source
Meta
43Apr 2026
132
Gemma-2-27bOpen Source
Google
42.666667Apr 2026
133
GLM-4.7-FlashOpen Source
Zhipu AI
42.333333Apr 2026
134
Bielik-4.5B-v3.0-InstructOpen Source
SpeakLeash
42.333333Apr 2026
135
Llama-4-ScoutOpen Source
Meta
41.5Apr 2026
136
EuroLLM-9BOpen Source
UTTER
41Apr 2026
137
Qwen3.5-9BOpen Source
Alibaba
40.333333Apr 2026
138
Magistral-Small-2506Open Source
Mistral
39.333333Apr 2026
139
Qwen-2.5-72bOpen Source
Alibaba
39.166667Apr 2026
140
Ministral-14b-2512Open Source
Mistral
39Apr 2026
141
Mistral-Small-24B-2501Open Source
Mistral
39Apr 2026
142
Llama-PLLuM-8B-chatOpen Source
PLLuM
38.5Apr 2026
143
Qwen-PlusOpen Source
Alibaba
38.5Apr 2026
144
Qwen3-32BOpen Source
Alibaba
37.666667Apr 2026
145
Mixtral-8x7bOpen Source
Mistral
35.333333Apr 2026
146
Ministral-8b-2512Open Source
Mistral
35.166667Apr 2026
147
Qwen3-30B-A3BOpen Source
Alibaba
33Apr 2026
148
GPT-OSS-20bOpen Source
OpenAI
32.333333Apr 2026
149
Qwen-2.5-32bOpen Source
Alibaba
30.5Apr 2026
150
Qwen3-14BOpen Source
Alibaba
30.333333Apr 2026
151
Qwen3.5-4BOpen Source
Alibaba
29.666667Apr 2026
152
Phi-4
Microsoft
29.166667Apr 2026
153
Gemma-2-9bOpen Source
Google
29.166667Apr 2026
154
Qwen-Turbo-2024-11-01Open Source
Alibaba
28.5Apr 2026
155
Bielik-1.5B-v3.0-InstructOpen Source
SpeakLeash
27.5Apr 2026
156
Qwen-2.5-14bOpen Source
Alibaba
26.666667Apr 2026
157
Qwen3-8BOpen Source
Alibaba
26Apr 2026
158
Mistral-NemoOpen Source
Mistral
23Apr 2026
159
Command-R7BOpen Source
Cohere
22.833333Apr 2026
160
Llama-3.1-8BOpen Source
Meta
22.666667Apr 2026
161
Ministral-3b-2512Open Source
Mistral
22.333333Apr 2026
162
Mistral-7b-v0.3Open Source
Mistral
21.833333Apr 2026
163
Ministral-8bOpen Source
Mistral
20.666667Apr 2026
164
Qwen-2.5-7bOpen Source
Alibaba
17.666667Apr 2026
165
Qwen3.5-2BOpen Source
Alibaba
13.833333Apr 2026

culture-and-tradition

#ModelScorePaper / CodeDate
1
Gemini-3.1-Pro-PreviewOpen Source
Google
100Apr 2026
2
Gemini-3.0-Pro-PreviewOpen Source
Google
99Apr 2026
3
Gemini-3-Flash-PreviewOpen Source
Google
98Apr 2026
4
Gemini-2.5-Pro-Preview-06-05Open Source
Google
96Apr 2026
5
Grok-4API
xAI
95Apr 2026
6
GPT-5-Pro-2025-10-06 (high reasoning)Open Source
OpenAI
94Apr 2026
7
GPT-5.4-2026-03-05 (high reasoning)Open Source
OpenAI
93Apr 2026
8
GPT-5.2-2025-12-11 (xhigh reasoning)Open Source
OpenAI
93Apr 2026
9
GPT-5.4-2026-03-05 (low reasoning)Open Source
OpenAI
93Apr 2026
10
O1-2024-12-17Open Source
OpenAI
92Apr 2026
11
GPT-4o-2024-05-13Open Source
OpenAI
92Apr 2026
12
GPT-4.5-preview-2025-02-27Open Source
OpenAI
92Apr 2026
13
O3-2025-04-16Open Source
OpenAI
91Apr 2026
14
Gemini-2.5-Pro-Exp-03-25Open Source
Google
91Apr 2026
15
GPT-5.1-2025-11-13 (high reasoning)Open Source
OpenAI
90Apr 2026
16
Grok-3-BetaOpen Source
xAI
90Apr 2026
17
Gemini-Exp-1206Open Source
Google
90Apr 2026
18
GPT-4o-2024-08-06Open Source
OpenAI
89Apr 2026
19
GPT-5-2025-08-07Open Source
OpenAI
89Apr 2026
20
GPT-4o-2024-11-20Open Source
OpenAI
89Apr 2026
21
GPT-5.4-2026-03-05 (no reasoning)Open Source
OpenAI
88Apr 2026
22
Claude-3.5-Sonnet-20241022Open Source
Anthropic
87Apr 2026
23
GPT-5.2-2025-12-11 (high reasoning)Open Source
OpenAI
87Apr 2026
24
Claude-Opus-4.6Open Source
Anthropic
86Apr 2026
25
GPT-5.2-2025-12-11 (no reasoning)Open Source
OpenAI
86Apr 2026
26
Gemini-2.5-Flash-Preview-04-17Open Source
Google
85Apr 2026
27
Claude-3.5-Sonnet-20240620Open Source
Anthropic
85Apr 2026
28
GPT-4.1-2025-04-14Open Source
OpenAI
84Apr 2026
29
GPT-5.2-2025-12-11 (medium reasoning)Open Source
OpenAI
84Apr 2026
30
Claude-3.7-SonnetOpen Source
Anthropic
83Apr 2026
31
GPT-5.4-mini-2026-03-17 (high reasoning)Open Source
OpenAI
83Apr 2026
32
Claude-Opus-4.1Open Source
Anthropic
83Apr 2026
33
GPT-5.1-2025-11-13 (default reasoning)Open Source
OpenAI
82Apr 2026
34
Claude-Opus-4.5Open Source
Anthropic
82Apr 2026
35
Claude-Sonnet-4.6Open Source
Anthropic
82Apr 2026
36
Claude-3.7-Sonnet-ThinkingOpen Source
Anthropic
82Apr 2026
37
Claude-Opus-4API
Anthropic
81Apr 2026
38
GLM-5API
Zhipu AI
81Apr 2026
39
MiMo-V2-ProOpen Source
Xiaomi
79Apr 2026
40
GLM-4.7Open Source
Zhipu AI
79Apr 2026
41
Kimi-K2.5Open Source
Moonshot.AI
78Apr 2026
42
DeepSeek-V3.2Open Source
DeepSeek
78Apr 2026
43
Bielik-11B-v3.0-InstructOpen Source
SpeakLeash
78Apr 2026
44
Gemini-2.0-Flash-ExperimentalOpen Source
Google
78Apr 2026
45
Gemini-Pro-1.5Open Source
Google
77Apr 2026
46
DeepSeek-v3.1 (thinking)Open Source
DeepSeek
76Apr 2026
47
PLLuM-8x7B-nc-chatOpen Source
PLLuM
76Apr 2026
48
Gemini-2.0-Flash-Thinking-Exp-01-21Open Source
Google
76Apr 2026
49
GLM-4.6Open Source
Zhipu AI
76Apr 2026
50
DeepSeek-V3.2-SpecialeOpen Source
DeepSeek
76Apr 2026
51
Claude-3-OpusAPI
Anthropic
76Apr 2026
52
DeepSeek-v3-0324Open Source
DeepSeek
76Apr 2026
53
DeepSeek-R1Open Source
DeepSeek
75Apr 2026
54
PLLuM-12B-nc-chat-250715Open Source
PLLuM
75Apr 2026
55
DeepSeek-R1-0528Open Source
DeepSeek
75Apr 2026
56
Mistral-Large-2512Open Source
Mistral
75Apr 2026
57
Grok-4.1-FastOpen Source
xAI
74Apr 2026
58
GPT-5-mini-2025-08-07Open Source
OpenAI
74Apr 2026
59
GPT-4-turboAPI
OpenAI
74Apr 2026
60
DeepSeek-v3Open Source
DeepSeek
73Apr 2026
61
O4-Mini-2025-04-16Open Source
OpenAI
73Apr 2026
62
Qwen3.5-397B-A17BOpen Source
Alibaba
73Apr 2026
63
GPT-5.4-mini-2026-03-17 (no reasoning)Open Source
OpenAI
73Apr 2026
64
Claude-Sonnet-4.5Open Source
Anthropic
72Apr 2026
65
Claude-Sonnet-4API
Anthropic
72Apr 2026
66
Kimi-K2-ThinkingOpen Source
Moonshot.AI
71Apr 2026
67
Grok-4-FastOpen Source
xAI
71Apr 2026
68
DeepSeek-v3.2-ExpOpen Source
DeepSeek
71Apr 2026
69
DeepSeek-v3.1 (no thinking)Open Source
DeepSeek
69Apr 2026
70
Bielik-2.6Open Source
SpeakLeash
68Apr 2026
71
GLM-4.5Open Source
Zhipu AI
68Apr 2026
72
Mistral-Medium-3Open Source
Mistral
67Apr 2026
73
Grok-2-1212Open Source
xAI
67Apr 2026
74
Grok-3-Mini-BetaOpen Source
xAI
67Apr 2026
75
Kimi-K2Open Source
Moonshot.AI
67Apr 2026
76
Grok-4.20Open Source
xAI
65Apr 2026
77
PLLuM-12B-nc-chatOpen Source
PLLuM
65Apr 2026
78
Bielik-2.1Open Source
SpeakLeash
64Apr 2026
79
Llama-PLLuM-70B-chatOpen Source
PLLuM
64Apr 2026
80
Llama-3.1-Tulu-3-405BOpen Source
Meta
64Apr 2026
81
Kimi-K2-0905Open Source
Moonshot.AI
63Apr 2026
82
GPT-4
OpenAI
63Apr 2026
83
GPT-4.1-mini-2025-04-14Open Source
OpenAI
62Apr 2026
84
Qwen3.5-122B-A10BOpen Source
Alibaba
62Apr 2026
85
Llama-PLLuM-70B-chat-250801Open Source
PLLuM
62Apr 2026
86
Claude-3.5-Haiku-20241022Open Source
Anthropic
62Apr 2026
87
Bielik-2.5Open Source
SpeakLeash
61Apr 2026
88
Bielik-2.3Open Source
SpeakLeash
61Apr 2026
89
Bielik-2.2Open Source
SpeakLeash
60Apr 2026
90
PLLuM-8x7B-chatOpen Source
PLLuM
60Apr 2026
91
MiniMax-M2.7Open Source
MiniMaxAI
59Apr 2026
92
MiniMax-M2.5Open Source
MiniMaxAI
59Apr 2026
93
GPT-5-nano-2025-08-07Open Source
OpenAI
59Apr 2026
94
GPT-4o-mini-2024-07-18Open Source
OpenAI
57Apr 2026
95
GPT-5.4-nano-2026-03-17 (high reasoning)Open Source
OpenAI
57Apr 2026
96
Llama-3.1-405bOpen Source
Meta
57Apr 2026
97
Bielik-Minitron-7B-v3.0-InstructOpen Source
SpeakLeash
57Apr 2026
98
Qwen3-MaxOpen Source
Alibaba
57Apr 2026
99
Command-A-03-2025Open Source
Cohere
55Apr 2026
100
Gemma-3-27b
Google
55Apr 2026
101
Claude-3.0-SonnetOpen Source
Anthropic
53Apr 2026
102
Mistral-Large-2407Open Source
Mistral
52Apr 2026
103
Bielik-0.1Open Source
SpeakLeash
52Apr 2026
104
Mistral-Large-2411Open Source
Mistral
52Apr 2026
105
Claude-Haiku-4.5Open Source
Anthropic
52Apr 2026
106
Command-R-Plus-04-2024Open Source
Cohere
52Apr 2026
107
Llama-4-MaverickOpen Source
Meta
52Apr 2026
108
GLM-4.5-AirOpen Source
Zhipu AI
51Apr 2026
109
O3-mini-2025-01-31Open Source
OpenAI
51Apr 2026
110
WizardLM-2-8x22bOpen Source
Microsoft
50Apr 2026
111
Qwen-MaxOpen Source
Alibaba
50Apr 2026
112
PLLuM-12B-chatOpen Source
PLLuM
49Apr 2026
113
Command-R-Plus-08-2024Open Source
Cohere
49Apr 2026
114
Mistral-Small-4Open Source
Mistral
49Apr 2026
115
Qwen3.5-35B-A3BOpen Source
Alibaba
46Apr 2026
116
Qwen3.5-27BOpen Source
Alibaba
46Apr 2026
117
GPT-OSS-120bOpen Source
OpenAI
46Apr 2026
118
Qwen3-235B-A22B
Alibaba
45Apr 2026
119
Qwen3-Next-80B-A3B-ThinkingOpen Source
Alibaba
45Apr 2026
120
GPT-5.4-nano-2026-03-17 (no reasoning)Open Source
OpenAI
44Apr 2026
121
Bielik-4.5B-v3.0-InstructOpen Source
SpeakLeash
44Apr 2026
122
O1-mini-2024-09-12Open Source
OpenAI
44Apr 2026
123
Mixtral-8x22bOpen Source
Mistral
41Apr 2026
124
Gemini-Flash-1.5Open Source
Google
41Apr 2026
125
Llama-3.1-70BOpen Source
Meta
41Apr 2026
126
Gemma-2-27bOpen Source
Google
41Apr 2026
127
GPT-4.1-nano-2025-04-14Open Source
OpenAI
40Apr 2026
128
EuroLLM-9BOpen Source
UTTER
40Apr 2026
129
Llama-3.3-70BOpen Source
Meta
40Apr 2026
130
GLM-4.7-FlashOpen Source
Zhipu AI
40Apr 2026
131
Mistral-Small-3.2-24B-2506Open Source
Mistral
39Apr 2026
132
Mistral-Small-3.1-24B-2503Open Source
Mistral
39Apr 2026
133
GPT-3.5-turboOpen Source
OpenAI
38Apr 2026
134
Llama-3.0-70BOpen Source
Meta
38Apr 2026
135
Qwen3-Next-80B-A3B-InstructOpen Source
Alibaba
36Apr 2026
136
Qwen3.5-9BOpen Source
Alibaba
36Apr 2026
137
Llama-4-ScoutOpen Source
Meta
35Apr 2026
138
Llama-PLLuM-8B-chatOpen Source
PLLuM
34Apr 2026
139
Qwen-PlusOpen Source
Alibaba
32Apr 2026
140
Ministral-8b-2512Open Source
Mistral
30Apr 2026
141
Qwen-2.5-72bOpen Source
Alibaba
30Apr 2026
142
Qwen3-30B-A3BOpen Source
Alibaba
30Apr 2026
143
Mistral-Small-24B-2501Open Source
Mistral
29Apr 2026
144
Ministral-14b-2512Open Source
Mistral
29Apr 2026
145
Magistral-Small-2506Open Source
Mistral
29Apr 2026
146
Qwen3-32BOpen Source
Alibaba
28Apr 2026
147
Mixtral-8x7bOpen Source
Mistral
27Apr 2026
148
GPT-OSS-20bOpen Source
OpenAI
26Apr 2026
149
Bielik-1.5B-v3.0-InstructOpen Source
SpeakLeash
25Apr 2026
150
Qwen3.5-4BOpen Source
Alibaba
24Apr 2026
151
Gemma-2-9bOpen Source
Google
23Apr 2026
152
Qwen-2.5-32bOpen Source
Alibaba
21Apr 2026
153
Qwen-Turbo-2024-11-01Open Source
Alibaba
20Apr 2026
154
Command-R7BOpen Source
Cohere
18Apr 2026
155
Qwen-2.5-14bOpen Source
Alibaba
17Apr 2026
156
Ministral-3b-2512Open Source
Mistral
17Apr 2026
157
Phi-4
Microsoft
17Apr 2026
158
Qwen3-14BOpen Source
Alibaba
16Apr 2026
159
Qwen3.5-2BOpen Source
Alibaba
13Apr 2026
160
Qwen3-8BOpen Source
Alibaba
13Apr 2026
161
Llama-3.1-8BOpen Source
Meta
13Apr 2026
162
Mistral-NemoOpen Source
Mistral
13Apr 2026
163
Ministral-8bOpen Source
Mistral
12Apr 2026
164
Qwen-2.5-7bOpen Source
Alibaba
11Apr 2026
165
Mistral-7b-v0.3Open Source
Mistral
9Apr 2026

geography

#ModelScorePaper / CodeDate
1
Gemini-3.1-Pro-PreviewOpen Source
Google
100Apr 2026
2
Gemini-3.0-Pro-PreviewOpen Source
Google
100Apr 2026
3
Gemini-2.5-Pro-Preview-06-05Open Source
Google
98Apr 2026
4
O3-2025-04-16Open Source
OpenAI
97Apr 2026
5
GPT-5.1-2025-11-13 (high reasoning)Open Source
OpenAI
97Apr 2026
6
GPT-5-2025-08-07Open Source
OpenAI
97Apr 2026
7
GPT-5.4-2026-03-05 (low reasoning)Open Source
OpenAI
97Apr 2026
8
Gemini-2.5-Pro-Exp-03-25Open Source
Google
97Apr 2026
9
GPT-5-Pro-2025-10-06 (high reasoning)Open Source
OpenAI
96Apr 2026
10
GPT-5.4-2026-03-05 (high reasoning)Open Source
OpenAI
96Apr 2026
11
Gemini-3-Flash-PreviewOpen Source
Google
96Apr 2026
12
O1-2024-12-17Open Source
OpenAI
95Apr 2026
13
GPT-5.2-2025-12-11 (high reasoning)Open Source
OpenAI
95Apr 2026
14
GPT-5.2-2025-12-11 (xhigh reasoning)Open Source
OpenAI
94Apr 2026
15
GPT-5-mini-2025-08-07Open Source
OpenAI
94Apr 2026
16
DeepSeek-V3.2-SpecialeOpen Source
DeepSeek
94Apr 2026
17
Grok-4API
xAI
94Apr 2026
18
GPT-5.2-2025-12-11 (medium reasoning)Open Source
OpenAI
94Apr 2026
19
Gemini-2.5-Flash-Preview-04-17Open Source
Google
94Apr 2026
20
GPT-5.4-mini-2026-03-17 (high reasoning)Open Source
OpenAI
92Apr 2026
21
GLM-5API
Zhipu AI
91Apr 2026
22
GPT-4.5-preview-2025-02-27Open Source
OpenAI
90Apr 2026
23
GPT-4o-2024-05-13Open Source
OpenAI
89Apr 2026
24
DeepSeek-v3.1 (thinking)Open Source
DeepSeek
89Apr 2026
25
GPT-4.1-2025-04-14Open Source
OpenAI
89Apr 2026
26
MiMo-V2-ProOpen Source
Xiaomi
89Apr 2026
27
Claude-Opus-4.6Open Source
Anthropic
88Apr 2026
28
GPT-5.4-2026-03-05 (no reasoning)Open Source
OpenAI
88Apr 2026
29
GPT-4o-2024-08-06Open Source
OpenAI
88Apr 2026
30
GLM-4.7Open Source
Zhipu AI
88Apr 2026
31
O4-Mini-2025-04-16Open Source
OpenAI
88Apr 2026
32
Claude-3.7-Sonnet-ThinkingOpen Source
Anthropic
87Apr 2026
33
Claude-3.7-SonnetOpen Source
Anthropic
87Apr 2026
34
Claude-3.5-Sonnet-20240620Open Source
Anthropic
86Apr 2026
35
Claude-Opus-4.1Open Source
Anthropic
86Apr 2026
36
GPT-4o-2024-11-20Open Source
OpenAI
86Apr 2026
37
Kimi-K2.5Open Source
Moonshot.AI
86Apr 2026
38
Gemini-Exp-1206Open Source
Google
86Apr 2026
39
GPT-5.2-2025-12-11 (no reasoning)Open Source
OpenAI
86Apr 2026
40
GPT-5.1-2025-11-13 (default reasoning)Open Source
OpenAI
86Apr 2026
41
Grok-4.1-FastOpen Source
xAI
85Apr 2026
42
DeepSeek-R1-0528Open Source
DeepSeek
85Apr 2026
43
Qwen3.5-397B-A17BOpen Source
Alibaba
85Apr 2026
44
Claude-3.5-Sonnet-20241022Open Source
Anthropic
85Apr 2026
45
Kimi-K2-ThinkingOpen Source
Moonshot.AI
84Apr 2026
46
DeepSeek-R1Open Source
DeepSeek
84Apr 2026
47
Gemini-2.0-Flash-Thinking-Exp-01-21Open Source
Google
84Apr 2026
48
Claude-Opus-4.5Open Source
Anthropic
84Apr 2026
49
Grok-3-Mini-BetaOpen Source
xAI
84Apr 2026
50
Claude-Opus-4API
Anthropic
83Apr 2026
51
Grok-3-BetaOpen Source
xAI
83Apr 2026
52
Qwen3.5-122B-A10BOpen Source
Alibaba
83Apr 2026
53
GLM-4.6Open Source
Zhipu AI
82Apr 2026
54
GPT-5.4-mini-2026-03-17 (no reasoning)Open Source
OpenAI
82Apr 2026
55
MiniMax-M2.7Open Source
MiniMaxAI
82Apr 2026
56
DeepSeek-v3.1 (no thinking)Open Source
DeepSeek
82Apr 2026
57
Claude-Sonnet-4.6Open Source
Anthropic
81Apr 2026
58
DeepSeek-v3.2-ExpOpen Source
DeepSeek
80Apr 2026
59
GPT-5-nano-2025-08-07Open Source
OpenAI
80Apr 2026
60
Claude-3-OpusAPI
Anthropic
80Apr 2026
61
PLLuM-12B-nc-chat-250715Open Source
PLLuM
79Apr 2026
62
GPT-4-turboAPI
OpenAI
79Apr 2026
63
Gemini-2.0-Flash-ExperimentalOpen Source
Google
79Apr 2026
64
DeepSeek-v3Open Source
DeepSeek
79Apr 2026
65
GLM-4.5Open Source
Zhipu AI
79Apr 2026
66
Grok-4-FastOpen Source
xAI
79Apr 2026
67
Claude-Sonnet-4.5Open Source
Anthropic
79Apr 2026
68
DeepSeek-v3-0324Open Source
DeepSeek
78Apr 2026
69
O3-mini-2025-01-31Open Source
OpenAI
78Apr 2026
70
DeepSeek-V3.2Open Source
DeepSeek
78Apr 2026
71
Mistral-Medium-3Open Source
Mistral
77Apr 2026
72
Claude-Sonnet-4API
Anthropic
77Apr 2026
73
Grok-2-1212Open Source
xAI
77Apr 2026
74
GPT-5.4-nano-2026-03-17 (high reasoning)Open Source
OpenAI
77Apr 2026
75
Mistral-Large-2512Open Source
Mistral
76Apr 2026
76
Bielik-11B-v3.0-InstructOpen Source
SpeakLeash
75Apr 2026
77
Qwen3-MaxOpen Source
Alibaba
75Apr 2026
78
GPT-4.1-mini-2025-04-14Open Source
OpenAI
75Apr 2026
79
Bielik-2.6Open Source
SpeakLeash
75Apr 2026
80
Llama-3.1-405bOpen Source
Meta
74Apr 2026
81
Grok-4.20Open Source
xAI
74Apr 2026
82
Gemini-Pro-1.5Open Source
Google
74Apr 2026
83
Qwen3.5-35B-A3BOpen Source
Alibaba
73Apr 2026
84
PLLuM-8x7B-nc-chatOpen Source
PLLuM
73Apr 2026
85
Bielik-2.5Open Source
SpeakLeash
72Apr 2026
86
Claude-3.5-Haiku-20241022Open Source
Anthropic
72Apr 2026
87
Bielik-2.2Open Source
SpeakLeash
72Apr 2026
88
Llama-4-MaverickOpen Source
Meta
71Apr 2026
89
GPT-OSS-120bOpen Source
OpenAI
71Apr 2026
90
Llama-3.1-Tulu-3-405BOpen Source
Meta
71Apr 2026
91
Kimi-K2Open Source
Moonshot.AI
70Apr 2026
92
PLLuM-12B-nc-chatOpen Source
PLLuM
70Apr 2026
93
GPT-4o-mini-2024-07-18Open Source
OpenAI
69Apr 2026
94
Qwen3-235B-A22B
Alibaba
69Apr 2026
95
MiniMax-M2.5Open Source
MiniMaxAI
68Apr 2026
96
Bielik-2.3Open Source
SpeakLeash
68Apr 2026
97
Llama-PLLuM-70B-chatOpen Source
PLLuM
68Apr 2026
98
Bielik-2.1Open Source
SpeakLeash
68Apr 2026
99
GPT-4
OpenAI
67Apr 2026
100
Kimi-K2-0905Open Source
Moonshot.AI
67Apr 2026
101
Command-A-03-2025Open Source
Cohere
67Apr 2026
102
O1-mini-2024-09-12Open Source
OpenAI
66Apr 2026
103
PLLuM-8x7B-chatOpen Source
PLLuM
66Apr 2026
104
Claude-3.0-SonnetOpen Source
Anthropic
65Apr 2026
105
Mistral-Small-4Open Source
Mistral
64Apr 2026
106
GLM-4.5-AirOpen Source
Zhipu AI
64Apr 2026
107
Qwen3-Next-80B-A3B-ThinkingOpen Source
Alibaba
64Apr 2026
108
Qwen3.5-27BOpen Source
Alibaba
64Apr 2026
109
Llama-PLLuM-70B-chat-250801Open Source
PLLuM
63Apr 2026
110
Mistral-Large-2407Open Source
Mistral
63Apr 2026
111
Bielik-Minitron-7B-v3.0-InstructOpen Source
SpeakLeash
62Apr 2026
112
Command-R-Plus-08-2024Open Source
Cohere
61Apr 2026
113
Bielik-0.1Open Source
SpeakLeash
61Apr 2026
114
Gemini-Flash-1.5Open Source
Google
61Apr 2026
115
Mistral-Large-2411Open Source
Mistral
61Apr 2026
116
WizardLM-2-8x22bOpen Source
Microsoft
60Apr 2026
117
Mixtral-8x22bOpen Source
Mistral
59Apr 2026
118
GPT-4.1-nano-2025-04-14Open Source
OpenAI
59Apr 2026
119
Llama-3.3-70BOpen Source
Meta
59Apr 2026
120
Llama-3.1-70BOpen Source
Meta
58Apr 2026
121
GPT-3.5-turboOpen Source
OpenAI
55Apr 2026
122
GLM-4.7-FlashOpen Source
Zhipu AI
55Apr 2026
123
PLLuM-12B-chatOpen Source
PLLuM
54Apr 2026
124
EuroLLM-9BOpen Source
UTTER
54Apr 2026
125
Qwen-MaxOpen Source
Alibaba
53Apr 2026
126
Command-R-Plus-04-2024Open Source
Cohere
53Apr 2026
127
Bielik-4.5B-v3.0-InstructOpen Source
SpeakLeash
53Apr 2026
128
Claude-Haiku-4.5Open Source
Anthropic
52Apr 2026
129
GPT-5.4-nano-2026-03-17 (no reasoning)Open Source
OpenAI
52Apr 2026
130
Mistral-Small-3.2-24B-2506Open Source
Mistral
51Apr 2026
131
Gemma-3-27b
Google
51Apr 2026
132
Llama-4-ScoutOpen Source
Meta
51Apr 2026
133
Llama-3.0-70BOpen Source
Meta
49Apr 2026
134
Gemma-2-27bOpen Source
Google
47Apr 2026
135
Qwen3-Next-80B-A3B-InstructOpen Source
Alibaba
46Apr 2026
136
Llama-PLLuM-8B-chatOpen Source
PLLuM
46Apr 2026
137
Mistral-Small-3.1-24B-2503Open Source
Mistral
45Apr 2026
138
Qwen-2.5-72bOpen Source
Alibaba
45Apr 2026
139
Ministral-14b-2512Open Source
Mistral
45Apr 2026
140
Magistral-Small-2506Open Source
Mistral
45Apr 2026
141
Qwen3.5-9BOpen Source
Alibaba
44Apr 2026
142
Mixtral-8x7bOpen Source
Mistral
44Apr 2026
143
Mistral-Small-24B-2501Open Source
Mistral
42Apr 2026
144
Qwen-PlusOpen Source
Alibaba
42Apr 2026
145
Ministral-8b-2512Open Source
Mistral
39Apr 2026
146
Qwen3-32BOpen Source
Alibaba
37Apr 2026
147
Bielik-1.5B-v3.0-InstructOpen Source
SpeakLeash
35Apr 2026
148
GPT-OSS-20bOpen Source
OpenAI
35Apr 2026
149
Phi-4
Microsoft
35Apr 2026
150
Command-R7BOpen Source
Cohere
33Apr 2026
151
Llama-3.1-8BOpen Source
Meta
31Apr 2026
152
Qwen3-30B-A3BOpen Source
Alibaba
31Apr 2026
153
Qwen3-14BOpen Source
Alibaba
30Apr 2026
154
Gemma-2-9bOpen Source
Google
30Apr 2026
155
Qwen-Turbo-2024-11-01Open Source
Alibaba
30Apr 2026
156
Qwen3-8BOpen Source
Alibaba
27Apr 2026
157
Mistral-7b-v0.3Open Source
Mistral
27Apr 2026
158
Qwen3.5-4BOpen Source
Alibaba
27Apr 2026
159
Mistral-NemoOpen Source
Mistral
26Apr 2026
160
Qwen-2.5-32bOpen Source
Alibaba
25Apr 2026
161
Ministral-3b-2512Open Source
Mistral
24Apr 2026
162
Qwen-2.5-14bOpen Source
Alibaba
23Apr 2026
163
Ministral-8bOpen Source
Mistral
19Apr 2026
164
Qwen-2.5-7bOpen Source
Alibaba
17Apr 2026
165
Qwen3.5-2BOpen Source
Alibaba
12Apr 2026

grammar

#ModelScorePaper / CodeDate
1
Gemini-3.1-Pro-PreviewOpen Source
Google
93Apr 2026
2
Gemini-3.0-Pro-PreviewOpen Source
Google
91Apr 2026
3
GPT-5.4-2026-03-05 (high reasoning)Open Source
OpenAI
90Apr 2026
4
Grok-4API
xAI
90Apr 2026
5
GPT-5.2-2025-12-11 (xhigh reasoning)Open Source
OpenAI
89Apr 2026
6
GPT-5.4-2026-03-05 (low reasoning)Open Source
OpenAI
88Apr 2026
7
GPT-5.2-2025-12-11 (high reasoning)Open Source
OpenAI
87Apr 2026
8
Gemini-2.5-Pro-Preview-06-05Open Source
Google
86Apr 2026
9
GPT-5.4-mini-2026-03-17 (high reasoning)Open Source
OpenAI
85Apr 2026
10
GPT-5-Pro-2025-10-06 (high reasoning)Open Source
OpenAI
85Apr 2026
11
O3-2025-04-16Open Source
OpenAI
85Apr 2026
12
Gemini-3-Flash-PreviewOpen Source
Google
85Apr 2026
13
O1-2024-12-17Open Source
OpenAI
84Apr 2026
14
DeepSeek-V3.2-SpecialeOpen Source
DeepSeek
84Apr 2026
15
GPT-5-2025-08-07Open Source
OpenAI
84Apr 2026
16
GLM-5API
Zhipu AI
82Apr 2026
17
GPT-5-mini-2025-08-07Open Source
OpenAI
82Apr 2026
18
GPT-5.1-2025-11-13 (high reasoning)Open Source
OpenAI
82Apr 2026
19
GPT-5.2-2025-12-11 (medium reasoning)Open Source
OpenAI
82Apr 2026
20
Claude-Sonnet-4.6Open Source
Anthropic
80Apr 2026
21
Kimi-K2.5Open Source
Moonshot.AI
80Apr 2026
22
Claude-3.7-Sonnet-ThinkingOpen Source
Anthropic
80Apr 2026
23
Claude-Opus-4.5Open Source
Anthropic
79Apr 2026
24
GPT-5.4-2026-03-05 (no reasoning)Open Source
OpenAI
79Apr 2026
25
Gemini-2.5-Pro-Exp-03-25Open Source
Google
79Apr 2026
26
MiMo-V2-ProOpen Source
Xiaomi
79Apr 2026
27
Claude-3.5-Sonnet-20241022Open Source
Anthropic
79Apr 2026
28
Claude-Opus-4.6Open Source
Anthropic
77Apr 2026
29
Gemini-2.5-Flash-Preview-04-17Open Source
Google
77Apr 2026
30
Claude-Opus-4API
Anthropic
76Apr 2026
31
Qwen3.5-397B-A17BOpen Source
Alibaba
76Apr 2026
32
Claude-3.5-Sonnet-20240620Open Source
Anthropic
75Apr 2026
33
DeepSeek-v3.1 (thinking)Open Source
DeepSeek
75Apr 2026
34
DeepSeek-R1Open Source
DeepSeek
74Apr 2026
35
Claude-Opus-4.1Open Source
Anthropic
74Apr 2026
36
Claude-3.7-SonnetOpen Source
Anthropic
74Apr 2026
37
GPT-4.5-preview-2025-02-27Open Source
OpenAI
74Apr 2026
38
GPT-5.4-nano-2026-03-17 (high reasoning)Open Source
OpenAI
74Apr 2026
39
DeepSeek-R1-0528Open Source
DeepSeek
73Apr 2026
40
Qwen3.5-122B-A10BOpen Source
Alibaba
73Apr 2026
41
Kimi-K2-ThinkingOpen Source
Moonshot.AI
73Apr 2026
42
O4-Mini-2025-04-16Open Source
OpenAI
72Apr 2026
43
Grok-4.1-FastOpen Source
xAI
72Apr 2026
44
Grok-4-FastOpen Source
xAI
72Apr 2026
45
Grok-4.20Open Source
xAI
72Apr 2026
46
MiniMax-M2.7Open Source
MiniMaxAI
72Apr 2026
47
MiniMax-M2.5Open Source
MiniMaxAI
71Apr 2026
48
Grok-3-Mini-BetaOpen Source
xAI
71Apr 2026
49
GPT-5.4-mini-2026-03-17 (no reasoning)Open Source
OpenAI
70Apr 2026
50
GPT-5.1-2025-11-13 (default reasoning)Open Source
OpenAI
70Apr 2026
51
GPT-4o-2024-05-13Open Source
OpenAI
70Apr 2026
52
GPT-5-nano-2025-08-07Open Source
OpenAI
69Apr 2026
53
Gemini-Exp-1206Open Source
Google
69Apr 2026
54
GPT-5.2-2025-12-11 (no reasoning)Open Source
OpenAI
69Apr 2026
55
Gemini-2.0-Flash-Thinking-Exp-01-21Open Source
Google
68Apr 2026
56
Claude-Sonnet-4.5Open Source
Anthropic
68Apr 2026
57
O3-mini-2025-01-31Open Source
OpenAI
67Apr 2026
58
GPT-4o-2024-11-20Open Source
OpenAI
67Apr 2026
59
GPT-4.1-2025-04-14Open Source
OpenAI
67Apr 2026
60
Mistral-Large-2512Open Source
Mistral
67Apr 2026
61
Claude-3-OpusAPI
Anthropic
66Apr 2026
62
GPT-4o-2024-08-06Open Source
OpenAI
66Apr 2026
63
DeepSeek-V3.2Open Source
DeepSeek
66Apr 2026
64
GLM-4.7Open Source
Zhipu AI
66Apr 2026
65
Qwen3-235B-A22B
Alibaba
66Apr 2026
66
Qwen3.5-35B-A3BOpen Source
Alibaba
66Apr 2026
67
Qwen3-Next-80B-A3B-ThinkingOpen Source
Alibaba
65Apr 2026
68
Gemini-2.0-Flash-ExperimentalOpen Source
Google
65Apr 2026
69
Grok-3-BetaOpen Source
xAI
65Apr 2026
70
GPT-OSS-120bOpen Source
OpenAI
64Apr 2026
71
DeepSeek-v3.1 (no thinking)Open Source
DeepSeek
64Apr 2026
72
Grok-2-1212Open Source
xAI
64Apr 2026
73
DeepSeek-v3-0324Open Source
DeepSeek
64Apr 2026
74
GLM-4.6Open Source
Zhipu AI
63Apr 2026
75
DeepSeek-v3.2-ExpOpen Source
DeepSeek
63Apr 2026
76
Claude-Sonnet-4API
Anthropic
63Apr 2026
77
GPT-4.1-mini-2025-04-14Open Source
OpenAI
62Apr 2026
78
Qwen3.5-27BOpen Source
Alibaba
62Apr 2026
79
DeepSeek-v3Open Source
DeepSeek
62Apr 2026
80
O1-mini-2024-09-12Open Source
OpenAI
61Apr 2026
81
Mistral-Medium-3Open Source
Mistral
61Apr 2026
82
Llama-4-MaverickOpen Source
Meta
59Apr 2026
83
GLM-4.5Open Source
Zhipu AI
59Apr 2026
84
Kimi-K2-0905Open Source
Moonshot.AI
59Apr 2026
85
Claude-Haiku-4.5Open Source
Anthropic
59Apr 2026
86
GPT-4
OpenAI
58Apr 2026
87
Qwen3-MaxOpen Source
Alibaba
58Apr 2026
88
Gemini-Pro-1.5Open Source
Google
58Apr 2026
89
Kimi-K2Open Source
Moonshot.AI
58Apr 2026
90
Llama-3.1-405bOpen Source
Meta
57Apr 2026
91
Claude-3.5-Haiku-20241022Open Source
Anthropic
57Apr 2026
92
Bielik-11B-v3.0-InstructOpen Source
SpeakLeash
57Apr 2026
93
Claude-3.0-SonnetOpen Source
Anthropic
56Apr 2026
94
Mistral-Small-4Open Source
Mistral
56Apr 2026
95
GPT-4-turboAPI
OpenAI
56Apr 2026
96
Llama-3.1-Tulu-3-405BOpen Source
Meta
56Apr 2026
97
Bielik-2.6Open Source
SpeakLeash
55Apr 2026
98
GPT-4o-mini-2024-07-18Open Source
OpenAI
55Apr 2026
99
GPT-OSS-20bOpen Source
OpenAI
54Apr 2026
100
Llama-PLLuM-70B-chat-250801Open Source
PLLuM
54Apr 2026
101
Qwen3.5-9BOpen Source
Alibaba
54Apr 2026
102
Mistral-Large-2411Open Source
Mistral
54Apr 2026
103
Bielik-2.2Open Source
SpeakLeash
53Apr 2026
104
Mistral-Small-3.2-24B-2506Open Source
Mistral
53Apr 2026
105
PLLuM-12B-nc-chat-250715Open Source
PLLuM
52Apr 2026
106
Qwen3-Next-80B-A3B-InstructOpen Source
Alibaba
52Apr 2026
107
GLM-4.5-AirOpen Source
Zhipu AI
52Apr 2026
108
Mistral-Large-2407Open Source
Mistral
51Apr 2026
109
Llama-4-ScoutOpen Source
Meta
51Apr 2026
110
Qwen-MaxOpen Source
Alibaba
51Apr 2026
111
Bielik-2.5Open Source
SpeakLeash
51Apr 2026
112
Llama-PLLuM-70B-chatOpen Source
PLLuM
50Apr 2026
113
Mixtral-8x22bOpen Source
Mistral
50Apr 2026
114
Mistral-Small-3.1-24B-2503Open Source
Mistral
50Apr 2026
115
Bielik-Minitron-7B-v3.0-InstructOpen Source
SpeakLeash
50Apr 2026
116
Bielik-2.1Open Source
SpeakLeash
50Apr 2026
117
Qwen3-30B-A3BOpen Source
Alibaba
49Apr 2026
118
Command-A-03-2025Open Source
Cohere
49Apr 2026
119
Bielik-2.3Open Source
SpeakLeash
49Apr 2026
120
WizardLM-2-8x22bOpen Source
Microsoft
49Apr 2026
121
Llama-3.3-70BOpen Source
Meta
49Apr 2026
122
Qwen3-32BOpen Source
Alibaba
48Apr 2026
123
Magistral-Small-2506Open Source
Mistral
47Apr 2026
124
PLLuM-8x7B-nc-chatOpen Source
PLLuM
47Apr 2026
125
Qwen-PlusOpen Source
Alibaba
47Apr 2026
126
Gemma-3-27b
Google
46Apr 2026
127
Gemma-2-27bOpen Source
Google
46Apr 2026
128
Gemini-Flash-1.5Open Source
Google
46Apr 2026
129
Qwen3-14BOpen Source
Alibaba
46Apr 2026
130
Mistral-Small-24B-2501Open Source
Mistral
45Apr 2026
131
Qwen3.5-4BOpen Source
Alibaba
45Apr 2026
132
GPT-5.4-nano-2026-03-17 (no reasoning)Open Source
OpenAI
45Apr 2026
133
Llama-3.0-70BOpen Source
Meta
45Apr 2026
134
GPT-4.1-nano-2025-04-14Open Source
OpenAI
45Apr 2026
135
Command-R-Plus-04-2024Open Source
Cohere
45Apr 2026
136
Qwen-2.5-72bOpen Source
Alibaba
45Apr 2026
137
Llama-3.1-70BOpen Source
Meta
44Apr 2026
138
Ministral-8b-2512Open Source
Mistral
44Apr 2026
139
Ministral-14b-2512Open Source
Mistral
44Apr 2026
140
GLM-4.7-FlashOpen Source
Zhipu AI
44Apr 2026
141
Qwen-2.5-32bOpen Source
Alibaba
43Apr 2026
142
Command-R-Plus-08-2024Open Source
Cohere
43Apr 2026
143
PLLuM-8x7B-chatOpen Source
PLLuM
42Apr 2026
144
GPT-3.5-turboOpen Source
OpenAI
41Apr 2026
145
PLLuM-12B-nc-chatOpen Source
PLLuM
41Apr 2026
146
EuroLLM-9BOpen Source
UTTER
39Apr 2026
147
Qwen3-8BOpen Source
Alibaba
38Apr 2026
148
Gemma-2-9bOpen Source
Google
38Apr 2026
149
PLLuM-12B-chatOpen Source
PLLuM
37Apr 2026
150
Bielik-4.5B-v3.0-InstructOpen Source
SpeakLeash
35Apr 2026
151
Phi-4
Microsoft
34Apr 2026
152
Mixtral-8x7bOpen Source
Mistral
34Apr 2026
153
Qwen-2.5-14bOpen Source
Alibaba
34Apr 2026
154
Llama-PLLuM-8B-chatOpen Source
PLLuM
33Apr 2026
155
Qwen-Turbo-2024-11-01Open Source
Alibaba
33Apr 2026
156
Mistral-NemoOpen Source
Mistral
31Apr 2026
157
Ministral-3b-2512Open Source
Mistral
30Apr 2026
158
Llama-3.1-8BOpen Source
Meta
29Apr 2026
159
Bielik-0.1Open Source
SpeakLeash
29Apr 2026
160
Qwen-2.5-7bOpen Source
Alibaba
29Apr 2026
161
Mistral-7b-v0.3Open Source
Mistral
27Apr 2026
162
Ministral-8bOpen Source
Mistral
24Apr 2026
163
Bielik-1.5B-v3.0-InstructOpen Source
SpeakLeash
23Apr 2026
164
Command-R7BOpen Source
Cohere
23Apr 2026
165
Qwen3.5-2BOpen Source
Alibaba
19Apr 2026

history

#ModelScorePaper / CodeDate
1
Gemini-3.1-Pro-PreviewOpen Source
Google
98Apr 2026
2
Gemini-3.0-Pro-PreviewOpen Source
Google
95Apr 2026
3
GPT-5.2-2025-12-11 (xhigh reasoning)Open Source
OpenAI
94Apr 2026
4
Grok-4API
xAI
94Apr 2026
5
GPT-5.4-2026-03-05 (low reasoning)Open Source
OpenAI
93Apr 2026
6
GPT-5.4-2026-03-05 (high reasoning)Open Source
OpenAI
92Apr 2026
7
Gemini-2.5-Pro-Exp-03-25Open Source
Google
92Apr 2026
8
Gemini-2.5-Pro-Preview-06-05Open Source
Google
92Apr 2026
9
Gemini-3-Flash-PreviewOpen Source
Google
92Apr 2026
10
Claude-3.7-Sonnet-ThinkingOpen Source
Anthropic
92Apr 2026
11
DeepSeek-R1-0528Open Source
DeepSeek
91Apr 2026
12
GPT-5-2025-08-07Open Source
OpenAI
91Apr 2026
13
GPT-5-Pro-2025-10-06 (high reasoning)Open Source
OpenAI
91Apr 2026
14
Claude-Opus-4.1Open Source
Anthropic
91Apr 2026
15
Claude-3.5-Sonnet-20241022Open Source
Anthropic
91Apr 2026
16
Claude-3.7-SonnetOpen Source
Anthropic
90Apr 2026
17
O1-2024-12-17Open Source
OpenAI
90Apr 2026
18
GPT-5.2-2025-12-11 (medium reasoning)Open Source
OpenAI
90Apr 2026
19
DeepSeek-V3.2-SpecialeOpen Source
DeepSeek
90Apr 2026
20
GPT-5.2-2025-12-11 (high reasoning)Open Source
OpenAI
90Apr 2026
21
GPT-4.5-preview-2025-02-27Open Source
OpenAI
90Apr 2026
22
O3-2025-04-16Open Source
OpenAI
89Apr 2026
23
Kimi-K2.5Open Source
Moonshot.AI
89Apr 2026
24
Claude-3.5-Sonnet-20240620Open Source
Anthropic
89Apr 2026
25
GPT-5.1-2025-11-13 (high reasoning)Open Source
OpenAI
89Apr 2026
26
DeepSeek-v3.1 (thinking)Open Source
DeepSeek
89Apr 2026
27
GPT-5.4-mini-2026-03-17 (high reasoning)Open Source
OpenAI
89Apr 2026
28
GLM-5API
Zhipu AI
88Apr 2026
29
Gemini-Exp-1206Open Source
Google
88Apr 2026
30
GLM-4.6Open Source
Zhipu AI
87Apr 2026
31
MiMo-V2-ProOpen Source
Xiaomi
87Apr 2026
32
Claude-Opus-4.6Open Source
Anthropic
87Apr 2026
33
GPT-5.4-2026-03-05 (no reasoning)Open Source
OpenAI
87Apr 2026
34
Claude-Opus-4.5Open Source
Anthropic
87Apr 2026
35
Claude-Opus-4API
Anthropic
87Apr 2026
36
Claude-3-OpusAPI
Anthropic
86Apr 2026
37
DeepSeek-v3.1 (no thinking)Open Source
DeepSeek
86Apr 2026
38
GPT-4o-2024-08-06Open Source
OpenAI
86Apr 2026
39
Gemini-2.5-Flash-Preview-04-17Open Source
Google
86Apr 2026
40
GLM-4.7Open Source
Zhipu AI
85Apr 2026
41
Grok-3-BetaOpen Source
xAI
85Apr 2026
42
GPT-5.2-2025-12-11 (no reasoning)Open Source
OpenAI
85Apr 2026
43
Claude-Sonnet-4.5Open Source
Anthropic
85Apr 2026
44
DeepSeek-R1Open Source
DeepSeek
85Apr 2026
45
GPT-4.1-2025-04-14Open Source
OpenAI
85Apr 2026
46
GPT-4o-2024-11-20Open Source
OpenAI
84Apr 2026
47
Grok-3-Mini-BetaOpen Source
xAI
84Apr 2026
48
Grok-4.1-FastOpen Source
xAI
84Apr 2026
49
DeepSeek-v3.2-ExpOpen Source
DeepSeek
83Apr 2026
50
GPT-5-mini-2025-08-07Open Source
OpenAI
83Apr 2026
51
Gemini-2.0-Flash-ExperimentalOpen Source
Google
83Apr 2026
52
Qwen3.5-397B-A17BOpen Source
Alibaba
83Apr 2026
53
DeepSeek-V3.2Open Source
DeepSeek
82Apr 2026
54
GPT-4o-2024-05-13Open Source
OpenAI
82Apr 2026
55
DeepSeek-v3-0324Open Source
DeepSeek
82Apr 2026
56
Claude-Sonnet-4.6Open Source
Anthropic
82Apr 2026
57
GPT-5.1-2025-11-13 (default reasoning)Open Source
OpenAI
82Apr 2026
58
GPT-5.4-mini-2026-03-17 (no reasoning)Open Source
OpenAI
82Apr 2026
59
Grok-4.20Open Source
xAI
82Apr 2026
60
Grok-4-FastOpen Source
xAI
81Apr 2026
61
Claude-Sonnet-4API
Anthropic
81Apr 2026
62
Kimi-K2-ThinkingOpen Source
Moonshot.AI
80Apr 2026
63
Gemini-2.0-Flash-Thinking-Exp-01-21Open Source
Google
80Apr 2026
64
Gemini-Pro-1.5Open Source
Google
79Apr 2026
65
Mistral-Large-2512Open Source
Mistral
79Apr 2026
66
Qwen3.5-122B-A10BOpen Source
Alibaba
78Apr 2026
67
Mistral-Medium-3Open Source
Mistral
78Apr 2026
68
Bielik-11B-v3.0-InstructOpen Source
SpeakLeash
78Apr 2026
69
DeepSeek-v3Open Source
DeepSeek
77Apr 2026
70
Bielik-2.2Open Source
SpeakLeash
77Apr 2026
71
O4-Mini-2025-04-16Open Source
OpenAI
77Apr 2026
72
GLM-4.5Open Source
Zhipu AI
77Apr 2026
73
Llama-4-MaverickOpen Source
Meta
76Apr 2026
74
Bielik-2.3Open Source
SpeakLeash
76Apr 2026
75
GPT-4-turboAPI
OpenAI
76Apr 2026
76
GPT-5.4-nano-2026-03-17 (high reasoning)Open Source
OpenAI
76Apr 2026
77
Llama-3.1-Tulu-3-405BOpen Source
Meta
75Apr 2026
78
Bielik-2.5Open Source
SpeakLeash
75Apr 2026
79
Grok-2-1212Open Source
xAI
74Apr 2026
80
Llama-PLLuM-70B-chatOpen Source
PLLuM
74Apr 2026
81
Qwen3-MaxOpen Source
Alibaba
74Apr 2026
82
Command-A-03-2025Open Source
Cohere
73Apr 2026
83
Kimi-K2Open Source
Moonshot.AI
73Apr 2026
84
PLLuM-12B-nc-chat-250715Open Source
PLLuM
73Apr 2026
85
PLLuM-8x7B-nc-chatOpen Source
PLLuM
73Apr 2026
86
GPT-5-nano-2025-08-07Open Source
OpenAI
73Apr 2026
87
Claude-3.0-SonnetOpen Source
Anthropic
73Apr 2026
88
Llama-3.1-405bOpen Source
Meta
73Apr 2026
89
Bielik-2.1Open Source
SpeakLeash
73Apr 2026
90
Qwen3-Next-80B-A3B-ThinkingOpen Source
Alibaba
72Apr 2026
91
GPT-4
OpenAI
72Apr 2026
92
Bielik-2.6Open Source
SpeakLeash
72Apr 2026
93
Mistral-Large-2407Open Source
Mistral
71Apr 2026
94
Qwen3-235B-A22B
Alibaba
70Apr 2026
95
Kimi-K2-0905Open Source
Moonshot.AI
70Apr 2026
96
PLLuM-12B-nc-chatOpen Source
PLLuM
70Apr 2026
97
MiniMax-M2.5Open Source
MiniMaxAI
69Apr 2026
98
Llama-PLLuM-70B-chat-250801Open Source
PLLuM
69Apr 2026
99
Mixtral-8x22bOpen Source
Mistral
69Apr 2026
100
Llama-3.1-70BOpen Source
Meta
68Apr 2026
101
PLLuM-8x7B-chatOpen Source
PLLuM
68Apr 2026
102
Qwen3.5-35B-A3BOpen Source
Alibaba
68Apr 2026
103
WizardLM-2-8x22bOpen Source
Microsoft
67Apr 2026
104
O3-mini-2025-01-31Open Source
OpenAI
67Apr 2026
105
GPT-4o-mini-2024-07-18Open Source
OpenAI
67Apr 2026
106
GPT-4.1-mini-2025-04-14Open Source
OpenAI
67Apr 2026
107
GLM-4.5-AirOpen Source
Zhipu AI
66Apr 2026
108
Llama-3.3-70BOpen Source
Meta
65Apr 2026
109
GPT-OSS-120bOpen Source
OpenAI
65Apr 2026
110
MiniMax-M2.7Open Source
MiniMaxAI
64Apr 2026
111
Llama-3.0-70BOpen Source
Meta
64Apr 2026
112
Bielik-Minitron-7B-v3.0-InstructOpen Source
SpeakLeash
64Apr 2026
113
Mistral-Small-4Open Source
Mistral
64Apr 2026
114
Mistral-Large-2411Open Source
Mistral
64Apr 2026
115
Qwen3.5-27BOpen Source
Alibaba
63Apr 2026
116
Qwen-MaxOpen Source
Alibaba
63Apr 2026
117
PLLuM-12B-chatOpen Source
PLLuM
61Apr 2026
118
O1-mini-2024-09-12Open Source
OpenAI
61Apr 2026
119
Claude-3.5-Haiku-20241022Open Source
Anthropic
61Apr 2026
120
Command-R-Plus-04-2024Open Source
Cohere
61Apr 2026
121
Command-R-Plus-08-2024Open Source
Cohere
61Apr 2026
122
Mistral-Small-3.2-24B-2506Open Source
Mistral
61Apr 2026
123
Claude-Haiku-4.5Open Source
Anthropic
60Apr 2026
124
Bielik-0.1Open Source
SpeakLeash
58Apr 2026
125
Qwen3-Next-80B-A3B-InstructOpen Source
Alibaba
58Apr 2026
126
GPT-5.4-nano-2026-03-17 (no reasoning)Open Source
OpenAI
57Apr 2026
127
Mixtral-8x7bOpen Source
Mistral
56Apr 2026
128
Qwen3-32BOpen Source
Alibaba
55Apr 2026
129
Bielik-4.5B-v3.0-InstructOpen Source
SpeakLeash
55Apr 2026
130
Magistral-Small-2506Open Source
Mistral
54Apr 2026
131
GLM-4.7-FlashOpen Source
Zhipu AI
54Apr 2026
132
Mistral-Small-3.1-24B-2503Open Source
Mistral
54Apr 2026
133
Qwen-2.5-72bOpen Source
Alibaba
54Apr 2026
134
Gemma-2-27bOpen Source
Google
53Apr 2026
135
Gemma-3-27b
Google
52Apr 2026
136
Ministral-14b-2512Open Source
Mistral
52Apr 2026
137
Gemini-Flash-1.5Open Source
Google
51Apr 2026
138
GPT-3.5-turboOpen Source
OpenAI
51Apr 2026
139
GPT-4.1-nano-2025-04-14Open Source
OpenAI
50Apr 2026
140
Llama-PLLuM-8B-chatOpen Source
PLLuM
50Apr 2026
141
Mistral-Small-24B-2501Open Source
Mistral
49Apr 2026
142
EuroLLM-9BOpen Source
UTTER
49Apr 2026
143
Qwen3.5-9BOpen Source
Alibaba
48Apr 2026
144
Llama-4-ScoutOpen Source
Meta
47Apr 2026
145
Qwen-PlusOpen Source
Alibaba
46Apr 2026
146
Qwen-2.5-32bOpen Source
Alibaba
44Apr 2026
147
Ministral-8b-2512Open Source
Mistral
43Apr 2026
148
Qwen-Turbo-2024-11-01Open Source
Alibaba
42Apr 2026
149
Qwen3-30B-A3BOpen Source
Alibaba
42Apr 2026
150
Qwen3-14BOpen Source
Alibaba
42Apr 2026
151
Qwen3-8BOpen Source
Alibaba
41Apr 2026
152
Phi-4
Microsoft
40Apr 2026
153
Qwen-2.5-14bOpen Source
Alibaba
37Apr 2026
154
GPT-OSS-20bOpen Source
OpenAI
37Apr 2026
155
Qwen3.5-4BOpen Source
Alibaba
36Apr 2026
156
Gemma-2-9bOpen Source
Google
35Apr 2026
157
Ministral-8bOpen Source
Mistral
33Apr 2026
158
Bielik-1.5B-v3.0-InstructOpen Source
SpeakLeash
32Apr 2026
159
Mistral-7b-v0.3Open Source
Mistral
30Apr 2026
160
Ministral-3b-2512Open Source
Mistral
30Apr 2026
161
Mistral-NemoOpen Source
Mistral
28Apr 2026
162
Command-R7BOpen Source
Cohere
27Apr 2026
163
Llama-3.1-8BOpen Source
Meta
25Apr 2026
164
Qwen-2.5-7bOpen Source
Alibaba
23Apr 2026
165
Qwen3.5-2BOpen Source
Alibaba
14Apr 2026

vocabulary

#ModelScorePaper / CodeDate
1
Gemini-3.1-Pro-PreviewOpen Source
Google
96Apr 2026
2
Gemini-3.0-Pro-PreviewOpen Source
Google
95Apr 2026
3
GPT-5-Pro-2025-10-06 (high reasoning)Open Source
OpenAI
92Apr 2026
4
GPT-5.4-2026-03-05 (high reasoning)Open Source
OpenAI
91Apr 2026
5
GPT-5-2025-08-07Open Source
OpenAI
91Apr 2026
6
Gemini-2.5-Pro-Preview-06-05Open Source
Google
90Apr 2026
7
Gemini-2.5-Pro-Exp-03-25Open Source
Google
90Apr 2026
8
O3-2025-04-16Open Source
OpenAI
90Apr 2026
9
GPT-5.1-2025-11-13 (high reasoning)Open Source
OpenAI
90Apr 2026
10
O1-2024-12-17Open Source
OpenAI
88Apr 2026
11
Gemini-3-Flash-PreviewOpen Source
Google
88Apr 2026
12
GPT-5.2-2025-12-11 (xhigh reasoning)Open Source
OpenAI
87Apr 2026
13
GPT-5.2-2025-12-11 (high reasoning)Open Source
OpenAI
86Apr 2026
14
GPT-5.4-mini-2026-03-17 (high reasoning)Open Source
OpenAI
86Apr 2026
15
GPT-5.2-2025-12-11 (medium reasoning)Open Source
OpenAI
86Apr 2026
16
GPT-5.4-2026-03-05 (low reasoning)Open Source
OpenAI
85Apr 2026
17
GPT-5.4-2026-03-05 (no reasoning)Open Source
OpenAI
85Apr 2026
18
Grok-4API
xAI
84Apr 2026
19
GPT-4.5-preview-2025-02-27Open Source
OpenAI
83Apr 2026
20
Gemini-Exp-1206Open Source
Google
82Apr 2026
21
Gemini-2.5-Flash-Preview-04-17Open Source
Google
81Apr 2026
22
GPT-4o-2024-11-20Open Source
OpenAI
80Apr 2026
23
GPT-4.1-2025-04-14Open Source
OpenAI
80Apr 2026
24
GPT-4o-2024-05-13Open Source
OpenAI
78Apr 2026
25
Claude-Opus-4.6Open Source
Anthropic
78Apr 2026
26
Claude-3.5-Sonnet-20241022Open Source
Anthropic
77Apr 2026
27
GPT-5.2-2025-12-11 (no reasoning)Open Source
OpenAI
77Apr 2026
28
GPT-4o-2024-08-06Open Source
OpenAI
77Apr 2026
29
Claude-Opus-4.5Open Source
Anthropic
76Apr 2026
30
Claude-3.5-Sonnet-20240620Open Source
Anthropic
76Apr 2026
31
GPT-5.1-2025-11-13 (default reasoning)Open Source
OpenAI
75Apr 2026
32
Claude-3.7-SonnetOpen Source
Anthropic
75Apr 2026
33
Claude-3.7-Sonnet-ThinkingOpen Source
Anthropic
75Apr 2026
34
DeepSeek-v3.1 (thinking)Open Source
DeepSeek
74Apr 2026
35
Claude-Sonnet-4.6Open Source
Anthropic
74Apr 2026
36
Claude-Opus-4.1Open Source
Anthropic
73Apr 2026
37
MiMo-V2-ProOpen Source
Xiaomi
73Apr 2026
38
Claude-Opus-4API
Anthropic
73Apr 2026
39
DeepSeek-R1Open Source
DeepSeek
72Apr 2026
40
Gemini-2.0-Flash-ExperimentalOpen Source
Google
72Apr 2026
41
GLM-5API
Zhipu AI
72Apr 2026
42
DeepSeek-V3.2-SpecialeOpen Source
DeepSeek
71Apr 2026
43
Qwen3.5-397B-A17BOpen Source
Alibaba
70Apr 2026
44
GPT-5.4-mini-2026-03-17 (no reasoning)Open Source
OpenAI
70Apr 2026
45
GPT-5-mini-2025-08-07Open Source
OpenAI
70Apr 2026
46
Gemini-2.0-Flash-Thinking-Exp-01-21Open Source
Google
69Apr 2026
47
Grok-3-BetaOpen Source
xAI
69Apr 2026
48
DeepSeek-R1-0528Open Source
DeepSeek
68Apr 2026
49
PLLuM-8x7B-nc-chatOpen Source
PLLuM
68Apr 2026
50
Gemini-Pro-1.5Open Source
Google
68Apr 2026
51
Bielik-11B-v3.0-InstructOpen Source
SpeakLeash
67Apr 2026
52
PLLuM-12B-nc-chat-250715Open Source
PLLuM
67Apr 2026
53
Kimi-K2.5Open Source
Moonshot.AI
65Apr 2026
54
Grok-4.1-FastOpen Source
xAI
65Apr 2026
55
DeepSeek-V3.2Open Source
DeepSeek
65Apr 2026
56
O4-Mini-2025-04-16Open Source
OpenAI
65Apr 2026
57
DeepSeek-v3.2-ExpOpen Source
DeepSeek
64Apr 2026
58
Mistral-Large-2512Open Source
Mistral
64Apr 2026
59
DeepSeek-v3Open Source
DeepSeek
63Apr 2026
60
Bielik-2.6Open Source
SpeakLeash
62Apr 2026
61
Mistral-Medium-3Open Source
Mistral
62Apr 2026
62
Bielik-2.2Open Source
SpeakLeash
62Apr 2026
63
DeepSeek-v3-0324Open Source
DeepSeek
62Apr 2026
64
Claude-3-OpusAPI
Anthropic
62Apr 2026
65
DeepSeek-v3.1 (no thinking)Open Source
DeepSeek
62Apr 2026
66
Qwen3.5-122B-A10BOpen Source
Alibaba
61Apr 2026
67
Grok-3-Mini-BetaOpen Source
xAI
61Apr 2026
68
GPT-5.4-nano-2026-03-17 (high reasoning)Open Source
OpenAI
61Apr 2026
69
Claude-Sonnet-4.5Open Source
Anthropic
61Apr 2026
70
Bielik-2.3Open Source
SpeakLeash
61Apr 2026
71
Bielik-2.5Open Source
SpeakLeash
61Apr 2026
72
Claude-Sonnet-4API
Anthropic
61Apr 2026
73
MiniMax-M2.7Open Source
MiniMaxAI
60Apr 2026
74
GLM-4.5Open Source
Zhipu AI
60Apr 2026
75
Kimi-K2-ThinkingOpen Source
Moonshot.AI
59Apr 2026
76
GLM-4.7Open Source
Zhipu AI
59Apr 2026
77
Grok-4-FastOpen Source
xAI
59Apr 2026
78
Grok-4.20Open Source
xAI
59Apr 2026
79
Grok-2-1212Open Source
xAI
57Apr 2026
80
GLM-4.6Open Source
Zhipu AI
57Apr 2026
81
Bielik-2.1Open Source
SpeakLeash
56Apr 2026
82
GPT-4.1-mini-2025-04-14Open Source
OpenAI
56Apr 2026
83
GPT-4-turboAPI
OpenAI
56Apr 2026
84
Kimi-K2Open Source
Moonshot.AI
54Apr 2026
85
Qwen3-MaxOpen Source
Alibaba
54Apr 2026
86
Qwen3.5-27BOpen Source
Alibaba
54Apr 2026
87
Llama-3.1-Tulu-3-405BOpen Source
Meta
53Apr 2026
88
Kimi-K2-0905Open Source
Moonshot.AI
53Apr 2026
89
Claude-3.5-Haiku-20241022Open Source
Anthropic
52Apr 2026
90
Mistral-Small-4Open Source
Mistral
52Apr 2026
91
MiniMax-M2.5Open Source
MiniMaxAI
52Apr 2026
92
PLLuM-12B-nc-chatOpen Source
PLLuM
52Apr 2026
93
GPT-4o-mini-2024-07-18Open Source
OpenAI
51Apr 2026
94
Command-A-03-2025Open Source
Cohere
49Apr 2026
95
GPT-4
OpenAI
48Apr 2026
96
GPT-5-nano-2025-08-07Open Source
OpenAI
47Apr 2026
97
Gemini-Flash-1.5Open Source
Google
47Apr 2026
98
GLM-4.5-AirOpen Source
Zhipu AI
47Apr 2026
99
O3-mini-2025-01-31Open Source
OpenAI
47Apr 2026
100
Command-R-Plus-04-2024Open Source
Cohere
46Apr 2026
101
Llama-PLLuM-70B-chatOpen Source
PLLuM
46Apr 2026
102
Bielik-Minitron-7B-v3.0-InstructOpen Source
SpeakLeash
46Apr 2026
103
Llama-PLLuM-70B-chat-250801Open Source
PLLuM
46Apr 2026
104
Claude-3.0-SonnetOpen Source
Anthropic
46Apr 2026
105
Claude-Haiku-4.5Open Source
Anthropic
45Apr 2026
106
Qwen3.5-35B-A3BOpen Source
Alibaba
45Apr 2026
107
Llama-4-MaverickOpen Source
Meta
45Apr 2026
108
Qwen-MaxOpen Source
Alibaba
45Apr 2026
109
PLLuM-8x7B-chatOpen Source
PLLuM
44Apr 2026
110
Llama-3.1-405bOpen Source
Meta
43Apr 2026
111
Qwen3-235B-A22B
Alibaba
43Apr 2026
112
Command-R-Plus-08-2024Open Source
Cohere
43Apr 2026
113
Mistral-Large-2411Open Source
Mistral
42Apr 2026
114
Llama-4-ScoutOpen Source
Meta
42Apr 2026
115
GPT-5.4-nano-2026-03-17 (no reasoning)Open Source
OpenAI
41Apr 2026
116
Mistral-Large-2407Open Source
Mistral
40Apr 2026
117
O1-mini-2024-09-12Open Source
OpenAI
40Apr 2026
118
Bielik-4.5B-v3.0-InstructOpen Source
SpeakLeash
39Apr 2026
119
Ministral-14b-2512Open Source
Mistral
39Apr 2026
120
GPT-4.1-nano-2025-04-14Open Source
OpenAI
38Apr 2026
121
Qwen3.5-9BOpen Source
Alibaba
38Apr 2026
122
WizardLM-2-8x22bOpen Source
Microsoft
38Apr 2026
123
GPT-OSS-120bOpen Source
OpenAI
38Apr 2026
124
Qwen-PlusOpen Source
Alibaba
38Apr 2026
125
Gemma-3-27b
Google
37Apr 2026
126
Mistral-Small-3.1-24B-2503Open Source
Mistral
37Apr 2026
127
Bielik-0.1Open Source
SpeakLeash
37Apr 2026
128
Qwen3-32BOpen Source
Alibaba
37Apr 2026
129
Llama-3.3-70BOpen Source
Meta
37Apr 2026
130
Qwen3-Next-80B-A3B-ThinkingOpen Source
Alibaba
37Apr 2026
131
Gemma-2-27bOpen Source
Google
37Apr 2026
132
Mistral-Small-24B-2501Open Source
Mistral
36Apr 2026
133
GPT-3.5-turboOpen Source
OpenAI
36Apr 2026
134
Qwen-2.5-72bOpen Source
Alibaba
36Apr 2026
135
Ministral-8b-2512Open Source
Mistral
35Apr 2026
136
Llama-PLLuM-8B-chatOpen Source
PLLuM
35Apr 2026
137
Mixtral-8x22bOpen Source
Mistral
35Apr 2026
138
Mistral-Small-3.2-24B-2506Open Source
Mistral
35Apr 2026
139
Llama-3.1-70BOpen Source
Meta
34Apr 2026
140
Qwen3-14BOpen Source
Alibaba
34Apr 2026
141
EuroLLM-9BOpen Source
UTTER
34Apr 2026
142
Qwen3.5-4BOpen Source
Alibaba
34Apr 2026
143
Qwen-2.5-32bOpen Source
Alibaba
33Apr 2026
144
PLLuM-12B-chatOpen Source
PLLuM
33Apr 2026
145
Qwen3-Next-80B-A3B-InstructOpen Source
Alibaba
32Apr 2026
146
Magistral-Small-2506Open Source
Mistral
31Apr 2026
147
Qwen-Turbo-2024-11-01Open Source
Alibaba
31Apr 2026
148
Gemma-2-9bOpen Source
Google
30Apr 2026
149
GLM-4.7-FlashOpen Source
Zhipu AI
30Apr 2026
150
Qwen-2.5-14bOpen Source
Alibaba
28Apr 2026
151
Qwen3-30B-A3BOpen Source
Alibaba
27Apr 2026
152
Phi-4
Microsoft
26Apr 2026
153
Qwen3-8BOpen Source
Alibaba
25Apr 2026
154
Bielik-1.5B-v3.0-InstructOpen Source
SpeakLeash
23Apr 2026
155
GPT-OSS-20bOpen Source
OpenAI
23Apr 2026
156
Command-R7BOpen Source
Cohere
22Apr 2026
157
Ministral-3b-2512Open Source
Mistral
22Apr 2026
158
Ministral-8bOpen Source
Mistral
22Apr 2026
159
Llama-3.0-70BOpen Source
Meta
22Apr 2026
160
Qwen-2.5-7bOpen Source
Alibaba
21Apr 2026
161
Mistral-NemoOpen Source
Mistral
20Apr 2026
162
Qwen3.5-2BOpen Source
Alibaba
20Apr 2026
163
Mixtral-8x7bOpen Source
Mistral
20Apr 2026
164
Llama-3.1-8BOpen Source
Meta
19Apr 2026
165
Mistral-7b-v0.3Open Source
Mistral
16Apr 2026