Polish Emotional Intelligence · 2025 · en

Polish Emotional Intelligence Benchmark (EQ-Bench v2 PL)

Evaluates LLMs on emotional intelligence in Polish. Based on the EQ-Bench v2 methodology, adapted for the Polish language. Models predict changes in emotional intensity across 171 questions. The score is adjusted for parseability: Benchmark Score × (Parseable / 171), where Parseable is the number of questions whose answer could be parsed. Created by SpeakLeash.
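The parseability adjustment described above can be sketched in a few lines of Python. This is a minimal illustration of the stated formula only; the function name `adjusted_eq_score` and the example inputs are assumptions, not part of the benchmark's published code.

```python
TOTAL_QUESTIONS = 171  # question count stated in the benchmark description


def adjusted_eq_score(benchmark_score: float, parseable: int,
                      total: int = TOTAL_QUESTIONS) -> float:
    """Scale a raw benchmark score by the fraction of parseable answers."""
    if not 0 <= parseable <= total:
        raise ValueError(f"parseable must be between 0 and {total}")
    return benchmark_score * (parseable / total)


# Hypothetical inputs, not taken from the leaderboard:
# a raw score of 80.0 with 162 of 171 answers parseable.
print(round(adjusted_eq_score(80.0, 162), 2))  # prints 75.79
```

A model whose answers are all parseable keeps its raw score unchanged, while every unparseable answer lowers the final score proportionally.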

Samples: 101
Metrics: eq-score
Current State of the Art

Mistral-Large-Instruct-2407 (mistralai): 78.07 eq-score

Polish EQ-Bench — eq-score

101 results · 1 SOTA advance · higher is better

[Chart: eq-score (0 to 80) by date, with views for all results and the SOTA frontier. Labeled point: Mistral-Large-Instruct-2407]
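One way to read the "SOTA advances" count is as the number of times a result beats the running best when results are scanned in date order. The sketch below illustrates that reading; the function name `sota_frontier`, the `"YYYY-MM"` date encoding, and the rule that same-date results count as a single advance are all assumptions, not the site's documented method.

```python
from typing import Iterable, List, Tuple

Result = Tuple[str, str, float]  # (date as "YYYY-MM", model name, eq-score)


def sota_frontier(results: Iterable[Result]) -> List[Result]:
    """Return the results that set a new best score when scanned in date order."""
    frontier: List[Result] = []
    best = float("-inf")
    # Sort by date; within one date, highest score first, so that results
    # sharing a date contribute at most one advance (an assumption).
    for date, model, score in sorted(results, key=lambda r: (r[0], -r[2])):
        if score > best:
            best = score
            frontier.append((date, model, score))
    return frontier


# Three rows from the leaderboard, all dated Apr 2026.
results = [
    ("2026-04", "Mistral-Large-Instruct-2411", 77.29),
    ("2026-04", "Mistral-Large-Instruct-2407", 78.07),
    ("2026-04", "GPT-4o-2024-08-06", 75.15),
]
print(len(sota_frontier(results)))  # prints 1
```

Because every listed result carries the same date, only the top-scoring model ends up on the frontier under these assumptions.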

Model Size vs Score — Pareto Frontier

54 models · log scale · Pareto frontier shown

[Scatter plot: eq-score (18.0 to 78.0) vs. parameters (log scale, 1B to 700B). Series: Global, Bielik, PLLuM, Pareto. Labeled models: Bielik-11B-v2.6, Llama-PLLuM-70B-chat, Bielik-11B-v2.5, Bielik-11B-v3.0, Bielik-11B-v2.3, Llama-PLLuM-70B-instruct, Bielik-11B-v2.2, Bielik-11B-v2.0, Bielik-11B-v2.1, pllum-12b-nc-chat-250715, Bielik-4.5B-v3.0, PLLuM-12B-chat, PLLuM-8x7B-nc-chat, Llama-PLLuM-8B-chat, PLLuM-8x7B-chat, PLLuM-8x7B-nc-instruct, PLLuM-8x7B-instruct, PLLuM-12B-instruct, Bielik-7B-Instruct-v0.1]
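A size-versus-score Pareto frontier keeps every model that no smaller-or-equal model matches or beats on score. The sketch below shows one common way to compute it. The function name `pareto_frontier` is hypothetical, the scores come from the results table, and the parameter counts are assumptions read off the model names rather than values published on this page.

```python
from typing import List, Tuple

Model = Tuple[str, float, float]  # (name, parameters in billions, eq-score)


def pareto_frontier(models: List[Model]) -> List[str]:
    """Names of models not dominated by any smaller-or-equal model."""
    frontier: List[str] = []
    best_score = float("-inf")
    # Scan from smallest to largest; for equal sizes, highest score first.
    for name, params_b, score in sorted(models, key=lambda m: (m[1], -m[2])):
        if score > best_score:  # beats every model of this size or smaller
            frontier.append(name)
            best_score = score
    return frontier


# Parameter counts below are assumed from the model names.
models = [
    ("Llama-PLLuM-70B-chat", 70.0, 72.563158),
    ("Bielik-11B-v2.6-Instruct", 11.0, 73.696491),
    ("PLLuM-12B-chat", 12.0, 52.264561),
    ("Bielik-4.5B-v3.0-Instruct", 4.5, 53.580292),
]
print(pareto_frontier(models))
# prints ['Bielik-4.5B-v3.0-Instruct', 'Bielik-11B-v2.6-Instruct']
```

In this small sample, Llama-PLLuM-70B-chat falls off the frontier because the smaller Bielik-11B-v2.6-Instruct already scores higher.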

Top Models Performance Comparison

Top 10 models ranked by eq-score

| Rank | Model | eq-score | % of best |
|---|---|---|---|
| 1 | Mistral-Large-Instruct-2407 | 78.1 | 100.0% |
| 2 | Mistral-Large-Instruct-2411 | 77.3 | 99.0% |
| 3 | Meta-Llama-3.1-405B-Instr... | 77.2 | 98.9% |
| 4 | GPT-4o-2024-08-06 | 75.2 | 96.3% |
| 5 | gpt-4-turbo-2024-04-09 | 74.6 | 95.5% |
| 6 | Bielik-11B-v2.6-Instruct | 73.7 | 94.4% |
| 7 | 🚧DeepSeek-V3-0324 | 73.5 | 94.1% |
| 8 | Mistral-Small-Instruct-2409 | 72.8 | 93.3% |
| 9 | Llama-PLLuM-70B-chat | 72.6 | 92.9% |
| 10 | Meta-Llama-3.1-70B-Instruct | 72.5 | 92.9% |
Best Score: 78.1
Top Model: Mistral-Large-Ins...
Models Compared: 10
Score Range: 5.5
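The "% of best" and "Score Range" figures follow from simple arithmetic on the top-10 scores. The sketch below reproduces them for three of the ten rows, using the unrounded scores from the results table. The helper names `percent_of_best` and `score_range` are hypothetical, and rounding to one decimal place is an assumption about how the page displays values.

```python
from typing import Dict


def percent_of_best(scores: Dict[str, float]) -> Dict[str, float]:
    """Express each score as a percentage of the highest score."""
    best = max(scores.values())
    return {name: round(100 * score / best, 1) for name, score in scores.items()}


def score_range(scores: Dict[str, float]) -> float:
    """Difference between the highest and lowest score."""
    return round(max(scores.values()) - min(scores.values()), 1)


# Ranks 1, 2, and 10 of the top-10 comparison.
top = {
    "Mistral-Large-Instruct-2407": 78.07,
    "Mistral-Large-Instruct-2411": 77.29,
    "Meta-Llama-3.1-70B-Instruct": 72.53,
}
print(percent_of_best(top)["Mistral-Large-Instruct-2411"])  # prints 99.0
print(score_range(top))  # prints 5.5
```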

eq-score (Primary)

| # | Model | Tag | Organization | Score | Date |
|---|---|---|---|---|---|
| 1 | Mistral-Large-Instruct-2407 | Open Source | mistralai | 78.07 | Apr 2026 |
| 2 | Mistral-Large-Instruct-2411 | Open Source | mistralai | 77.29 | Apr 2026 |
| 3 | Meta-Llama-3.1-405B-Instruct-FP8 | Open Source | meta-llama | 77.23 | Apr 2026 |
| 4 | GPT-4o-2024-08-06 | Open Source | OpenAI | 75.15 | Apr 2026 |
| 5 | gpt-4-turbo-2024-04-09 | Open Source | | 74.586433 | Apr 2026 |
| 6 | Bielik-11B-v2.6-Instruct | Open Source | speakleash | 73.696491 | Apr 2026 |
| 7 | 🚧DeepSeek-V3-0324 | Open Source | deepseek-ai | 73.46 | Apr 2026 |
| 8 | Mistral-Small-Instruct-2409 | Open Source | mistralai | 72.85 | Apr 2026 |
| 9 | Llama-PLLuM-70B-chat | Open Source | CYFRAGOVPL | 72.563158 | Apr 2026 |
| 10 | Meta-Llama-3.1-70B-Instruct | Open Source | meta-llama | 72.53 | Apr 2026 |
| 11 | Bielik-11B-v2.5-Instruct | Open Source | speakleash | 71.996491 | Apr 2026 |
| 12 | Qwen2-72B-Instruct | Open Source | Qwen | 71.227076 | Apr 2026 |
| 13 | Meta-Llama-3-70B-Instruct | Open Source | meta-llama | 71.21 | Apr 2026 |
| 14 | Bielik-11B-v3.0-Instruct | Open Source | speakleash | 71.2 | Apr 2026 |
| 15 | GPT-4o-mini-2024-07-18 | Open Source | OpenAI | 71.15 | Apr 2026 |
| 16 | Qwen2.5-32B-Instruct | Open Source | Qwen | 71.15 | Apr 2026 |
| 17 | Bielik-11B-v2.3-Instruct | Open Source | speakleash | 70.86 | Apr 2026 |
| 18 | Llama-3.3-70B-Instruct | Open Source | meta-llama | 70.729591 | Apr 2026 |
| 19 | Mistral-Small-24B-Instruct-2501 | Open Source | mistralai | 70.52 | Apr 2026 |
| 20 | Llama-PLLuM-70B-instruct | Open Source | CYFRAGOVPL | 69.99 | Apr 2026 |
| 21 | WizardLM-2-8x22B | Open Source | alpindale | 69.56 | Apr 2026 |
| 22 | Qwen2.5-14B-Instruct | Open Source | Qwen | 69.173099 | Apr 2026 |
| 23 | Bielik-11B-v2.2-Instruct | Open Source | speakleash | 69.05 | Apr 2026 |
| 24 | Qwen2-72B | Open Source | Qwen | 68.934211 | Apr 2026 |
| 25 | Qwen2.5-72B-Instruct | Open Source | Qwen | 68.487135 | Apr 2026 |
| 26 | Bielik-11B-v2.0-Instruct | Open Source | speakleash | 68.24 | Apr 2026 |
| 27 | Qwen1.5-72B-Chat | Open Source | Qwen | 68.03 | Apr 2026 |
| 28 | Mixtral-8x22B-Instruct-v0.1 | Open Source | mistralai | 67.63 | Apr 2026 |
| 29 | glm-4-9b-chat | Open Source | THUDM | 61.79 | Apr 2026 |
| 30 | Mistral-Nemo-Instruct-2407 | Open Source | mistralai | 61.76 | Apr 2026 |
| 31 | Bielik-11B-v2.1-Instruct | Open Source | speakleash | 60.069298 | Apr 2026 |
| 32 | Qwen1.5-32B-Chat | Open Source | Qwen | 59.625263 | Apr 2026 |
| 33 | openchat-3.5-0106-gemma | Open Source | openchat | 59.579532 | Apr 2026 |
| 34 | phi-4 | Open Source | microsoft | 59.099942 | Apr 2026 |
| 35 | Qwen2.5-7B-Instruct | Open Source | Qwen | 58.58 | Apr 2026 |
| 36 | aya-23-35B | Open Source | CohereForAI | 58.41 | Apr 2026 |
| 37 | GPT-3.5-turbo | Open Source | OpenAI | 57.7 | Apr 2026 |
| 38 | Qwen2-57B-A14B-Instruct | Open Source | Qwen | 57.64 | Apr 2026 |
| 39 | Mixtral-8x7B-Instruct-v0.1 | Open Source | mistralai | 57.611228 | Apr 2026 |
| 40 | c4ai-command-r-v01 | Open Source | CohereForAI | 56.43 | Apr 2026 |
| 41 | Phi-3-medium-4k-instruct | Open Source | microsoft | 56.402515 | Apr 2026 |
| 42 | SOLAR-10.7B-Instruct-v1.0 | Open Source | upstage | 55.213333 | Apr 2026 |
| 43 | pllum-12b-nc-chat-250715 | Open Source | CYFRAGOVPL | 55.165263 | Apr 2026 |
| 44 | Hermes-2-Theta-Llama-3-8B | Open Source | NousResearch | 54.88 | Apr 2026 |
| 45 | NeuralDaredevil-8B-abliterated | Open Source | mlabonne | 54.74 | Apr 2026 |
| 46 | Hermes-2-Pro-Llama-3-8B | Open Source | NousResearch | 54.57 | Apr 2026 |
| 47 | EuroLLM-9B-Instruct | Open Source | utter-project | 54.109649 | Apr 2026 |
| 48 | Qwen1.5-32B | Open Source | Qwen | 54.032164 | Apr 2026 |
| 49 | Qwen2-7B-Instruct | Open Source | Qwen | 53.74 | Apr 2026 |
| 50 | Bielik-4.5B-v3.0-Instruct | Open Source | speakleash | 53.580292 | Apr 2026 |
| 51 | recurrentgemma-9b-it | Open Source | google | 52.82 | Apr 2026 |
| 52 | PLLuM-12B-chat | Open Source | CYFRAGOVPL | 52.264561 | Apr 2026 |
| 53 | Qwen1.5-72B | Open Source | Qwen | 51.435556 | Apr 2026 |
| 54 | Phi-4-mini-instruct | Open Source | microsoft | 50.522807 | Apr 2026 |
| 55 | Starling-LM-7B-alpha | Open Source | berkeley-nest | 49.63 | Apr 2026 |
| 56 | Nous-Hermes-2-SOLAR-10.7B | Open Source | NousResearch | 49.266959 | Apr 2026 |
| 57 | openchat-3.5-1210 | Open Source | openchat | 49.04 | Apr 2026 |
| 58 | Delexa-7b | Open Source | lex-hue | 48.45655 | Apr 2026 |
| 59 | Qwen1.5-14B-Chat | Open Source | Qwen | 47.962573 | Apr 2026 |
| 60 | PLLuM-8x7B-nc-chat | Open Source | CYFRAGOVPL | 47.29 | Apr 2026 |
| 61 | Mistral-7B-Instruct-v0.2 | Open Source | mistralai | 47.02193 | Apr 2026 |
| 62 | Meta-Llama-3-8B-Instruct | Open Source | meta-llama | 46.53 | Apr 2026 |
| 63 | Yi-1.5-9B-Chat | Open Source | 01-ai | 46.497895 | Apr 2026 |
| 64 | Yi-1.5-34B-Chat | Open Source | 01-ai | 46.32 | Apr 2026 |
| 65 | Llama-PLLuM-8B-chat | Open Source | CYFRAGOVPL | 46.200877 | Apr 2026 |
| 66 | Llama-3.2-3B-Instruct | Open Source | meta-llama | 46.188304 | Apr 2026 |
| 67 | aya-23-8B | Open Source | CohereForAI | 45.43 | Apr 2026 |
| 68 | openchat-3.5-0106 | Open Source | openchat | 45.422807 | Apr 2026 |
| 69 | NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 | Open Source | nvidia | 45.284094 | Apr 2026 |
| 70 | PLLuM-8x7B-chat | Open Source | CYFRAGOVPL | 45.22 | Apr 2026 |
| 71 | Mistral-7B-Instruct-v0.3 | Open Source | mistralai | 45.21 | Apr 2026 |
| 72 | Kruk-7B-SP-001 | Open Source | Remek | 44.44 | Apr 2026 |
| 73 | Starling-LM-7B-beta | Open Source | Nexusflow | 43.781287 | Apr 2026 |
| 74 | OpenChat3.5-0106-Spichlerz-Bocian | Open Source | Remek | 42.839649 | Apr 2026 |
| 75 | falcon-11B | Open Source | tiiuae | 42.41 | Apr 2026 |
| 76 | PLLuM-8x7B-nc-instruct | Open Source | CYFRAGOVPL | 41.75 | Apr 2026 |
| 77 | OpenChat3.5-0106-Spichlerz-Inst-001 | Open Source | Remek | 41.6 | Apr 2026 |
| 78 | internlm2-chat-7b-sft | Open Source | internlm | 41.376608 | Apr 2026 |
| 79 | PLLuM-8x7B-instruct | Open Source | CYFRAGOVPL | 39.55 | Apr 2026 |
| 80 | internlm2-chat-7b | Open Source | internlm | 39.532164 | Apr 2026 |
| 81 | Llama3-ChatQA-1.5-8B | Open Source | nvidia | 39.364327 | Apr 2026 |
| 82 | Meta-Llama-3-70B | Open Source | meta-llama | 39.090643 | Apr 2026 |
| 83 | OpenHermes-2.5-Mistral-7B | Open Source | teknium | 37.48 | Apr 2026 |
| 84 | internlm2-chat-20b | Open Source | internlm | 36.306433 | Apr 2026 |
| 85 | PLLuM-12B-instruct | Open Source | CYFRAGOVPL | 36.212515 | Apr 2026 |
| 86 | Qwen2.5-3B-Instruct | Open Source | Qwen | 35.869006 | Apr 2026 |
| 87 | Qwen2-7B | Open Source | Qwen | 35.510409 | Apr 2026 |
| 88 | OpenHermes-13B | Open Source | teknium | 34.910526 | Apr 2026 |
| 89 | Bielik-SOLAR-LIKE-10.7B-Instruct-v0.1 | Open Source | TeeZee | 34.171462 | Apr 2026 |
| 90 | Bielik-7B-Instruct-v0.1 | Open Source | speakleash | 31.26386 | Apr 2026 |
| 91 | Qwen2.5-1.5B-Instruct | Open Source | Qwen | 27.627485 | Apr 2026 |
| 92 | Llama-3-8B-Omnibus-1-PL-v01-INSTRUCT | Open Source | Remek | 26.63 | Apr 2026 |
| 93 | Phi-3-mini-4k-instruct | Open Source | microsoft | 26.081579 | Apr 2026 |
| 94 | trurl-2-13b-academic | Open Source | Voicelab | 24.555789 | Apr 2026 |
| 95 | Qwen1.5-7B-Chat | Open Source | Qwen | 23.976608 | Apr 2026 |
| 96 | Qwen1.5-7B | Open Source | Qwen | 20.947661 | Apr 2026 |
| 97 | Llama-3.2-1B-Instruct | Open Source | meta-llama | 17.820585 | Apr 2026 |
| 98 | gemma-1.1-2b-it | Open Source | google | 16.47 | Apr 2026 |
| 99 | Qwen2-1.5B-Instruct | Open Source | Qwen | 14.792105 | Apr 2026 |
| 100 | internlm2-chat-1_8b | Open Source | internlm | 12.131579 | Apr 2026 |
| 101 | Yi-1.5-6B-Chat | Open Source | 01-ai | 4.886491 | Apr 2026 |
Polish EQ-Bench Benchmark - Polish Emotional Intelligence | CodeSOTA