Who leads the Polish EQ-Bench benchmark?

mistralai/Mistral-Large-Instruct-2407 currently leads Polish EQ-Bench with a score of 78.07 on eq-score.

What is the state-of-the-art score on Polish EQ-Bench?

The state-of-the-art result on Polish EQ-Bench is 78.07 (eq-score), achieved by mistralai/Mistral-Large-Instruct-2407 as of 2026.

How many models are tracked on Polish EQ-Bench?

Codesota tracks 101 models on Polish EQ-Bench.

When was the Polish EQ-Bench leaderboard last updated?

The Polish EQ-Bench leaderboard on Codesota includes results through 2026.

Codesota · Natural Language Processing · Polish Emotional Intelligence · Polish EQ-BenchTasks/Natural Language Processing/Polish Emotional Intelligence

Polish Emotional Intelligence · benchmark dataset · 2025 · PL

Polish Emotional Intelligence Benchmark (EQ-Bench v2 PL).

Name: Polish Emotional Intelligence Benchmark (EQ-Bench v2 PL) Benchmark Results
Creator: Codesota
Published: 2026-01-01
License: https://creativecommons.org/licenses/by/4.0/

Evaluates LLMs on emotional intelligence in Polish. Based on EQ-Bench v2 methodology adapted for Polish language. Models predict emotional intensity changes across 171 questions. Score adjusted for parseability: Benchmark Score × (Parseable / 171). Created by SpeakLeash.

Paper ↗Download dataset Submit a result ↵

§ 01 · Leaderboard

Best published scores.

101 results indexed across 1 metric. Shaded row marks current SOTA; ties broken by submission date.

Primary: eq-score · higher is better

eq-score· primary

101 rows

#	Model	Org	Submitted	Paper / code	eq-score
01	mistralai/Mistral-Large-Instruct-2407Open	mistralai	Apr 2026	SpeakLeash/Polish-EQ-Bench	78.07
02	mistralai/Mistral-Large-Instruct-2411Open	mistralai	Apr 2026	SpeakLeash/Polish-EQ-Bench	77.29
03	Meta-Llama-3.1-405B-Instruct-FP8Open	meta-llama	Apr 2026	SpeakLeash/Polish-EQ-Bench	77.23
04	GPT-4o-2024-08-06Open	OpenAI	Apr 2026	SpeakLeash/Polish-EQ-Bench	75.15
05	gpt-4-turbo-2024-04-09Open	—	Apr 2026	SpeakLeash/Polish-EQ-Bench	74.59
06	speakleash/Bielik-11B-v2.6-InstructOpen	speakleash	Apr 2026	SpeakLeash/Polish-EQ-Bench	73.70
07	deepseek-ai/DeepSeek-V3-0324 (API)API	deepseek-ai	Apr 2026	SpeakLeash/Polish-EQ-Bench	73.46
08	Mistral-Small-Instruct-2409Open	mistralai	Apr 2026	SpeakLeash/Polish-EQ-Bench	72.85
09	CYFRAGOVPL/Llama-PLLuM-70B-chatOpen	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	72.56
10	meta-llama/Meta-Llama-3.1-70B-InstructOpen	meta-llama	Apr 2026	SpeakLeash/Polish-EQ-Bench	72.53
11	speakleash/Bielik-11B-v2.5-InstructOpen	speakleash	Apr 2026	SpeakLeash/Polish-EQ-Bench	72.00
12	Qwen/Qwen2-72B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	71.23
13	meta-llama/Meta-Llama-3-70B-InstructOpen	meta-llama	Apr 2026	SpeakLeash/Polish-EQ-Bench	71.21
14	speakleash/Bielik-11B-v3.0-InstructOpen	speakleash	Apr 2026	SpeakLeash/Polish-EQ-Bench	71.20
15	GPT-4o-mini-2024-07-18Open	OpenAI	Apr 2026	SpeakLeash/Polish-EQ-Bench	71.15
16	Qwen/Qwen2.5-32B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	71.15
17	speakleash/Bielik-11B-v2.3-InstructOpen	speakleash	Apr 2026	SpeakLeash/Polish-EQ-Bench	70.86
18	meta-llama/Llama-3.3-70B-InstructOpen	meta-llama	Apr 2026	SpeakLeash/Polish-EQ-Bench	70.73
19	mistralai/Mistral-Small-24B-Instruct-2501Open	mistralai	Apr 2026	SpeakLeash/Polish-EQ-Bench	70.52
20	CYFRAGOVPL/Llama-PLLuM-70B-instructOpen	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	69.99
21	alpindale/WizardLM-2-8x22B (API)API	alpindale	Apr 2026	SpeakLeash/Polish-EQ-Bench	69.56
22	Qwen/Qwen2.5-14B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	69.17
23	speakleash/Bielik-11B-v2.2-InstructOpen	speakleash	Apr 2026	SpeakLeash/Polish-EQ-Bench	69.05
24	Qwen2-72BOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	68.93
25	Qwen/Qwen2.5-72B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	68.49
26	speakleash/Bielik-11B-v2.0-InstructOpen	speakleash	Apr 2026	SpeakLeash/Polish-EQ-Bench	68.24
27	Qwen/Qwen1.5-72B-ChatOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	68.03
28	mistralai/Mixtral-8x22B-Instruct-v0.1 (API)API	mistralai	Apr 2026	SpeakLeash/Polish-EQ-Bench	67.63
29	THUDM/glm-4-9b-chatOpen	THUDM	Apr 2026	SpeakLeash/Polish-EQ-Bench	61.79
30	mistralai/Mistral-Nemo-Instruct-2407Open	mistralai	Apr 2026	SpeakLeash/Polish-EQ-Bench	61.76
31	speakleash/Bielik-11B-v2.1-InstructOpen	speakleash	Apr 2026	SpeakLeash/Polish-EQ-Bench	60.07
32	Qwen1.5-32B-ChatOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	59.63
33	openchat/openchat-3.5-0106-gemmaOpen	openchat	Apr 2026	SpeakLeash/Polish-EQ-Bench	59.58
34	microsoft/phi-4Open	microsoft	Apr 2026	SpeakLeash/Polish-EQ-Bench	59.10
35	Qwen/Qwen2.5-7B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	58.58
36	aya-23-35BOpen	CohereForAI	Apr 2026	SpeakLeash/Polish-EQ-Bench	58.41
37	GPT-3.5-turboOpen	OpenAI	Apr 2026	SpeakLeash/Polish-EQ-Bench	57.70
38	Qwen2-57B-A14B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	57.64
39	mistralai/Mixtral-8x7B-Instruct-v0.1Open	mistralai	Apr 2026	SpeakLeash/Polish-EQ-Bench	57.61
40	c4ai-command-r-v01Open	CohereForAI	Apr 2026	SpeakLeash/Polish-EQ-Bench	56.43
41	Phi-3-medium-4k-instructOpen	microsoft	Apr 2026	SpeakLeash/Polish-EQ-Bench	56.40
42	upstage/SOLAR-10.7B-Instruct-v1.0Open	upstage	Apr 2026	SpeakLeash/Polish-EQ-Bench	55.21
43	CYFRAGOVPL/pllum-12b-nc-chat-250715Open	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	55.17
44	Hermes-2-Theta-Llama-3-8BOpen	NousResearch	Apr 2026	SpeakLeash/Polish-EQ-Bench	54.88
45	NeuralDaredevil-8B-abliteratedOpen	mlabonne	Apr 2026	SpeakLeash/Polish-EQ-Bench	54.74
46	Hermes-2-Pro-Llama-3-8BOpen	NousResearch	Apr 2026	SpeakLeash/Polish-EQ-Bench	54.57
47	utter-project/EuroLLM-9B-InstructOpen	utter-project	Apr 2026	SpeakLeash/Polish-EQ-Bench	54.11
48	Qwen1.5-32BOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	54.03
49	Qwen2-7B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	53.74
50	speakleash/Bielik-4.5B-v3.0-InstructOpen	speakleash	Apr 2026	SpeakLeash/Polish-EQ-Bench	53.58
51	recurrentgemma-9b-itOpen	google	Apr 2026	SpeakLeash/Polish-EQ-Bench	52.82
52	CYFRAGOVPL/PLLuM-12B-chatOpen	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	52.26
53	Qwen1.5-72BOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	51.44
54	microsoft/Phi-4-mini-instructOpen	microsoft	Apr 2026	SpeakLeash/Polish-EQ-Bench	50.52
55	berkeley-nest/Starling-LM-7B-alphaOpen	berkeley-nest	Apr 2026	SpeakLeash/Polish-EQ-Bench	49.63
56	Nous-Hermes-2-SOLAR-10.7BOpen	NousResearch	Apr 2026	SpeakLeash/Polish-EQ-Bench	49.27
57	openchat-3.5-1210Open	openchat	Apr 2026	SpeakLeash/Polish-EQ-Bench	49.04
58	Delexa-7bOpen	lex-hue	Apr 2026	SpeakLeash/Polish-EQ-Bench	48.46
59	Qwen1.5-14B-ChatOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	47.96
60	CYFRAGOVPL/PLLuM-8x7B-nc-chatOpen	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	47.29
61	Mistral-7B-Instruct-v0.2Open	mistralai	Apr 2026	SpeakLeash/Polish-EQ-Bench	47.02
62	meta-llama/Meta-Llama-3-8B-InstructOpen	meta-llama	Apr 2026	SpeakLeash/Polish-EQ-Bench	46.53
63	Yi-1.5-9B-ChatOpen	01-ai	Apr 2026	SpeakLeash/Polish-EQ-Bench	46.50
64	01-ai/Yi-1.5-34B-ChatOpen	01-ai	Apr 2026	SpeakLeash/Polish-EQ-Bench	46.32
65	CYFRAGOVPL/Llama-PLLuM-8B-chatOpen	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	46.20
66	meta-llama/Llama-3.2-3B-InstructOpen	meta-llama	Apr 2026	SpeakLeash/Polish-EQ-Bench	46.19
67	aya-23-8BOpen	CohereForAI	Apr 2026	SpeakLeash/Polish-EQ-Bench	45.43
68	openchat/openchat-3.5-0106Open	openchat	Apr 2026	SpeakLeash/Polish-EQ-Bench	45.42
69	nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16Open	nvidia	Apr 2026	SpeakLeash/Polish-EQ-Bench	45.28
70	CYFRAGOVPL/PLLuM-8x7B-chatOpen	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	45.22
71	mistralai/Mistral-7B-Instruct-v0.3Open	mistralai	Apr 2026	SpeakLeash/Polish-EQ-Bench	45.21
72	Kruk-7B-SP-001Open	Remek	Apr 2026	SpeakLeash/Polish-EQ-Bench	44.44
73	Starling-LM-7B-betaOpen	Nexusflow	Apr 2026	SpeakLeash/Polish-EQ-Bench	43.78
74	OpenChat3.5-0106-Spichlerz-BocianOpen	Remek	Apr 2026	SpeakLeash/Polish-EQ-Bench	42.84
75	falcon-11BOpen	tiiuae	Apr 2026	SpeakLeash/Polish-EQ-Bench	42.41
76	CYFRAGOVPL/PLLuM-8x7B-nc-instructOpen	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	41.75
77	OpenChat3.5-0106-Spichlerz-Inst-001Open	Remek	Apr 2026	SpeakLeash/Polish-EQ-Bench	41.60
78	internlm2-chat-7b-sftOpen	internlm	Apr 2026	SpeakLeash/Polish-EQ-Bench	41.38
79	CYFRAGOVPL/PLLuM-8x7B-instructOpen	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	39.55
80	internlm2-chat-7bOpen	internlm	Apr 2026	SpeakLeash/Polish-EQ-Bench	39.53
81	Llama3-ChatQA-1.5-8BOpen	nvidia	Apr 2026	SpeakLeash/Polish-EQ-Bench	39.36
82	Meta-Llama-3-70BOpen	meta-llama	Apr 2026	SpeakLeash/Polish-EQ-Bench	39.09
83	OpenHermes-2.5-Mistral-7BOpen	teknium	Apr 2026	SpeakLeash/Polish-EQ-Bench	37.48
84	internlm/internlm2-chat-20bOpen	internlm	Apr 2026	SpeakLeash/Polish-EQ-Bench	36.31
85	CYFRAGOVPL/PLLuM-12B-instructOpen	CYFRAGOVPL	Apr 2026	SpeakLeash/Polish-EQ-Bench	36.21
86	Qwen/Qwen2.5-3B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	35.87
87	Qwen2-7BOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	35.51
88	OpenHermes-13BOpen	teknium	Apr 2026	SpeakLeash/Polish-EQ-Bench	34.91
89	Bielik-SOLAR-LIKE-10.7B-Instruct-v0.1Open	TeeZee	Apr 2026	SpeakLeash/Polish-EQ-Bench	34.17
90	speakleash/Bielik-7B-Instruct-v0.1Open	speakleash	Apr 2026	SpeakLeash/Polish-EQ-Bench	31.26
91	Qwen/Qwen2.5-1.5B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	27.63
92	Llama-3-8B-Omnibus-1-PL-v01-INSTRUCTOpen	Remek	Apr 2026	SpeakLeash/Polish-EQ-Bench	26.63
93	Phi-3-mini-4k-instructOpen	microsoft	Apr 2026	SpeakLeash/Polish-EQ-Bench	26.08
94	Voicelab/trurl-2-13b-academicOpen	Voicelab	Apr 2026	SpeakLeash/Polish-EQ-Bench	24.56
95	Qwen1.5-7B-ChatOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	23.98
96	Qwen1.5-7BOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	20.95
97	meta-llama/Llama-3.2-1B-InstructOpen	meta-llama	Apr 2026	SpeakLeash/Polish-EQ-Bench	17.82
98	gemma-1.1-2b-itOpen	google	Apr 2026	SpeakLeash/Polish-EQ-Bench	16.47
99	Qwen2-1.5B-InstructOpen	Qwen	Apr 2026	SpeakLeash/Polish-EQ-Bench	14.79
100	internlm2-chat-1_8bOpen	internlm	Apr 2026	SpeakLeash/Polish-EQ-Bench	12.13
101	Yi-1.5-6B-ChatOpen	01-ai	Apr 2026	SpeakLeash/Polish-EQ-Bench	4.89

Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.

§ 03 · Progress

1 steps
of state of the art.

Each row below marks a model that broke the previous record on eq-score. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.

Higher scores win. Each subsequent entry improved upon the previous best.

SOTA line · eq-score

Apr 2, 2026mistralai/Mistral-Large-Instruct-2407mistralai78.07

Fig 3 · SOTA-setting models only. 1 entries span Apr 2026 → Apr 2026.

§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result ↵Read submission guide

What a submission needs

01A public checkpoint or API endpoint
02A reproduction script with frozen commit + seed
03Declared evaluation environment (Python, deps)
04One row per metric declared by this dataset
05A contact so we can follow up on discrepancies

Polish Emotional Intelligence Benchmark (EQ-Bench v2 PL).

Best published scores.

1 stepsof state of the art.

Have a score that beatsthis table?

1 steps
of state of the art.

Have a score that beats
this table?