Recent studyBlind TTS Elo is live. Compare two anonymous voice samples, vote after listening, and help separate real preference signal from noise.Vote in the study ->
Codesota · Natural Language Processing · Polish Emotional Intelligence · Polish EQ-BenchTasks/Natural Language Processing/Polish Emotional Intelligence
Polish Emotional Intelligence · benchmark dataset · 2025 · PL

Polish Emotional Intelligence Benchmark (EQ-Bench v2 PL).

Evaluates LLMs on emotional intelligence in Polish. Based on EQ-Bench v2 methodology adapted for Polish language. Models predict emotional intensity changes across 171 questions. Score adjusted for parseability: Benchmark Score × (Parseable / 171). Created by SpeakLeash.

Paper Download datasetSubmit a result
§ 01 · Leaderboard

Best published scores.

101 results indexed across 1 metric. Shaded row marks current SOTA; ties broken by submission date.


Primary
eq-score · higher is better
eq-score· primary
101 rows
#ModelOrgSubmittedPaper / codeeq-score
01Mistral-Large-Instruct-2407OSSmistralaiApr 2026SpeakLeash/Polish-EQ-Bench78.07
02Mistral-Large-Instruct-2411OSSmistralaiApr 2026SpeakLeash/Polish-EQ-Bench77.29
03Meta-Llama-3.1-405B-Instruct-FP8OSSmeta-llamaApr 2026SpeakLeash/Polish-EQ-Bench77.23
04GPT-4o-2024-08-06OSSOpenAIApr 2026SpeakLeash/Polish-EQ-Bench75.15
05gpt-4-turbo-2024-04-09OSSApr 2026SpeakLeash/Polish-EQ-Bench74.59
06Bielik-11B-v2.6-InstructOSSspeakleashApr 2026SpeakLeash/Polish-EQ-Bench73.70
07🚧DeepSeek-V3-0324OSSdeepseek-aiApr 2026SpeakLeash/Polish-EQ-Bench73.46
08Mistral-Small-Instruct-2409OSSmistralaiApr 2026SpeakLeash/Polish-EQ-Bench72.85
09Llama-PLLuM-70B-chatOSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench72.56
10Meta-Llama-3.1-70B-InstructOSSmeta-llamaApr 2026SpeakLeash/Polish-EQ-Bench72.53
11Bielik-11B-v2.5-InstructOSSspeakleashApr 2026SpeakLeash/Polish-EQ-Bench72.00
12Qwen2-72B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench71.23
13Meta-Llama-3-70B-InstructOSSmeta-llamaApr 2026SpeakLeash/Polish-EQ-Bench71.21
14Bielik-11B-v3.0-InstructOSSspeakleashApr 2026SpeakLeash/Polish-EQ-Bench71.20
15GPT-4o-mini-2024-07-18OSSOpenAIApr 2026SpeakLeash/Polish-EQ-Bench71.15
16Qwen2.5-32B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench71.15
17Bielik-11B-v2.3-InstructOSSspeakleashApr 2026SpeakLeash/Polish-EQ-Bench70.86
18Llama-3.3-70B-InstructOSSmeta-llamaApr 2026SpeakLeash/Polish-EQ-Bench70.73
19Mistral-Small-24B-Instruct-2501OSSmistralaiApr 2026SpeakLeash/Polish-EQ-Bench70.52
20Llama-PLLuM-70B-instructOSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench69.99
21WizardLM-2-8x22BOSSalpindaleApr 2026SpeakLeash/Polish-EQ-Bench69.56
22Qwen2.5-14B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench69.17
23Bielik-11B-v2.2-InstructOSSspeakleashApr 2026SpeakLeash/Polish-EQ-Bench69.05
24Qwen2-72BOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench68.93
25Qwen2.5-72B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench68.49
26Bielik-11B-v2.0-InstructOSSspeakleashApr 2026SpeakLeash/Polish-EQ-Bench68.24
27Qwen1.5-72B-ChatOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench68.03
28Mixtral-8x22B-Instruct-v0.1OSSmistralaiApr 2026SpeakLeash/Polish-EQ-Bench67.63
29glm-4-9b-chatOSSTHUDMApr 2026SpeakLeash/Polish-EQ-Bench61.79
30Mistral-Nemo-Instruct-2407OSSmistralaiApr 2026SpeakLeash/Polish-EQ-Bench61.76
31Bielik-11B-v2.1-InstructOSSspeakleashApr 2026SpeakLeash/Polish-EQ-Bench60.07
32Qwen1.5-32B-ChatOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench59.63
33openchat-3.5-0106-gemmaOSSopenchatApr 2026SpeakLeash/Polish-EQ-Bench59.58
34phi-4OSSmicrosoftApr 2026SpeakLeash/Polish-EQ-Bench59.10
35Qwen2.5-7B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench58.58
36aya-23-35BOSSCohereForAIApr 2026SpeakLeash/Polish-EQ-Bench58.41
37GPT-3.5-turboOSSOpenAIApr 2026SpeakLeash/Polish-EQ-Bench57.70
38Qwen2-57B-A14B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench57.64
39Mixtral-8x7B-Instruct-v0.1OSSmistralaiApr 2026SpeakLeash/Polish-EQ-Bench57.61
40c4ai-command-r-v01OSSCohereForAIApr 2026SpeakLeash/Polish-EQ-Bench56.43
41Phi-3-medium-4k-instructOSSmicrosoftApr 2026SpeakLeash/Polish-EQ-Bench56.40
42SOLAR-10.7B-Instruct-v1.0OSSupstageApr 2026SpeakLeash/Polish-EQ-Bench55.21
43pllum-12b-nc-chat-250715OSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench55.17
44Hermes-2-Theta-Llama-3-8BOSSNousResearchApr 2026SpeakLeash/Polish-EQ-Bench54.88
45NeuralDaredevil-8B-abliteratedOSSmlabonneApr 2026SpeakLeash/Polish-EQ-Bench54.74
46Hermes-2-Pro-Llama-3-8BOSSNousResearchApr 2026SpeakLeash/Polish-EQ-Bench54.57
47EuroLLM-9B-InstructOSSutter-projectApr 2026SpeakLeash/Polish-EQ-Bench54.11
48Qwen1.5-32BOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench54.03
49Qwen2-7B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench53.74
50Bielik-4.5B-v3.0-InstructOSSspeakleashApr 2026SpeakLeash/Polish-EQ-Bench53.58
51recurrentgemma-9b-itOSSgoogleApr 2026SpeakLeash/Polish-EQ-Bench52.82
52PLLuM-12B-chatOSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench52.26
53Qwen1.5-72BOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench51.44
54Phi-4-mini-instructOSSmicrosoftApr 2026SpeakLeash/Polish-EQ-Bench50.52
55Starling-LM-7B-alphaOSSberkeley-nestApr 2026SpeakLeash/Polish-EQ-Bench49.63
56Nous-Hermes-2-SOLAR-10.7BOSSNousResearchApr 2026SpeakLeash/Polish-EQ-Bench49.27
57openchat-3.5-1210OSSopenchatApr 2026SpeakLeash/Polish-EQ-Bench49.04
58Delexa-7bOSSlex-hueApr 2026SpeakLeash/Polish-EQ-Bench48.46
59Qwen1.5-14B-ChatOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench47.96
60PLLuM-8x7B-nc-chatOSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench47.29
61Mistral-7B-Instruct-v0.2OSSmistralaiApr 2026SpeakLeash/Polish-EQ-Bench47.02
62Meta-Llama-3-8B-InstructOSSmeta-llamaApr 2026SpeakLeash/Polish-EQ-Bench46.53
63Yi-1.5-9B-ChatOSS01-aiApr 2026SpeakLeash/Polish-EQ-Bench46.50
64Yi-1.5-34B-ChatOSS01-aiApr 2026SpeakLeash/Polish-EQ-Bench46.32
65Llama-PLLuM-8B-chatOSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench46.20
66Llama-3.2-3B-InstructOSSmeta-llamaApr 2026SpeakLeash/Polish-EQ-Bench46.19
67aya-23-8BOSSCohereForAIApr 2026SpeakLeash/Polish-EQ-Bench45.43
68openchat-3.5-0106OSSopenchatApr 2026SpeakLeash/Polish-EQ-Bench45.42
69NVIDIA-Nemotron-3-Nano-30B-A3B-BF16OSSnvidiaApr 2026SpeakLeash/Polish-EQ-Bench45.28
70PLLuM-8x7B-chatOSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench45.22
71Mistral-7B-Instruct-v0.3OSSmistralaiApr 2026SpeakLeash/Polish-EQ-Bench45.21
72Kruk-7B-SP-001OSSRemekApr 2026SpeakLeash/Polish-EQ-Bench44.44
73Starling-LM-7B-betaOSSNexusflowApr 2026SpeakLeash/Polish-EQ-Bench43.78
74OpenChat3.5-0106-Spichlerz-BocianOSSRemekApr 2026SpeakLeash/Polish-EQ-Bench42.84
75falcon-11BOSStiiuaeApr 2026SpeakLeash/Polish-EQ-Bench42.41
76PLLuM-8x7B-nc-instructOSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench41.75
77OpenChat3.5-0106-Spichlerz-Inst-001OSSRemekApr 2026SpeakLeash/Polish-EQ-Bench41.60
78internlm2-chat-7b-sftOSSinternlmApr 2026SpeakLeash/Polish-EQ-Bench41.38
79PLLuM-8x7B-instructOSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench39.55
80internlm2-chat-7bOSSinternlmApr 2026SpeakLeash/Polish-EQ-Bench39.53
81Llama3-ChatQA-1.5-8BOSSnvidiaApr 2026SpeakLeash/Polish-EQ-Bench39.36
82Meta-Llama-3-70BOSSmeta-llamaApr 2026SpeakLeash/Polish-EQ-Bench39.09
83OpenHermes-2.5-Mistral-7BOSStekniumApr 2026SpeakLeash/Polish-EQ-Bench37.48
84internlm2-chat-20bOSSinternlmApr 2026SpeakLeash/Polish-EQ-Bench36.31
85PLLuM-12B-instructOSSCYFRAGOVPLApr 2026SpeakLeash/Polish-EQ-Bench36.21
86Qwen2.5-3B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench35.87
87Qwen2-7BOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench35.51
88OpenHermes-13BOSStekniumApr 2026SpeakLeash/Polish-EQ-Bench34.91
89Bielik-SOLAR-LIKE-10.7B-Instruct-v0.1OSSTeeZeeApr 2026SpeakLeash/Polish-EQ-Bench34.17
90Bielik-7B-Instruct-v0.1OSSspeakleashApr 2026SpeakLeash/Polish-EQ-Bench31.26
91Qwen2.5-1.5B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench27.63
92Llama-3-8B-Omnibus-1-PL-v01-INSTRUCTOSSRemekApr 2026SpeakLeash/Polish-EQ-Bench26.63
93Phi-3-mini-4k-instructOSSmicrosoftApr 2026SpeakLeash/Polish-EQ-Bench26.08
94trurl-2-13b-academicOSSVoicelabApr 2026SpeakLeash/Polish-EQ-Bench24.56
95Qwen1.5-7B-ChatOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench23.98
96Qwen1.5-7BOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench20.95
97Llama-3.2-1B-InstructOSSmeta-llamaApr 2026SpeakLeash/Polish-EQ-Bench17.82
98gemma-1.1-2b-itOSSgoogleApr 2026SpeakLeash/Polish-EQ-Bench16.47
99Qwen2-1.5B-InstructOSSQwenApr 2026SpeakLeash/Polish-EQ-Bench14.79
100internlm2-chat-1_8bOSSinternlmApr 2026SpeakLeash/Polish-EQ-Bench12.13
101Yi-1.5-6B-ChatOSS01-aiApr 2026SpeakLeash/Polish-EQ-Bench4.89
Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.
§ 03 · Progress

1 steps
of state of the art.

Each row below marks a model that broke the previous record on eq-score. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.

Higher scores win. Each subsequent entry improved upon the previous best.

SOTA line · eq-score
  1. Apr 2, 2026Mistral-Large-Instruct-2407mistralai78.07
Fig 3 · SOTA-setting models only. 1 entries span Apr 2026 Apr 2026.
§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result Read submission guide
What a submission needs
  • 01A public checkpoint or API endpoint
  • 02A reproduction script with frozen commit + seed
  • 03Declared evaluation environment (Python, deps)
  • 04One row per metric declared by this dataset
  • 05A contact so we can follow up on discrepancies