Codesota · Models · Starling-LM-7B-alphaberkeley-nest14 results · 3 benchmarks
Model card

Starling-LM-7B-alpha.

berkeley-nestopen-source7.24B params
§ 01 · Benchmarks

Every benchmark Starling-LM-7B-alpha has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01Polish EQ-BenchNatural Language Processing · Polish Emotional Intelligenceeq-score49.6%#55/101source ↗
02CPTU-BenchNatural Language Processing · Polish Text Understandingphraseology2.9%#66/93source ↗
03CPTU-BenchNatural Language Processing · Polish Text Understandinglanguage-understanding2.9%#73/93source ↗
04CPTU-BenchNatural Language Processing · Polish Text Understandingtricky-questions1.7%#74/93source ↗
05CPTU-BenchNatural Language Processing · Polish Text Understandingaverage2.6%#75/93source ↗
06CPTU-BenchNatural Language Processing · Polish Text Understandingsentiment3.1%#79/93source ↗
07Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generaldyk61.1%#106/489source ↗
08Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generaleq-bench36.7%#112/299source ↗
09Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalpolemo2-in79.0%#114/490source ↗
10Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalcbd31.1%#122/490source ↗
11Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalaverage50.0%#134/491source ↗
12Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalbelebele77.2%#161/490source ↗
13Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalppc60.9%#335/490source ↗
14Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalpolqa-open-book76.9%#336/489source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where Starling-LM-7B-alpha actually performs.

Natural Language Processing
3
benchmarks
avg rank #131.6
§ 05 · Sources & freshness

Where these numbers come from.

speakleash/open_pl_llm_leaderboard
8
results
SpeakLeash/CPTU-Bench
5
results
SpeakLeash/Polish-EQ-Bench
1
result
14 of 14 rows marked verified.