Codesota · Models · b11p347yth03847tyhy03847yt10 results · 1 benchmarks
Model card

b11p.

347yth03847tyhy03847ytopen-source
§ 02 · Benchmarks

Every benchmark b11p has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalpolqa-open-book92.8%#2/489source ↗
02Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalcbd35.6%#48/490source ↗
03Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalpolemo2-in84.8%#50/490source ↗
04Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalaverage60.2%#73/491source ↗
05Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalppc75.5%#103/490source ↗
06Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalpoquad-open-book64.9%#109/337source ↗
07Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalbelebele83.3%#117/490source ↗
08Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generaldyk59.0%#119/489source ↗
09Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalklej-ner-generative48.8%#125/489source ↗
10Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalklej-ner-mc46.0%#161/490source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where b11p actually performs.

Natural Language Processing
1
benchmark
avg rank #90.7
§ 05 · Related models

Other 347yth03847tyhy03847yt models scored on Codesota.

b11t2
0 results
§ 06 · Sources & freshness

Where these numbers come from.

speakleash/open_pl_llm_leaderboard
10
results
10 of 10 rows marked verified.