Codesota · Models · Meta-Llama-3.1-8B-Instructmeta-llama13 results · 2 benchmarks
Model card

Meta-Llama-3.1-8B-Instruct.

meta-llamaopen-source8.03B params
§ 01 · Benchmarks

Every benchmark Meta-Llama-3.1-8B-Instruct has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01CPTU-BenchNatural Language Processing · Polish Text Understandingsentiment4.0%#28/93source ↗
02CPTU-BenchNatural Language Processing · Polish Text Understandinglanguage-understanding3.4%#56/93source ↗
03CPTU-BenchNatural Language Processing · Polish Text Understandingaverage3.0%#60/93source ↗
04CPTU-BenchNatural Language Processing · Polish Text Understandingtricky-questions2.1%#62/93source ↗
05CPTU-BenchNatural Language Processing · Polish Text Understandingphraseology2.6%#75/93source ↗
06Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalbelebele83.6%#114/490source ↗
07Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalaverage51.4%#124/491source ↗
08Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generaldyk57.4%#127/489source ↗
09Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalpolemo2-in78.0%#129/490source ↗
10Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generaleq-bench27.3%#157/299source ↗
11Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalpolqa-open-book84.2%#220/489source ↗
12Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalcbd25.7%#228/490source ↗
13Open PL LLM LeaderboardNatural Language Processing · Polish LLM Generalppc69.0%#239/490source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where Meta-Llama-3.1-8B-Instruct actually performs.

Natural Language Processing
2
benchmarks
avg rank #124.5
§ 04 · Related models

Other meta-llama models scored on Codesota.

Llama-3.3-70B-Instruct
70.6B params · 1 result
Llama-2-7b-chat-hf
0 results
Llama-2-7b-hf
0 results
Llama-3.2-1B
0 results
Llama-3.2-1B-Instruct
1.24B params · 0 results
Llama-3.2-3B
0 results
Llama-3.2-3B-Instruct
3.21B params · 0 results
Llama-4-Scout-17B-16E
0 results
§ 05 · Sources & freshness

Where these numbers come from.

speakleash/open_pl_llm_leaderboard
8
results
SpeakLeash/CPTU-Bench
5
results
13 of 13 rows marked verified.