Codesota · Models · Meta-Llama-3.1-405B-Instruct-FP8 · meta-llama · 9 results · 2 benchmarks
Model card

Meta-Llama-3.1-405B-Instruct-FP8.

meta-llama · open-source
§ 01 · Benchmarks

Every benchmark Meta-Llama-3.1-405B-Instruct-FP8 has a recorded score for.

| # | Benchmark | Area · Task | Metric | Value | Rank | Date | Source |
|---|---|---|---|---|---|---|---|
| 01 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | belebele | 93.4% | #1/490 | | source ↗ |
| 02 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | average | 69.4% | #2/491 | | source ↗ |
| 03 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | ppc | 82.5% | #2/490 | | source ↗ |
| 04 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | polemo2-in | 89.1% | #2/490 | | source ↗ |
| 05 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | eq-bench | 64.0% | #3/299 | | source ↗ |
| 06 | Polish EQ-Bench | Natural Language Processing · Polish Emotional Intelligence | eq-score | 77.2% | #3/101 | | source ↗ |
| 07 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | dyk | 73.7% | #5/489 | | source ↗ |
| 08 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | cbd | 42.7% | #8/490 | | source ↗ |
| 09 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | polqa-open-book | 91.0% | #36/489 | | source ↗ |
The Rank column shows this model’s position among all models scored on the same benchmark and metric, with the number of competitors after the slash. A rank of #1 (shown in red) marks the current SOTA. Rows are sorted by rank, then by newest result.
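The rank notation described above can be read mechanically. The following is a minimal sketch, not Codesota's own code: `Rank`, `parse_rank`, and the `rows` list are hypothetical names introduced for illustration, and the rank strings are copied from the benchmark table. Because the rows carry no dates here, the "then newest result" tie-break is approximated by relying on Python's stable sort, which keeps tied rows in their listed order.

```python
import re
from typing import NamedTuple


class Rank(NamedTuple):
    """Hypothetical container for a parsed rank string."""
    position: int     # this model's position on the benchmark + metric
    competitors: int  # the number after the slash


_RANK_RE = re.compile(r"^#(\d+)/(\d+)$")


def parse_rank(text: str) -> Rank:
    """Parse a string such as '#36/489' into (position, competitors)."""
    match = _RANK_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised rank string: {text!r}")
    return Rank(int(match.group(1)), int(match.group(2)))


# (metric, rank) pairs as they appear in the benchmark table.
rows = [
    ("polqa-open-book", "#36/489"),
    ("belebele", "#1/490"),
    ("cbd", "#8/490"),
    ("average", "#2/491"),
    ("ppc", "#2/490"),
]

# Sort by position only; sorted() is stable, so ties keep their input order.
ordered = sorted(rows, key=lambda row: parse_rank(row[1]).position)

# A row holds the current SOTA when its position is 1.
sota_metrics = [m for m, r in rows if parse_rank(r).position == 1]
```

With real data, a date field would be added as a second sort key (newest first) instead of relying on input order.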
§ 02 · Strengths by area

Where Meta-Llama-3.1-405B-Instruct-FP8 actually performs.

Natural Language Processing · 2 benchmarks · avg rank #6.9
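The average rank can be checked against the benchmark table. The sketch below assumes the figure is a plain arithmetic mean over all nine recorded ranks (the page does not state how it is computed); under that assumption the result rounds to the displayed value. The variable names are illustrative only.

```python
# Rank positions from the nine rows of the benchmark table, in listed order.
ranks = [1, 2, 2, 2, 3, 3, 5, 8, 36]

# Assumption: "avg rank" is the unweighted mean of all recorded positions.
avg_rank = sum(ranks) / len(ranks)  # 62 / 9

# Format to one decimal place, matching the style shown on the page.
label = f"avg rank #{avg_rank:.1f}"
print(label)
```

Note that the single #36 result on polqa-open-book pulls the mean up considerably; without it, the remaining eight ranks average 3.25.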
§ 04 · Related models

Other meta-llama models scored on Codesota.

Llama-3.3-70B-Instruct · 70.6B params · 1 result
Llama-2-7b-chat-hf · 0 results
Llama-2-7b-hf · 0 results
Llama-3.2-1B · 0 results
Llama-3.2-1B-Instruct · 1.24B params · 0 results
Llama-3.2-3B · 0 results
Llama-3.2-3B-Instruct · 3.21B params · 0 results
Llama-4-Scout-17B-16E · 0 results
§ 05 · Sources & freshness

Where these numbers come from.

speakleash/open_pl_llm_leaderboard · 8 results
SpeakLeash/Polish-EQ-Bench · 1 result
9 of 9 rows marked verified.