Codesota · Models · Gemma 3 (27B, IT)Google9 results · 1 benchmarks
Model card

Gemma 3 (27B, IT).

Googleopen-source
§ 02 · Benchmarks

Every benchmark Gemma 3 (27B, IT) has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01Polish MT-BenchNatural Language Processing · Polish Conversation Qualitywriting9.7%#1/50source ↗
02Polish MT-BenchNatural Language Processing · Polish Conversation Qualityextraction9.9%#1/50source ↗
03Polish MT-BenchNatural Language Processing · Polish Conversation Qualityhumanities10.0%#1/50source ↗
04Polish MT-BenchNatural Language Processing · Polish Conversation Qualitymath8.3%#1/50source ↗
05Polish MT-BenchNatural Language Processing · Polish Conversation Qualitypl-score9.3%#1/50source ↗
06Polish MT-BenchNatural Language Processing · Polish Conversation Qualityroleplay9.9%#1/50source ↗
07Polish MT-BenchNatural Language Processing · Polish Conversation Qualitycoding8.1%#3/50source ↗
08Polish MT-BenchNatural Language Processing · Polish Conversation Qualitystem9.9%#3/50source ↗
09Polish MT-BenchNatural Language Processing · Polish Conversation Qualityreasoning8.4%#7/50source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where Gemma 3 (27B, IT) actually performs.

Natural Language Processing
1
benchmark
avg rank #2.1
§ 05 · Related models

Other Google models scored on Codesota.

Gemini 2.5 Pro
16 results · 2 SOTA
Gemini-3.1-Pro
7 results · 2 SOTA
Gemini 1.5 Pro
15 results · 1 SOTA
Gemini 3 Pro
Undisclosed params · 12 results · 1 SOTA
ViT-H/14
632M params · 2 results · 1 SOTA
CoCa (finetuned)
2.1B params · 1 result · 1 SOTA
Gemini 2.0 Flash
1 result · 1 SOTA
Noise2Music
Unknown params · 1 result · 1 SOTA
§ 06 · Sources & freshness

Where these numbers come from.

SpeakLeash/MT-Bench-PL
9
results
9 of 9 rows marked verified.