Codesota · Models · Qwen2.5-72B-InstructAlibaba4 results · 4 benchmarks
Model card

Qwen2.5-72B-Instruct.

Alibabaopen-source72B paramsDense Transformer

Qwen2.5-72B-Instruct. Released September 2024. Strong open-source model. Instruct-tuned version of the Qwen2.5-72B base. Top open-source model on many reasoning benchmarks at release.

§ 02 · Benchmarks

Every benchmark Qwen2.5-72B-Instruct has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01GSM8KReasoning · Mathematical Reasoningaccuracy95.8%#21/48source ↗
02MATHReasoning · Mathematical Reasoningaccuracy83.1%#25/46source ↗
03MMLUReasoning · Commonsense Reasoningaccuracy86.1%#40/64source ↗
04GPQA DiamondReasoning · Multi-step Reasoningaccuracy49.0%#67/74source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where Qwen2.5-72B-Instruct actually performs.

Reasoning
4
benchmarks
avg rank #38.3
§ 05 · Related models

Other Alibaba models scored on Codesota.

Qwen3-235B-A22B
235B (22B active) params · 9 results · 1 SOTA
Qwen2-VL 72B
9 results
Qwen3.5-397B-A17B
8 results
Qwen3.5-122B-A10B
6 results
Qwen3.5-27B
6 results
Qwen3.5-35B-A3B
6 results
Qwen2-VL 7B
7B params · 5 results
Qwen2.5-Coder 32B
32B params · 4 results
§ 06 · Sources & freshness

Where these numbers come from.

qwen25-tech-report
4
results
4 of 4 rows marked verified.