Qwen3-235B-A22B (Base).

Qwen · Large language model

Added from Papers with Code MMLU-Pro refresh on 2026-05-19.

§ 02 · Benchmarks

Every benchmark Qwen3-235B-A22B (Base) has a recorded score for.

#	Benchmark	Area · Task	Metric	Value	Rank	Date	Source
01	MMLU-Pro	Reasoning · Commonsense Reasoning	accuracy	68.2%	#65/73	2025-05-14	source ↗

The Rank column shows this model’s position among all models scored on the same benchmark and metric (competitors after the slash). A #1 in red marks the current SOTA. Rows are sorted by rank, then by newest result.
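The rank cell and sort order described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how the site itself is implemented: the helper names `parse_rank` and `sort_key` and the dictionary field names are hypothetical, and the only data used is the single MMLU-Pro row recorded on this page.

```python
from datetime import date
from statistics import mean


def parse_rank(cell: str) -> tuple[int, int]:
    """Split a rank cell such as '#65/73' into (position, number after the slash)."""
    position, _, after_slash = cell.lstrip("#").partition("/")
    return int(position), int(after_slash)


def sort_key(row: dict) -> tuple[int, int]:
    """Order rows by rank ascending, then by newest result first."""
    position, _ = parse_rank(row["rank"])
    # Negating the ordinal puts later dates ahead of earlier ones on a rank tie.
    return position, -date.fromisoformat(row["date"]).toordinal()


# The single row recorded on this page (field names are assumptions).
rows = [
    {"benchmark": "MMLU-Pro", "value": 68.2, "rank": "#65/73", "date": "2025-05-14"},
]

ordered = sorted(rows, key=sort_key)

# An average rank over the rows; with one row it equals that row's rank.
avg_rank = mean(parse_rank(r["rank"])[0] for r in rows)
print(ordered[0]["benchmark"], f"avg rank #{avg_rank:.1f}")
```

With a single recorded result the average rank is simply that result's rank, which matches the per-area figure shown under "Strengths by area".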

§ 03 · Strengths by area

Where Qwen3-235B-A22B (Base) actually performs.

Reasoning · 1 benchmark · avg rank #65.0

§ 05 · Related models

Other Qwen models scored on Codesota.

Qwen3-VL-235B-A22B-Instruct

9 results · 1 SOTA

Qwen3-VL-235B-A22B-Thinking

§ 06 · Sources & freshness

Where these numbers come from.

paperswithcode · 1 result

0 of 1 rows marked verified.