Codesota · Models · Qwen3.6 PlusAlibaba Cloud4 results · 4 benchmarks
Model card

Qwen3.6 Plus.

Alibaba Cloud
§ 01 · Benchmarks

Every benchmark Qwen3.6 Plus has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01MMMUMultimodal · Visual Question Answeringaccuracy86.0%#1/182026-03-15source ↗
02MMLU-ProReasoning · Commonsense Reasoningaccuracy88.5%#5/202026-04-20source ↗
03MMMU-ProMultimodal · Visual Question Answeringaccuracy73.8%#5/52026-03-15source ↗
04SWE-bench VerifiedAgentic AI · SWE-benchresolve-rate78.8%#8/81source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where Qwen3.6 Plus actually performs.

Multimodal
2
benchmarks
avg rank #3.0
Reasoning
1
benchmark
avg rank #5.0
Agentic AI
1
benchmark
avg rank #8.0
§ 04 · Related models

Other Alibaba Cloud models scored on Codesota.

Qwen3-Coder 480B A35B
2 results
Qwen3.5-397B-A17B
2 results
Qwen3 Max
1 result
Qwen3.5-122B-A10B
1 result
Qwen3.5-27B
1 result
Qwen3.5-35B-A3B
1 result
§ 05 · Sources & freshness

Where these numbers come from.

llm-stats.com
1
result
llm-stats
1
result
artificialanalysis.ai
1
result
editorial
1
result
3 of 4 rows marked verified. · first result 2026-03-15, latest 2026-04-20.