Unknown
mmmu is a state-of-the-art machine learning benchmark indexed on Codesota. This page tracks published model results, top scores per metric, and the SOTA timeline for mmmu.
Higher is better
| Rank | Model | Source | Score | Year | Paper |
|---|---|---|---|---|---|
| 1 | InternVL3-78B MMMU val. InternVL3-78B. Table 2. arxiv:2501.12891 | Community | 73.3 | 2026 | Source |
| 2 | Gemini 2.0 Flash MMMU val. Gemini 2.0 Flash. Technical report. | Community | 71.9 | 2026 | Source |
| 3 | Qwen2.5-VL 72B MMMU val. Qwen2.5-VL 72B. Table 2. arxiv:2502.13923 | Community | 70.2 | 2026 | Source |
| 4 | GPT-4o MMMU val. GPT-4o system card Table 1. arxiv:2410.21276 | Community | 69.1 | 2026 | Source |
| 5 | Claude 3.5 Sonnet MMMU val. Claude 3.5 Sonnet (Oct 2024). Anthropic model card. | Community | 68.3 | 2026 | Source |
| 6 | InternVL2-76B MMMU val. InternVL2-76B. Table 10. arxiv:2404.16821 | Community | 67.4 | 2026 | Source |
| 7 | Qwen2-VL 72B MMMU val. Qwen2-VL 72B. Table 6. arxiv:2409.12191 | Community | 64.5 | 2026 | Source |
| 8 | Gemini 1.5 Pro MMMU val. Table 5. Gemini 1.5 paper arxiv:2403.05530 | Community | 62.2 | 2026 | Source |
| 9 | Llama 3.2 Vision 90B MMMU val. Llama 3.2 Vision 90B. Table 3. arxiv:2407.21783 | Community | 60.3 | 2026 | Source |
| 10 | Claude 3 Opus MMMU val. 0-shot. Anthropic Claude 3 family model card. March 2024. | Community | 59.4 | 2026 | Source |
| 11 | GPT-4V MMMU val. 0-shot. MMMU benchmark paper Table 1. Source cross-referenced with GPT-4 Technical Report. | Community | 56.8 | 2026 | Source |