MBPP contains 974 crowd-sourced Python programming problems suitable for beginners, covering programming fundamentals and the standard library.
Pass@1, the share of problems for which a model's first generated solution passes all test cases, is the reported evaluation metric for MBPP. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher scores are better.
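As a rough illustration of how a pass@1 score might be computed from raw evaluation results, the sketch below uses the commonly cited unbiased pass@k estimator, which reduces to the fraction of correct samples when k = 1. The function names `pass_at_k` and `benchmark_pass_at_1` and the `(n_samples, n_correct)` input layout are assumptions made for this example. They do not describe how Codesota or any listed source computes its numbers.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n samples, c of which are correct.

    Uses 1 - C(n - c, k) / C(n, k); for k = 1 this equals c / n.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples: any k-subset contains a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_1(results: list[tuple[int, int]]) -> float:
    """Average per-problem pass@1 over (n_samples, n_correct) pairs, as a percentage.

    The input layout is hypothetical, chosen only for this sketch.
    """
    if not results:
        raise ValueError("results must not be empty")
    per_problem = [pass_at_k(n, c, 1) for n, c in results]
    return 100.0 * sum(per_problem) / len(per_problem)


if __name__ == "__main__":
    # Four problems, one sample each; three of the four samples pass their tests.
    print(benchmark_pass_at_1([(1, 1), (1, 0), (1, 1), (1, 1)]))
```

With a single sample per problem, the score is simply the percentage of problems solved, which matches the scale of the figures in the table.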
| Rank | Model | Trust | Score (pass@1, %) | Year | Source |
|---|---|---|---|---|---|
| 01 | o4-mini | verified | 94.9 | 2026 | Source ↗ |
| 02 | o3-mini | verified | 93.3 | 2026 | Source ↗ |
| 03 | Claude Opus 4 | verified | 92 | 2026 | Source ↗ |
| 04 | Claude 3.5 Sonnet (Oct 2024) | verified | 91 | 2024 | Source ↗ |
| 05 | GPT-4.1 | verified | 90.9 | 2026 | Source ↗ |
| 06 | Qwen2.5-Coder 32B | verified | 90.2 | 2024 | Source ↗ |
| 07 | Qwen2.5-Coder-32B-Instruct | verified | 90.2 | 2024 | Source ↗ |
| 08 | Claude Sonnet 4 | verified | 89.6 | 2026 | Source ↗ |
| 09 | DeepSeek-Coder-V2-Instruct | verified | 89.4 | 2024 | Source ↗ |
| 10 | DeepSeek-V3 | verified | 89.3 | 2026 | Source ↗ |
| 11 | Claude 3.5 Sonnet | unverified | 89.2 | 2025 | Source ↗ |
| 12 | claude-35-sonnet | paper | 89.2 | 2025 | Source ↗ |
| 13 | GPT-4o | unverified | 87.8 | 2025 | Source ↗ |
| 14 | GPT-4o (Aug 2024) | verified | 86.8 | 2024 | Source ↗ |
| 15 | Qwen2.5-Coder-7B-Instruct | verified | 83.5 | 2024 | Source ↗ |
| 16 | Codestral 22B v0.1 | verified | 78.2 | 2024 | Source ↗ |
| 17 | Llama 4 Maverick (17B-128E) | verified | 77.6 | 2025 | Source ↗ |
| 18 | Llama-4-Maverick | verified | 77.6 | 2025 | Source ↗ |
| 19 | Codestral 22B | verified | 75.4 | 2024 | Source ↗ |
| 20 | Gemma-3-27b | verified | 74.4 | 2025 | Source ↗ |
| 21 | Gemma 3 27B IT | verified | 74.4 | 2025 | Source ↗ |
| 22 | Gemma 3 12B IT | verified | 73 | 2025 | Source ↗ |
| 23 | Llama 4 Scout (17B-16E) | verified | 67.8 | 2025 | Source ↗ |
| 24 | Llama-4-Scout | verified | 67.8 | 2025 | Source ↗ |
| 25 | Gemma 3 4B IT | verified | 63.2 | 2025 | Source ↗ |
| 26 | Code Llama 34B | verified | 62.6 | 2026 | Source ↗ |
| 27 | StarCoder2 15B | verified | 54.4 | 2024 | Source ↗ |