MBPP

Unknown

974 crowd-sourced Python programming problems suitable for beginners. Covers programming fundamentals and standard library.

Benchmark Stats

Models2
Papers2
Metrics1

SOTA History

Not enough data to show trend.

Only 2 models on this benchmark

Help build the community leaderboard — submit your model results.

pass@1

pass@1

Higher is better

RankModelSourceScoreYearPaper
1claude-35-sonnetEditorial89.22025Source
2gpt-4oEditorial87.82025Source

Submit a Result