MBPP
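MBPP (Mostly Basic Python Problems) is a set of 974 crowd-sourced Python programming problems designed to be solvable by entry-level programmers. The problems cover programming fundamentals and standard library functionality.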
Benchmark Stats
Models: 2
Papers: 2
Metrics: 1
SOTA History
Not enough data to show a trend: only 2 models have reported results on this benchmark.
pass@1
Higher is better
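pass@1 is commonly read as the fraction of problems for which a single generated solution passes all of the problem's test cases. The sketch below assumes the widely used unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), where n is the number of samples drawn per problem and c is the number that pass. The function names `pass_at_k` and `mean_pass_at_k` are hypothetical, and this page does not state how its scores were actually computed.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int = 1) -> float:
    """Estimate pass@k for one problem from n samples, c of which passed.

    Assumed formula: 1 - C(n - c, k) / C(n, k). For k = 1 this reduces to c / n.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k failing samples: every size-k draw contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int = 1) -> float:
    """Average pass@k over problems; results holds (n, c) pairs per problem."""
    if not results:
        raise ValueError("results must not be empty")
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

With one sample per problem (n = 1), `mean_pass_at_k` is simply the share of problems solved on the first attempt, which is why a higher value is better.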