Code Generation2021python

Mostly Basic Python Problems

974 crowd-sourced Python programming problems suitable for beginners. Covers programming fundamentals and standard library.

Metrics:pass@1, pass@10
Paper / WebsiteDownload
Current State of the Art

Claude 3.5 Sonnet

Anthropic

89.2

pass@1

pass@1Primary

#ModelScorePaper / CodeDate
1
Claude 3.5 SonnetAPI
Anthropic
89.2Dec 2025
2
GPT-4oAPI
OpenAI
87.8Dec 2025

Other Code Generation Datasets

MBPP Benchmark - Code Generation | CodeSOTA