Home/OCR/Benchmarks/humaneval

humaneval

Unknown

OCR benchmark

5
Total Results
5
Models Tested
1
Metrics
2025-12-21
Last Updated

pass@1

Higher is better

RankModelScoreSource
1o1-preview

Classic Python code generation benchmark.

92.4openai-blog
2claude-35-sonnet92anthropic-blog
3gpt-4o90.2openai-blog
4deepseek-v382.6deepseek-blog
5llama-3-70b81.7meta-blog

Explore More OCR Content