gpqa

Unknown

OCR benchmark

4
Total Results
4
Models Tested
1
Metrics
2025-12-21
Last Updated

accuracy

Higher is better

RankModelScoreSource
1o1-preview

Graduate-level Google-Proof Q&A. PhD-level science questions.

78openai-blog
2claude-35-sonnet59.4anthropic-blog
3gpt-4o53.6openai-blog
4gemini-15-pro46.2google-blog

Explore More OCR Content