gpqa

Unknown

OCR benchmark

Total Results: 17
Models Tested: 17
Metrics: 1
Last Updated: 2026-03-06

accuracy

Higher is better

Rank  Model             Score  Source
1     o3                82.8   openai-simple-evals
2     o4-mini           77.6   openai-simple-evals
3     o1                75.7   openai-simple-evals
4     o3-mini           74.9   openai-simple-evals
5     o1-preview        73.3   openai-simple-evals
6     gpt-45-preview    69.5   openai-simple-evals
7     gpt-41            66.3   openai-simple-evals
8     o1-mini           60     openai-simple-evals
9     claude-35-sonnet  59.4   openai-simple-evals
10    grok-2            56     openai-simple-evals
11    llama-31-405b     50.7   openai-simple-evals
12    claude-3-opus     50.4   openai-simple-evals
13    gpt-4o            49.9   openai-simple-evals
14    gpt-4-turbo       49.3   openai-simple-evals
15    gemini-15-pro     46.2   openai-simple-evals
16    llama-31-70b      41.7   openai-simple-evals
17    gpt-4o-mini       40.2   openai-simple-evals
