aime-2024
Math benchmark
Total Results: 3
Models Tested: 3
Metrics: 1
Last Updated: 2025-12-19
Metric: accuracy (higher is better)
| Rank | Model | Score (accuracy) | Source | Notes |
|---|---|---|---|---|
| 1 | o1-preview | 83.3 | openai-blog | American Invitational Mathematics Examination. Elite competition math. |
| 2 | claude-35-opus | 16 | anthropic-blog | |
| 3 | gpt-4o | 13.4 | openai-blog | Significant gap between o1 and GPT-4o on competition math. |