Mathematical Reasoning2024en
American Invitational Mathematics Examination 2024
30 challenging math problems from the 2024 AIME competition. Tests advanced mathematical reasoning.
Metrics:accuracy, pass@1
Paper / WebsiteCurrent State of the Art
o1-preview
OpenAI
83.3
accuracy
Top Models Performance Comparison
Top 3 models ranked by accuracy
Best Score
83.3
Top Model
o1-preview
Models Compared
3
Score Range
69.9
accuracyPrimary
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | o1-preview OpenAI | 83.3 | Dec 2025 | |
| 2 | Claude 3.5 Opus Anthropic | 16 | Dec 2025 | |
| 3 | GPT-4oAPI OpenAI | 13.4 | Dec 2025 |