Arithmetic Reasoning | 2016 | en
Math Word Problem Repository
3,320 arithmetic word problems from various sources, testing basic arithmetic reasoning.
Metrics: accuracy
Current State of the Art
GPT-4o
OpenAI
97.2
accuracy
Top Models Performance Comparison
Top 3 models ranked by accuracy
Best Score: 97.2
Top Model: GPT-4o
Models Compared: 3
Score Range: 3.1
accuracy (primary metric)
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | GPT-4o (OpenAI, API) | 97.2 | | Dec 2025 |
| 2 | Claude 3.5 Sonnet (Anthropic, API) | 95.8 | | Dec 2025 |
| 3 | Llama 3 70B (Meta, Open Source) | 94.1 | | Dec 2025 |
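
The summary figures on this page (best score, top model, models compared, score range) follow directly from the three table rows. As a minimal sketch, assuming the score range is simply the highest score minus the lowest, they could be recomputed as below. The `summarize` helper and the dictionary layout are hypothetical names chosen for illustration and are not part of the leaderboard itself.

```python
# Accuracy scores copied from the leaderboard table.
scores = {
    "GPT-4o": 97.2,
    "Claude 3.5 Sonnet": 95.8,
    "Llama 3 70B": 94.1,
}


def summarize(scores):
    """Recompute the summary figures from a model -> accuracy mapping.

    Assumption: "Score Range" means best score minus worst score.
    """
    # Rank models from highest to lowest accuracy.
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    top_model, best = ranked[0]
    worst = ranked[-1][1]
    return {
        "top_model": top_model,
        "best_score": best,
        "models_compared": len(ranked),
        # Round to one decimal to avoid floating-point noise in the difference.
        "score_range": round(best - worst, 1),
    }


summary = summarize(scores)
print(summary)
```

With the three rows above, this gives a range of 97.2 − 94.1 = 3.1, which matches the Score Range shown on the page.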