Logical Reasoning2025en
Abstraction and Reasoning Corpus for AGI (v2)
Harder successor to ARC-AGI-1, released 2025. Designed to be more resistant to test-time compute scaling. Scores reported as % on public evaluation set.
Current State of the Art
Gemini 2.5 Pro
5
accuracy
Top Models Performance Comparison
Top 3 models ranked by accuracy
Best Score
5.0
Top Model
Gemini 2.5 Pro
Models Compared
3
Score Range
2.0
accuracyPrimary
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | Gemini 2.5 ProAPI Google | 5 | Mar 2026 | |
| 2 | o3API OpenAI | 4 | Mar 2026 | |
| 3 | o4-miniAPI OpenAI | 3 | Mar 2026 |