Suite of 57 Atari 2600 games. Standard benchmark for deep reinforcement learning agents.
12 results indexed across 1 metric. Shaded row marks current SOTA; ties broken by submission date.
| # | Model | Org | Submitted | Paper / code | Human-normalized score (%) |
|---|---|---|---|---|---|
| 01 | Go-Explore (OSS) | Uber AI | Dec 2025 | nature-paper | 40000 |
| 02 | LBC (OSS) | Tsinghua University / Baidu | — | source | 10078 |
| 03 | Agent57 (OSS) | DeepMind | Dec 2025 | deepmind-research | 4731.30 |
| 04 | MEME (OSS) | Google DeepMind | — | source | 4087 |
| 05 | Disco57 (OSS) | Google DeepMind | — | source | 1386 |
| 06 | BBOS-1 (OSS) | — | Dec 2025 | — | 1100 |
| 07 | GDI-H3 (OSS) | Research | Dec 2025 | — | 950 |
| 08 | DreamerV3 (OSS) | Google DeepMind | Dec 2025 | arxiv-paper | 840 |
| 09 | MuZero (OSS) | DeepMind | Dec 2025 | nature-paper | 731 |
| 10 | Rainbow DQN (OSS) | DeepMind | Dec 2025 | aaai-paper | 231 |
| 11 | Human Professional | Biology | Dec 2025 | baseline | 100 |
| 12 | DQN (Human-level) (OSS) | DeepMind | Dec 2025 | nature-paper | 79 |
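
For readers unfamiliar with the metric, the sketch below shows how a human-normalized score is conventionally computed in the Atari literature: the agent's raw game score is rescaled so that a random policy maps to 0 and the human baseline maps to 100. The function names and the raw scores are hypothetical, and this page does not state whether its single number is a mean or a median over the 57 games, so both aggregates are shown as assumptions.

```python
from statistics import mean, median


def human_normalized_score(agent: float, random: float, human: float) -> float:
    """Per-game human-normalized score in percent (random = 0, human = 100)."""
    if human == random:
        raise ValueError("human and random baselines must differ")
    return 100.0 * (agent - random) / (human - random)


def aggregate(per_game_scores: list[float], how: str = "median") -> float:
    """Collapse per-game scores into one number.

    Assumption: the leaderboard does not say which aggregate it uses,
    so the caller picks "mean" or "median" explicitly.
    """
    if how == "mean":
        return mean(per_game_scores)
    if how == "median":
        return median(per_game_scores)
    raise ValueError(f"unknown aggregate: {how}")


# Hypothetical raw scores for three games: (agent, random, human).
raw = [(550.0, 100.0, 1000.0), (1000.0, 100.0, 1000.0), (1900.0, 100.0, 1000.0)]
per_game = [human_normalized_score(a, r, h) for a, r, h in raw]
print(per_game)
print(aggregate(per_game, "median"), aggregate(per_game, "mean"))
```

The mean is sensitive to a few games with very large scores, while the median is not, which is one reason reported numbers for the same agent can differ widely between sources.
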
Each row below marks a model that set a new record on human-normalized-score. Intermediate submissions remain in the leaderboard above; only record-setting entries are re-listed here.
Higher scores are better, so each listed entry improved on the previous best.
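
The selection rule described above can be sketched as a running maximum over submissions in chronological order. This is a minimal illustration, not the site's actual pipeline: the `Entry` type, the `record_setters` function, and the sample data are all hypothetical, and it assumes that a tie leaves the record with the earlier submission.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    model: str
    submitted: str  # ISO date string, so lexical order equals date order
    score: float


def record_setters(entries: list[Entry]) -> list[Entry]:
    """Return the entries that strictly beat every earlier submission."""
    records: list[Entry] = []
    best = float("-inf")
    for entry in sorted(entries, key=lambda e: e.submitted):
        # Strict comparison: an equal score does not displace the
        # earlier record holder (assumed tie-break rule).
        if entry.score > best:
            records.append(entry)
            best = entry.score
    return records


# Hypothetical submissions, not taken from the leaderboard.
submissions = [
    Entry("model-c", "2021-03-01", 300.0),
    Entry("model-a", "2019-01-15", 100.0),
    Entry("model-b", "2020-06-30", 90.0),   # intermediate, never a record
    Entry("model-d", "2022-09-10", 300.0),  # tie with model-c, not a record
]
for e in record_setters(submissions):
    print(e.model, e.score)
```
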
Submit a checkpoint and a reproduction script. We will run it, publish the score, and, if it takes the top spot, annotate the step on the progress chart with your name.