Suite of 57 Atari 2600 games. Standard benchmark for deep reinforcement learning agents.
12 results indexed across 1 metric. Shaded row marks current SOTA; ties broken by submission date.
| # | Model | Org | Submitted | Paper / code | human-normalized-score |
|---|---|---|---|---|---|
| 01 | Go-Explore | Uber AI | Dec 2025 | nature-paper | 40000 |
| 02 | LBC | Tsinghua University / Baidu | — | source | 10078 |
| 03 | Agent57 | DeepMind | Dec 2025 | deepmind-research | 4731.30 |
| 04 | MEME | Google DeepMind | — | source | 4087 |
| 05 | Disco57 | Google DeepMind | — | source | 1386 |
| 06 | BBOS-1 | — | Dec 2025 | — | 1100 |
| 07 | GDI-H3 | Research | Dec 2025 | — | 950 |
| 08 | DreamerV3 | Google DeepMind | Dec 2025 | arxiv-paper | 840 |
| 09 | MuZero | DeepMind | Dec 2025 | nature-paper | 731 |
| 10 | Rainbow DQN | DeepMind | Dec 2025 | aaai-paper | 231 |
| 11 | Human Professional | Biology | Dec 2025 | baseline | 100 |
| 12 | DQN (Human-level) | DeepMind | Dec 2025 | nature-paper | 79 |
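The table does not say how human-normalized-score is computed. In the Atari literature, a per-game score is conventionally rescaled so that a random policy maps to 0 and the human baseline maps to 100, and the per-game values are then aggregated across the suite. The sketch below assumes that convention. The game names and raw scores are hypothetical, and whether this leaderboard reports the mean or the median is not stated here, so both are shown.

```python
from statistics import mean, median


def human_normalized_score(agent: float, random: float, human: float) -> float:
    """Return the agent's score as a percentage of the random-to-human gap."""
    if human == random:
        raise ValueError("human and random baselines must differ")
    return 100.0 * (agent - random) / (human - random)


# Hypothetical raw scores, not real benchmark data: game -> (agent, random, human)
raw_scores = {
    "game_a": (400.0, 0.0, 200.0),
    "game_b": (50.0, 10.0, 110.0),
    "game_c": (30.0, 20.0, 40.0),
}

per_game = {game: human_normalized_score(*s) for game, s in raw_scores.items()}

# The aggregation rule is an assumption; report both common choices.
summary = {
    "mean": mean(per_game.values()),
    "median": median(per_game.values()),
}
print(per_game)
print(summary)
```

The mean and the median can diverge sharply, because a few games with very large normalized scores dominate the mean. This matters when comparing entries that may have been reported under different aggregation rules.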
Each row below marks a model that broke the previous record on human-normalized-score. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.
Higher scores are better. Each entry listed improved on the best score recorded before it.
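The record-setting rule described here amounts to a running maximum over submissions in chronological order. The following is a minimal sketch of that filter, not the site's actual implementation. The `Entry` type, the model names, the dates, and the scores are all hypothetical. It also assumes that a tie leaves the earlier submission as the record holder, which is one reading of "ties broken by submission date".

```python
from datetime import date
from typing import NamedTuple


class Entry(NamedTuple):
    model: str
    submitted: date
    score: float


def record_setters(entries: list[Entry]) -> list[Entry]:
    """Return the entries that strictly beat every earlier submission, in date order."""
    best = float("-inf")
    records = []
    # sorted() is stable, so same-day entries keep their input order.
    for entry in sorted(entries, key=lambda e: e.submitted):
        if entry.score > best:  # assumption: a tie does not displace the earlier holder
            records.append(entry)
            best = entry.score
    return records


# Hypothetical submissions, not the leaderboard's data.
entries = [
    Entry("model_a", date(2020, 1, 1), 80.0),
    Entry("model_b", date(2021, 1, 1), 230.0),
    Entry("model_c", date(2022, 1, 1), 150.0),  # below the record, skipped
    Entry("model_d", date(2023, 1, 1), 230.0),  # ties the record, skipped
    Entry("model_e", date(2024, 1, 1), 700.0),
]

print([e.model for e in record_setters(entries)])
# ['model_a', 'model_b', 'model_e']
```

Entries such as `model_c` would still appear in the full leaderboard; they are only omitted from the record progression.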
Submit a checkpoint and a reproduction script. We will run the script, publish the score, and, if it takes the top spot, annotate the step on the progress chart with your name.