Reinforcement Learning
Training agents to make decisions? Benchmark your policies on game playing, continuous control, and offline learning tasks.
Reinforcement learning trains agents to make sequential decisions through interaction with environments. From game-playing breakthroughs to robotics control and RLHF for LLM alignment, RL has become a foundational technique across AI, though sample efficiency and sim-to-real transfer remain key challenges.
Tasks & Benchmarks
State of the Field (2025)
- RLHF and RLVR (RL with Verifiable Rewards) are now standard for LLM alignment and reasoning: DeepSeek-R1, OpenAI o3, and Claude use RL-based training to improve instruction following and chain-of-thought reasoning
- Offline RL matured significantly: Decision Transformer, IQL, and Cal-QL enable learning from static datasets without environment interaction, critical for healthcare, finance, and robotics where online exploration is costly or dangerous
- Multi-agent RL scaled to complex coordination: OpenAI Five (Dota 2), DeepMind's AlphaStar (StarCraft II) demonstrated superhuman team coordination, while MAPPO and QMIX provide practical frameworks for cooperative multi-agent problems
- Sim-to-real transfer improved through domain randomization and system identification, but reliable zero-shot transfer to real robots remains unsolved for contact-rich manipulation tasks
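The offline-RL methods listed above avoid querying actions the dataset never saw. IQL's core trick is expectile regression: fitting the value function with an asymmetric squared loss so it approximates the best in-dataset action value. A minimal sketch of that loss (function name and scalar formulation are ours, for illustration):

```python
def expectile_loss(diff, tau=0.7):
    # Asymmetric squared loss on diff = Q(s, a) - V(s).
    # Positive errors are weighted by tau, negative by (1 - tau);
    # with tau > 0.5, V is pushed toward an upper expectile of Q,
    # approximating a max over dataset actions without ever
    # evaluating out-of-distribution actions.
    weight = tau if diff > 0 else (1.0 - tau)
    return weight * diff ** 2
```

At tau = 0.5 this reduces to ordinary (halved) squared-error regression; raising tau toward 1 makes the value estimate increasingly optimistic about actions the dataset actually contains.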
Quick Recommendations
Game playing and simulation benchmarks
PPO or SAC with vectorized environments
PPO provides robust on-policy training for discrete and continuous action spaces; SAC offers better sample efficiency for continuous control. Both are well supported in Stable-Baselines3 and CleanRL.
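PPO's robustness comes from its clipped surrogate objective, which keeps each policy update close to the policy that collected the data. A per-sample sketch (function name is ours; libraries like Stable-Baselines3 compute this over batches of log-probability ratios):

```python
def ppo_clip_loss(ratio, advantage, eps=0.2):
    # Clipped surrogate objective for one sample.
    # ratio = pi_new(a|s) / pi_old(a|s); clipping to [1-eps, 1+eps]
    # removes the incentive to move the policy far from the
    # behavior policy in a single update.
    clipped = max(min(ratio, 1.0 + eps), 1.0 - eps)
    # Loss to minimize is the negative of the pessimistic objective.
    return -min(ratio * advantage, clipped * advantage)
```

The pessimistic `min` is what makes clipping work: the unclipped term is used only when it is the worse (lower) of the two, so large ratio moves never get extra credit.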
Robotics control (MuJoCo, real-world)
SAC for simulation, offline RL (IQL/Cal-QL) for real-world
SAC's entropy regularization provides robust exploration in simulation. For real robots, offline RL learns from demonstration data without risky online exploration.
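SAC's entropy regularization enters through the soft Bellman target: the critic's bootstrap target includes a bonus for policy entropy, so stochastic (exploratory) policies are rewarded directly. A minimal sketch for a single transition (function name is ours; in practice SAC uses twin critics and often auto-tunes alpha):

```python
def soft_q_target(reward, next_q, next_log_prob, gamma=0.99, alpha=0.2):
    # Soft Bellman target for one transition.
    # -alpha * log pi(a'|s') is the entropy bonus: low-probability
    # (exploratory) actions raise the target, so the critic values
    # states where the policy stays stochastic.
    return reward + gamma * (next_q - alpha * next_log_prob)
```

Setting alpha to 0 recovers the standard (entropy-free) Bellman target, which is one way to see why SAC degrades gracefully as the temperature is annealed.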
LLM alignment and reasoning improvement
RLHF with PPO or DPO (Direct Preference Optimization)
PPO-based RLHF remains the standard for frontier models. DPO simplifies the pipeline by eliminating the reward model, achieving comparable results with less infrastructure.
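DPO eliminates the reward model by scoring each preference pair directly with log-probability ratios against a frozen reference policy. A per-pair sketch (function name and scalar inputs are ours; libraries like TRL batch this over tokenized completions):

```python
import math

def dpo_loss(chosen_logratio, rejected_logratio, beta=0.1):
    # DPO loss for one preference pair. Each logratio is
    # log pi_theta(y|x) - log pi_ref(y|x) for the chosen or
    # rejected completion; the implicit reward is beta * logratio,
    # so no separately trained reward model is needed.
    margin = beta * (chosen_logratio - rejected_logratio)
    return -math.log(1.0 / (1.0 + math.exp(-margin)))  # -log sigmoid(margin)
```

When the policy assigns the same ratio to both completions the margin is zero and the loss is log 2; the gradient then pushes probability mass toward the chosen completion and away from the rejected one.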
Multi-agent coordination
MAPPO or QMIX
MAPPO scales PPO to multi-agent settings with centralized training and decentralized execution. QMIX provides value decomposition for cooperative tasks. Both handle partial observability.
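QMIX's value decomposition rests on a monotonicity constraint: the joint value must be a monotonically increasing function of each agent's individual Q-value, so the greedy joint action decomposes into per-agent greedy actions. A toy linear mixer illustrating the constraint (in the real method the weights come from a hypernetwork conditioned on global state; this simplification is ours):

```python
def qmix_total(agent_qs, weights, bias=0.0):
    # Toy QMIX-style mixer: combines per-agent Q-values with
    # non-negative weights (enforced here via abs, as QMIX does),
    # guaranteeing d(Q_total)/d(Q_i) >= 0. Monotonicity means
    # argmax of Q_total equals the tuple of per-agent argmaxes,
    # enabling decentralized execution after centralized training.
    return sum(abs(w) * q for w, q in zip(weights, agent_qs)) + bias
```

Because each agent can maximize its own Q-value independently at execution time, no inter-agent communication is needed once training ends.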
Atari Games
Continuous Control
Offline RL
Honest Takes
RL's biggest impact is inside LLMs, not robotics
The RL community spent decades on game playing and robot control, yet the technology's largest real-world impact turned out to be RLHF for language model alignment. DeepSeek-R1-Zero showed that RL alone, without supervised fine-tuning, can teach models to reason. This is where RL delivers the most value today.
Sample efficiency is still embarrassing
State-of-the-art RL agents need millions of environment interactions to learn tasks a human figures out in minutes. Offline RL and world models help, but the fundamental sample efficiency gap means RL remains impractical for most real-world applications without simulation.
Sim-to-real is the real bottleneck for robotics RL
Papers show impressive MuJoCo results that fail on real hardware. Domain randomization helps but doesn't solve contact dynamics, sensor noise, and actuator delays. Until sim-to-real transfer is reliable, RL for physical robots will remain a research endeavor for most teams.