Continuous Control

Continuous control — learning smooth motor commands in simulated physics — was transformed by MuJoCo and the OpenAI Gym suite in the mid-2010s. SAC (2018) and TD3 became reliable baselines, but the field shifted toward harder locomotion (humanoid parkour, dexterous hands) and sim-to-real transfer after DeepMind's dm_control and Isaac Gym raised the bar. DreamerV3 (2023) showed that world-model approaches can match or beat model-free methods across dozens of control tasks with a single hyperparameter set, signaling a move toward generalist RL agents.

1 dataset · 12 results · Canonical metric: average-return

Canonical benchmark: MuJoCo

Physics-based continuous control benchmark. Evaluated on 15 DMControl tasks; metric is mean normalized score (0=random, 1000=expert) at 1M environment steps.
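The normalized score above is a linear rescaling of raw return between per-task random and expert reference returns. A minimal sketch, assuming hypothetical `random_return` / `expert_return` reference values for each task (these are not part of the benchmark page):

```python
def normalized_score(ret, random_return, expert_return):
    """Map a raw episode return onto a 0-1000 scale,
    where 0 = random policy and 1000 = expert policy."""
    frac = (ret - random_return) / (expert_return - random_return)
    return 1000.0 * frac

def mean_normalized_score(results):
    """results: list of (return, random_return, expert_return),
    one tuple per task; returns the suite-level mean score."""
    scores = [normalized_score(r, lo, hi) for r, lo, hi in results]
    return sum(scores) / len(scores)

# Example: halfway to expert on one task, expert-level on another.
print(mean_normalized_score([(50.0, 0.0, 100.0), (100.0, 0.0, 100.0)]))  # 750.0
```

Scores can exceed 1000 if a policy beats the expert reference, which is why per-task clipping is sometimes applied before averaging.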

Primary metric: average-return

Top 10

Leading models on MuJoCo.

| Rank | Model | average-return | Year | Source |
|------|-------|----------------|------|--------|
| 1 | TD3 | 5592 | 2026 | paper |
| 2 | SAC | 5179 | 2026 | paper |
| 3 | PPO | 2038 | 2026 | paper |
| 4 | TD-MPC2 (317M params) | 960 | 2026 | paper |
| 5 | TD-MPC2 (19M params) | 953 | 2026 | paper |
| 6 | FOWM | 945 | 2026 | paper |
| 7 | BRO | 941 | 2026 | paper |
| 8 | TD-MPC2 (5M params) | 929 | 2026 | paper |
| 9 | DreamerV3 | 897 | 2026 | paper |
| 10 | TD-MPC | 857 | 2026 | paper |
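Average return, the leaderboard metric, is simply the undiscounted episode return averaged over evaluation rollouts. A minimal sketch, assuming a Gym-style environment with `reset()`/`step()` methods; the `DummyEnv` below is a stand-in for illustration, not any benchmark task:

```python
def average_return(env, policy, episodes=10):
    """Run `episodes` rollouts and average the undiscounted return."""
    total = 0.0
    for _ in range(episodes):
        obs = env.reset()
        done = False
        while not done:
            action = policy(obs)
            obs, reward, done = env.step(action)
            total += reward
    return total / episodes

class DummyEnv:
    """Stand-in environment: episodes last 3 steps, reward 1.0 per step."""
    def reset(self):
        self.t = 0
        return 0.0
    def step(self, action):
        self.t += 1
        return 0.0, 1.0, self.t >= 3

print(average_return(DummyEnv(), lambda obs: 0.0))  # 3.0
```

Real evaluations typically report this at a fixed training budget (e.g. 1M environment steps) so that scores are comparable across methods.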


All datasets

1 dataset tracked for this task.

Related tasks

Other tasks in Reinforcement Learning.
