Continuous Control | 2012 | n/a

Multi-Joint dynamics with Contact

Physics-based continuous control benchmark. Evaluated on 15 DMControl tasks; metric is mean normalized score (0=random, 1000=expert) at 1M environment steps.
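As a rough illustration of the metric described above, the sketch below assumes a linear rescaling that maps a random-policy reference to 0 and an expert reference to 1000, then averages over tasks. The rescaling formula, the helper name `normalize`, the task names, and all numbers are hypothetical placeholders, not values from the benchmark or the TD-MPC2 paper.

```python
from statistics import mean


def normalize(raw, random_ref, expert_ref):
    """Linearly rescale a raw return so random_ref -> 0 and expert_ref -> 1000.

    This linear form is an assumption; the page only states the two endpoints.
    """
    return 1000.0 * (raw - random_ref) / (expert_ref - random_ref)


# Hypothetical per-task numbers: (raw return, random reference, expert reference).
example_tasks = {
    "task_a": (450.0, 50.0, 850.0),
    "task_b": (900.0, 0.0, 1000.0),
}

# Normalize each task, then average across tasks to get one benchmark score.
normalized = {name: normalize(*vals) for name, vals in example_tasks.items()}
benchmark_score = mean(normalized.values())
print(normalized, benchmark_score)
```

Averaging after normalization keeps tasks with large raw return scales from dominating the aggregate.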

Metrics: average-return
Current State of the Art

TD-MPC2 (317M params)

UC San Diego

960

average-return

average-return Progress Over Time

Showing 5 breakthroughs from Jan 2018 to Oct 2023

[Chart: average-return (y-axis, 758.7 to 978.3) plotted against Date (x-axis, Jan 2018 to Oct 2023)]

Key Milestones

Jan 2018: SAC (state-based)
SAC (state-based). Mean normalized score across DMControl tasks. Classic baseline from TD-MPC2 Table 1.
Score: 777.0

Jul 2021: DrQ-v2
DrQ-v2, pixel-based. Mean normalized score across 15 DMControl tasks, 1M steps. From TD-MPC2 Table 1.
Score: 799.0 (+2.8%)

Mar 2022: TD-MPC
TD-MPC (original). Mean normalized score across DMControl tasks, 1M steps. ICML 2022 baseline from TD-MPC2 paper.
Score: 857.0 (+7.3%)

Jan 2023: DreamerV3
DreamerV3. Mean normalized score across 15 DMControl tasks, 1M steps. From TD-MPC2 Table 1 comparison.
Score: 897.0 (+4.7%)

Oct 2023: TD-MPC2 (317M params) (Current SOTA)
TD-MPC2, 317M-param shared model. Mean normalized score across 15 DMControl tasks, 1M steps. ICLR 2024.
Score: 960.0 (+7.0%)

Total Improvement: 23.6%
Time Span: 5y 10m
Breakthroughs: 5
Current SOTA: 960.0
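The percentage gains listed with the milestones can be reproduced from the scores themselves. The sketch below assumes each gain is measured relative to the previous milestone, and the total relative to the first; the helper name `pct_gain` is illustrative.

```python
# Milestone scores as listed above, in chronological order.
milestones = [
    ("SAC (state-based)", 777.0),
    ("DrQ-v2", 799.0),
    ("TD-MPC", 857.0),
    ("DreamerV3", 897.0),
    ("TD-MPC2 (317M params)", 960.0),
]


def pct_gain(old, new):
    """Relative improvement of `new` over `old`, in percent."""
    return 100.0 * (new - old) / old


# Gain of each milestone over the one before it, rounded to one decimal.
step_gains = [
    round(pct_gain(prev[1], cur[1]), 1)
    for prev, cur in zip(milestones, milestones[1:])
]

# Gain of the latest milestone over the earliest one.
total_gain = round(pct_gain(milestones[0][1], milestones[-1][1]), 1)

print(step_gains)  # [2.8, 7.3, 4.7, 7.0]
print(total_gain)  # 23.6
```

Note that the step gains do not sum to the total, because each step uses a different baseline.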

Top Models Performance Comparison

Top 9 models ranked by average-return

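| Rank | Model | average-return | % of best |
|---|---|---|---|
| 1 | TD-MPC2 (317M params) | 960.0 | 100.0% |
| 2 | TD-MPC2 (19M params) | 953.0 | 99.3% |
| 3 | FOWM | 945.0 | 98.4% |
| 4 | BRO | 941.0 | 98.0% |
| 5 | TD-MPC2 (5M params) | 929.0 | 96.8% |
| 6 | DreamerV3 | 897.0 | 93.4% |
| 7 | TD-MPC | 857.0 | 89.3% |
| 8 | DrQ-v2 | 799.0 | 83.2% |
| 9 | SAC (state-based) | 777.0 | 80.9% |
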
Best Score: 960.0
Top Model: TD-MPC2 (317M par...
Models Compared: 9
Score Range: 183.0
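The "% of best" figures and the score range above follow directly from the nine listed scores. A minimal sketch of that arithmetic, assuming "% of best" means each score divided by the top score and "Score Range" means best minus worst:

```python
# average-return scores for the nine listed models.
scores = {
    "TD-MPC2 (317M params)": 960.0,
    "TD-MPC2 (19M params)": 953.0,
    "FOWM": 945.0,
    "BRO": 941.0,
    "TD-MPC2 (5M params)": 929.0,
    "DreamerV3": 897.0,
    "TD-MPC": 857.0,
    "DrQ-v2": 799.0,
    "SAC (state-based)": 777.0,
}

best = max(scores.values())
worst = min(scores.values())

# Each model's score as a percentage of the best score, one decimal place.
pct_of_best = {name: round(100.0 * s / best, 1) for name, s in scores.items()}

# Spread between the best and worst listed scores.
score_range = best - worst

for name, pct in pct_of_best.items():
    print(f"{name}: {pct}%")
print("range:", score_range)
```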

average-return (Primary)

| # | Model | Affiliation | Score | Paper / Code | Date |
|---|---|---|---|---|---|
| 1 | TD-MPC2 (317M params) | UC San Diego | 960 | Open Source | Mar 2026 |
| 2 | TD-MPC2 (19M params) | UC San Diego | 953 | Open Source | Mar 2026 |
| 3 | FOWM | CMU | 945 | Open Source | Mar 2026 |
| 4 | BRO | DeepMind / TU Warsaw | 941 | Open Source | Mar 2026 |
| 5 | TD-MPC2 (5M params) | UC San Diego | 929 | Open Source | Mar 2026 |
| 6 | DreamerV3 | Google DeepMind | 897 | Open Source | Mar 2026 |
| 7 | TD-MPC | UC San Diego | 857 | Open Source | Mar 2026 |
| 8 | DrQ-v2 | NYU / Google | 799 | Open Source | Mar 2026 |
| 9 | SAC (state-based) | UC Berkeley | 777 | Open Source | Mar 2026 |

MuJoCo Benchmark - Continuous Control | CodeSOTA