Offline RL, 2020, en

D4RL: Datasets for Deep Data-Driven Reinforcement Learning (halfcheetah-medium-v2)

A canonical offline RL benchmark dataset from D4RL. The halfcheetah-medium-v2 dataset contains 1M transitions collected by a medium-performance SAC policy on the HalfCheetah locomotion task. Scores are reported as normalized returns, where 0 corresponds to a random policy and 100 to an expert SAC policy.
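
The normalization described above can be sketched as a linear rescaling of the raw episode return. This is a minimal illustration, not D4RL's own code: the function name `normalized_score` and the reference returns used in the example are hypothetical placeholders, and the actual random and expert reference returns for HalfCheetah should be taken from D4RL itself.

```python
def normalized_score(raw_return: float, random_return: float, expert_return: float) -> float:
    """Rescale a raw return so that random_return maps to 0 and expert_return to 100."""
    if expert_return == random_return:
        raise ValueError("expert_return and random_return must differ")
    return 100.0 * (raw_return - random_return) / (expert_return - random_return)


# Hypothetical reference returns, chosen only to make the arithmetic easy to follow.
# They are NOT the D4RL reference values for HalfCheetah.
RANDOM_RETURN = 0.0
EXPERT_RETURN = 200.0

print(normalized_score(50.0, RANDOM_RETURN, EXPERT_RETURN))  # 25.0
```

Because the mapping is linear and not clipped, a policy that beats the expert scores above 100 and one that does worse than random scores below 0.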

No benchmark results indexed for this dataset yet.
