LBC.

Tsinghua University / Baiduopen-sourceLearnable Behavior Control (distributed off-policy actor-critic)

First to break 24 Atari human world records within 1B frames. ICLR 2023 Oral. Hybrid behavior mapping with bandit-based meta-controller.

§ 02 · Benchmarks

Every benchmark LBC has a recorded score for.

#	Benchmark	Area · Task	Metric	Value	Rank	Date	Source
01	Atari 2600	Reinforcement Learning · Atari Games	human-normalized-score	10078.00	#2/12	—	source ↗

Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.

§ 03 · Strengths by area

Where LBC actually performs.

Reinforcement Learning

benchmark

avg rank #2.0

§ 06 · Sources & freshness

Where these numbers come from.

unknown

result

0 of 1 rows marked verified.