Who leads the LIBERO-Long benchmark?

MolmoAct2-Think currently leads LIBERO-Long with a score of 98.1 on Success Rate.

What is the state-of-the-art score on LIBERO-Long?

The state-of-the-art result on LIBERO-Long is 98.1 (Success Rate), achieved by MolmoAct2-Think as of 2026.

How many models are tracked on LIBERO-Long?

Codesota tracks 7 models on LIBERO-Long across 2 metrics.

When was the LIBERO-Long leaderboard last updated?

The LIBERO-Long leaderboard on Codesota includes results through 2026, with the earliest tracked result from 2024.

Codesota · Benchmark · LIBERO-LongHome/Leaderboards/LIBERO-Long

Unknown

LIBERO-Long.

Name: LIBERO-Long Benchmark Results
Creator: Unknown
Published: 2024-01-01
License: https://creativecommons.org/licenses/by/4.0/

LIBERO-Long (also called LIBERO-10) is one of four task suites in the LIBERO benchmark for lifelong robot learning. It contains 10 long-horizon manipulation tasks requiring multi-step reasoning and diverse object/spatial/goal knowledge. Reported as success rate (%).

Paper ↗Leaderboard ↓

§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

Found a wrong score or missing run?

Use row edits to send a sourced correction into moderation.

Add / edit result ↗Report issue ↗

Success Rate

Success Rate is the reported evaluation metric for LIBERO-Long. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Success Rateverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	MolmoAct2-Think	unverified	98.1	2026	Paper ↗Code ↗	Looks wrong?
02	MolmoAct2	unverified	97.2	2026	Paper ↗Code ↗	Looks wrong?
03	UD-VLA	unverified	92.7	2025	Paper ↗Code ↗	Looks wrong?
04	SmolVLA (2.25B)	unverified	88.75	2025	Paper ↗Code ↗	Looks wrong?
05	OpenVLA	unverified	76.5	2024	Paper ↗Code ↗	Looks wrong?

Success Rate

Higher is better

Trust tiers for Success Rateverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	π0 (Pi-Zero) LIBERO-Long success rate for π0 fine-tuned model. seed — verify	paper	85.2	2026	Source ↗	Looks wrong?
02	OpenVLA LIBERO-Long success rate, OpenVLA paper (Kim et al. 2024). seed — verify	paper	53.7	2026	Source ↗	Looks wrong?
03	Octo-Base LIBERO-Long (LIBERO-10) success rate, reported in OpenVLA paper Table comparing LIBERO performance.	paper	51.1	2026	Source ↗	Looks wrong?

§ 04 · Submit a result

Add to the leaderboard.

← Back to Leaderboards