Who leads the GenEval benchmark?

Lumina-DiMOO w/ Self-GRPO currently leads GenEval with a score of 0.91 on Geneval Score.

What is the state-of-the-art score on GenEval?

The state-of-the-art result on GenEval is 0.91 (Geneval Score), achieved by Lumina-DiMOO w/ Self-GRPO as of 2026.

How many models are tracked on GenEval?

Codesota tracks 8 models on GenEval.

When was the GenEval leaderboard last updated?

The GenEval leaderboard on Codesota includes results through 2026, with the earliest tracked result from 2025.

Codesota · Benchmark · GenEvalHome/Leaderboards/Multimodal Media/Text-to-Image/GenEval

Unknown

GenEval.

Name: GenEval Benchmark Results
Creator: Unknown
Published: 2025-01-01
License: https://creativecommons.org/licenses/by/4.0/

Evaluates compositional text-to-image generation with object-level criteria

Paper ↗Leaderboard ↓

§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

Found a wrong score or missing run?

Use row edits to send a sourced correction into moderation.

Add / edit result ↗Report issue ↗

Geneval Score

Geneval Score is the reported evaluation metric for GenEval. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Geneval Scoreverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	Lumina-DiMOO w/ Self-GRPO	unverified	0.91	2025	Paper ↗Code ↗	Looks wrong?
02	BLIP3o-NEXT-GRPO-GenEval (3B)	unverified	0.91	2025	Paper ↗Code ↗	Looks wrong?
03	SenseNova-U1-A3B-MoT	unverified	0.91	2026	Paper ↗Code ↗	Looks wrong?
04	BAGEL (7B MoT) with LLM rewriter	unverified	0.88	2025	Paper ↗Code ↗	Looks wrong?
05	Emu3.5 (34B, AR)	unverified	0.86	2025	Paper ↗Code ↗	Looks wrong?
06	BLIP3-o (8B)	unverified	0.84	2025	Paper ↗Code ↗	Looks wrong?
07	AsymFLUX.2 klein	unverified	0.82	2026	Paper ↗Code ↗	Looks wrong?
08	Spectral Progressive Diffusion (PixelGen, TF)	unverified	0.78	2026	Paper ↗	Looks wrong?

§ 04 · Submit a result

Add to the leaderboard.

← Back to Text-to-Image