Evaluates compositional text-to-image generation with object-level criteria
Geneval Score is the reported evaluation metric for GenEval. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score | Year | Links | Edit |
|---|---|---|---|---|---|---|
| 01 | Lumina-DiMOO w/ Self-GRPO | unverified | 0.91 | 2025 | Paper ↗Code ↗ | Edit result |
| 02 | BLIP3o-NEXT-GRPO-GenEval (3B) | unverified | 0.91 | 2025 | Paper ↗Code ↗ | Edit result |
| 03 | SenseNova-U1-A3B-MoT | unverified | 0.91 | 2026 | Paper ↗Code ↗ | Edit result |
| 04 | BAGEL (7B MoT) with LLM rewriter | unverified | 0.88 | 2025 | Paper ↗Code ↗ | Edit result |
| 05 | Emu3.5 (34B, AR) | unverified | 0.86 | 2025 | Paper ↗Code ↗ | Edit result |
| 06 | BLIP3-o (8B) | unverified | 0.84 | 2025 | Paper ↗Code ↗ | Edit result |
| 07 | AsymFLUX.2 klein | unverified | 0.82 | 2026 | Paper ↗Code ↗ | Edit result |
| 08 | Spectral Progressive Diffusion (PixelGen, TF) | unverified | 0.78 | 2026 | Paper ↗ | Edit result |