DocLayNet is IBM Research's large-scale document layout analysis dataset, with 80,863 annotated pages across 6 document categories: financial reports, scientific papers, patents, government tenders, manuals, and laws & regulations. Page regions are annotated with 11 semantic labels. It is widely used as a benchmark for general-domain document layout segmentation.
mAP (mean average precision) is the reported evaluation metric for DocLayNet. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
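For readers unfamiliar with the metric, here is a minimal sketch of how average precision can be computed for one class at a single IoU threshold. It is a simplified illustration, not the evaluation code behind any listed score: it assumes axis-aligned boxes, greedy matching by confidence, and a non-interpolated precision-recall sum. COCO-style evaluation additionally averages over classes and over IoU thresholds from 0.50 to 0.95; whether every score listed here follows that protocol is not stated on this page. The function names `iou` and `average_precision` are hypothetical.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def average_precision(
    preds: List[Tuple[float, Box]], truths: List[Box], thr: float = 0.5
) -> float:
    """Simplified AP for one class at one IoU threshold (assumption: no interpolation)."""
    if not truths:
        return 0.0
    matched = set()
    hits = []
    # Visit predictions from most to least confident.
    for _, box in sorted(preds, key=lambda p: p[0], reverse=True):
        best_j, best_iou = -1, thr
        # Match to the best still-unmatched ground-truth box at or above the threshold.
        for j, truth in enumerate(truths):
            overlap = iou(box, truth)
            if j not in matched and overlap >= best_iou:
                best_j, best_iou = j, overlap
        if best_j >= 0:
            matched.add(best_j)
        hits.append(best_j >= 0)
    # Sum precision at each true positive, weighted by the recall step 1/len(truths).
    ap, tp = 0.0, 0
    for rank, hit in enumerate(hits, start=1):
        if hit:
            tp += 1
            ap += (tp / rank) / len(truths)
    return ap


# Illustrative data only: two ground-truth regions, three predictions.
truths = [(0, 0, 10, 10), (20, 20, 30, 30)]
preds = [(0.9, (0, 0, 10, 10)), (0.8, (50, 50, 60, 60)), (0.7, (20, 20, 30, 30))]
print(round(average_precision(preds, truths), 4))
```

A mean over all region classes (and, in COCO-style evaluation, over several IoU thresholds) would turn these per-class values into a single mAP figure.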
Muted rows were not state of the art when published — an earlier or same-year result already scored better.
| Rank | Model | Trust | Score (mAP) | Year |
|---|---|---|---|---|
| 01 | DocFormerv2-Large | paper | 84.1 | 2026 |
| 02 | DiT-L (Cascade R-CNN) | paper | 82.6 | 2026 |
| 03 | DiT-Large | paper | 79.5 | 2026 |
| 04 | LayoutLMv3-Large | paper | 79.5 | 2026 |
| 05 | LayoutLMv3 | paper | 76.8 | 2026 |
| 06 | DINO (ResNet-50) | paper | 73.4 | 2026 |
| 07 | YOLOv8-DocLayNet | vendor | 73.2 | 2026 |
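
The muting rule stated above the table can be sketched as a small filter. This is an illustration under stated assumptions, not Codesota's actual implementation: the `Row` type and `muted` function are hypothetical, and "scored better" is read as strictly higher, so tied scores do not mute each other.

```python
from typing import List, NamedTuple


class Row(NamedTuple):
    model: str
    score: float
    year: int


def muted(rows: List[Row]) -> List[bool]:
    """Flag rows for which an earlier or same-year row has a strictly higher score."""
    return [
        any(other.year <= row.year and other.score > row.score for other in rows)
        for row in rows
    ]


# Rows copied from the table above.
rows = [
    Row("DocFormerv2-Large", 84.1, 2026),
    Row("DiT-L (Cascade R-CNN)", 82.6, 2026),
    Row("DiT-Large", 79.5, 2026),
    Row("LayoutLMv3-Large", 79.5, 2026),
    Row("LayoutLMv3", 76.8, 2026),
    Row("DINO (ResNet-50)", 73.4, 2026),
    Row("YOLOv8-DocLayNet", 73.2, 2026),
]

for row, is_muted in zip(rows, muted(rows)):
    print(f"{row.model}: {'muted' if is_muted else 'state of the art'}")
```

Because every row in the table carries the same year, this reading of the rule leaves only the top-scoring row unmuted.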