DocLayNet is IBM Research's large-scale document layout analysis dataset, with 80,863 annotated pages across 6 document categories: financial reports, scientific papers, patents, government tenders, manuals, and laws & regulations. Page regions are annotated with 11 semantic labels. It is a standard benchmark for general-domain document layout segmentation.
Mean average precision (mAP) is the reported evaluation metric for DocLayNet. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.
Higher is better
| Rank | Model | Trust | Score (mAP) | Year | Source |
|---|---|---|---|---|---|
| 01 | DocFormerv2-Large | paper | 84.1 | 2026 | Source ↗ |
| 02 | DiT-L (Cascade R-CNN) | paper | 82.6 | 2026 | Source ↗ |
| 03 | DiT-Large | paper | 79.5 | 2026 | Source ↗ |
| 04 | LayoutLMv3-Large | paper | 79.5 | 2026 | Source ↗ |
| 05 | LayoutLMv3 | paper | 76.8 | 2026 | Source ↗ |
| 06 | DINO (ResNet-50) | paper | 73.4 | 2026 | Source ↗ |
| 07 | YOLOv8-DocLayNet | vendor | 73.2 | 2026 | Source ↗ |
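
For readers less familiar with the metric, the following is a minimal sketch of how a detection-style mAP score can be computed. It assumes the COCO-style convention of averaging AP over IoU thresholds from 0.50 to 0.95 in steps of 0.05, which is how DocLayNet results are commonly reported but is not stated on this page. It also uses a simple non-interpolated AP on a single page rather than the 101-point interpolation in the official COCO tooling, so its numbers will not match published scores exactly. The function names (`iou`, `average_precision`, `mean_ap`) are illustrative, not part of any library.

```python
from typing import Dict, List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)
Pred = Tuple[float, Box]                 # (confidence score, box)

# Assumed COCO-style thresholds: 0.50, 0.55, ..., 0.95
IOU_THRESHOLDS = tuple((50 + 5 * i) / 100 for i in range(10))


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def average_precision(preds: Sequence[Pred], gts: Sequence[Box], thr: float) -> float:
    """Non-interpolated AP for one class at one IoU threshold."""
    if not gts:
        return 0.0
    matched = set()
    true_positives = 0
    ap = 0.0
    # Visit predictions from most to least confident.
    for rank, (_, box) in enumerate(sorted(preds, key=lambda p: -p[0]), start=1):
        best_j, best_iou = -1, thr
        for j, gt in enumerate(gts):
            if j in matched:
                continue  # each ground-truth box can be matched only once
            overlap = iou(box, gt)
            if overlap >= best_iou:
                best_j, best_iou = j, overlap
        if best_j >= 0:
            matched.add(best_j)
            true_positives += 1
            ap += true_positives / rank  # precision at this recall step
    return ap / len(gts)


def mean_ap(
    preds_by_class: Dict[str, List[Pred]],
    gts_by_class: Dict[str, List[Box]],
    thresholds: Sequence[float] = IOU_THRESHOLDS,
) -> float:
    """Mean over classes of AP averaged over IoU thresholds (range 0 to 1)."""
    per_class = []
    for label, gts in gts_by_class.items():
        preds = preds_by_class.get(label, [])
        aps = [average_precision(preds, gts, t) for t in thresholds]
        per_class.append(sum(aps) / len(aps))
    return sum(per_class) / len(per_class) if per_class else 0.0


# Toy example: one "Table" region, predicted with IoU 0.72 against the ground truth.
gts = {"Table": [(0.0, 0.0, 10.0, 10.0)]}
preds = {"Table": [(0.9, (0.0, 0.0, 10.0, 7.2))]}
score = mean_ap(preds, gts)
```

In the toy example the prediction counts as correct only at the five thresholds up to 0.70, so `score` is 0.5. The leaderboard values appear to be on a 0 to 100 scale, which would correspond to multiplying this result by 100.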