Document Understanding · 2022 · en
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
IBM dataset of 80,863 human-annotated pages across 6 document categories (financial, scientific, patents, law, government, manuals), labeled with 11 layout element classes. Supersedes PubLayNet for general-purpose layout analysis.
Current State of the Art
DocFormerv2-Large
Adobe Research
84.1
mAP
mAP Progress Over Time
Showing 2 breakthroughs from Mar 2022 to Jun 2023
Key Milestones
Mar 2022
DiT-L (Cascade R-CNN)
DiT-L with Cascade R-CNN on DocLayNet. 82.6 mAP (COCO-style). Table 4 of DocLayNet paper (arXiv 2206.01062). Best result in the original IBM benchmark paper at publication.
82.6
Jun 2023
DocFormerv2-Large (Current SOTA)
DocFormerv2-Large on DocLayNet. 84.1 mAP (COCO-style). Table 2 in DocFormerv2 paper (arXiv 2306.01733, 2023). Adobe Research.
84.1
+1.8%
Total Improvement
1.8% relative (+1.5 mAP points)
Time Span
1y 3m
Breakthroughs
2
Current SOTA
84.1
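The summary figures above can be reproduced from the two milestone scores. The following is a minimal sketch, assuming the 1.8% figure is a relative gain over the earlier score (the absolute gain is 1.5 mAP points) and that each milestone falls on the first day of its month. The names `milestones`, `improvement`, and `span_in_months` are illustrative and not part of the leaderboard.

```python
from datetime import date

# Milestone data copied from the timeline above; the day of month is an assumption.
milestones = [
    (date(2022, 3, 1), "DiT-L (Cascade R-CNN)", 82.6),
    (date(2023, 6, 1), "DocFormerv2-Large", 84.1),
]


def improvement(first: float, last: float) -> tuple[float, float]:
    """Return (absolute gain in mAP points, relative gain in percent)."""
    absolute = last - first
    relative = absolute / first * 100
    return absolute, relative


def span_in_months(start: date, end: date) -> int:
    """Whole calendar months between two dates, ignoring the day of month."""
    return (end.year - start.year) * 12 + (end.month - start.month)


first_date, _, first_score = milestones[0]
last_date, _, last_score = milestones[-1]

abs_gain, rel_gain = improvement(first_score, last_score)
years, rem = divmod(span_in_months(first_date, last_date), 12)

print(f"+{abs_gain:.1f} mAP points, +{rel_gain:.1f}% relative, {years}y {rem}m")
# prints: +1.5 mAP points, +1.8% relative, 1y 3m
```

Reporting both the absolute and the relative gain avoids ambiguity when a percentage is quoted next to a metric that is itself expressed on a 0 to 100 scale.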
Top Models Performance Comparison
Top 7 models ranked by mAP
Best Score
84.1
Top Model
DocFormerv2-Large
Models Compared
7
Score Range
10.9
mAP (Primary)
| # | Model | Organization | Availability | Score (mAP) | Date |
|---|---|---|---|---|---|
| 1 | DocFormerv2-Large | Adobe Research | Open Source | 84.1 | Mar 2026 |
| 2 | DiT-L (Cascade R-CNN) | Microsoft Research | Open Source | 82.6 | Mar 2026 |
| 3 | DiT-Large | Microsoft | Open Source | 79.5 | Mar 2026 |
| 4 | LayoutLMv3-Large | Microsoft Research | Open Source | 79.5 | Mar 2026 |
| 5 | LayoutLMv3 | Microsoft | Open Source | 76.8 | Mar 2026 |
| 6 | DINO (ResNet-50) | Research (IDEA Research) | Open Source | 73.4 | Mar 2026 |
| 7 | YOLOv8-DocLayNet | Research | Open Source | 73.2 | Mar 2026 |
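The comparison summary (best score, top model, models compared, score range) follows directly from the table rows. A short sketch of that derivation is given here, with the scores copied from the table; the `leaderboard` and `summary` names are illustrative only.

```python
# (model, mAP) pairs copied from the leaderboard table.
leaderboard = [
    ("DocFormerv2-Large", 84.1),
    ("DiT-L (Cascade R-CNN)", 82.6),
    ("DiT-Large", 79.5),
    ("LayoutLMv3-Large", 79.5),
    ("LayoutLMv3", 76.8),
    ("DINO (ResNet-50)", 73.4),
    ("YOLOv8-DocLayNet", 73.2),
]

scores = [score for _, score in leaderboard]
top_model, best_score = max(leaderboard, key=lambda row: row[1])

summary = {
    "best_score": best_score,
    "top_model": top_model,
    "models_compared": len(leaderboard),
    # Rounded to one decimal to hide floating-point noise in the subtraction.
    "score_range": round(max(scores) - min(scores), 1),
}

print(summary)
```

The score range here is the gap between the highest and lowest listed mAP, so it describes only the seven models shown, not every published DocLayNet result.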