Document Understanding2022en

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

IBM dataset with 80,863 pages across 6 document categories (financial, scientific, patents, law, government, manuals). 11 layout element classes. Supersedes PubLayNet for general-purpose layout analysis.

Samples:80,863
Metrics:mAP, mAP@50, mAP@75
Paper / Website
Current State of the Art

DocFormerv2-Large

Adobe Research

84.1

mAP

mAP Progress Over Time

Showing 2 breakthroughs from Mar 2022 to Jun 2023

82.482.983.383.884.2Mar 2022Jun 2023mAPDate

Key Milestones

Mar 2022
DiT-L (Cascade R-CNN)

DiT-L with Cascade R-CNN on DocLayNet. 82.6 mAP (COCO-style). Table 4 of DocLayNet paper (arXiv 2206.01062). Best result in the original IBM benchmark paper at publication.

82.6
Jun 2023
DocFormerv2-LargeCurrent SOTA

DocFormerv2-Large on DocLayNet. 84.1 mAP (COCO-style). Table 2 in DocFormerv2 paper (arXiv 2306.01733, 2023). Adobe Research.

84.1
+1.8%
Total Improvement
1.8%
Time Span
1y 3m
Breakthroughs
2
Current SOTA
84.1

Top Models Performance Comparison

Top 7 models ranked by mAP

mAP1DocFormerv2-Large84.1100.0%2DiT-L (Cascade R-CNN)82.698.2%3DiT-Large79.594.5%4LayoutLMv3-Large79.594.5%5LayoutLMv376.891.3%6DINO (ResNet-50)73.487.3%7YOLOv8-DocLayNet73.287.0%0%25%50%75%100%% of best
Best Score
84.1
Top Model
DocFormerv2-Large
Models Compared
7
Score Range
10.9

mAPPrimary

#ModelScorePaper / CodeDate
1
DocFormerv2-LargeOpen Source
Adobe Research
84.1Mar 2026
2
DiT-L (Cascade R-CNN)Open Source
Microsoft Research
82.6Mar 2026
3
DiT-LargeOpen Source
Microsoft
79.5Mar 2026
4
LayoutLMv3-LargeOpen Source
Microsoft Research
79.5Mar 2026
5
LayoutLMv3Open Source
Microsoft
76.8Mar 2026
6
DINO (ResNet-50)Open Source
Research (IDEA Research)
73.4Mar 2026
7
YOLOv8-DocLayNetOpen Source
Research
73.2Mar 2026

Other Document Understanding Datasets