Large Vocabulary Instance Segmentation v1.0
1,203 object categories with federated, long-tail distribution across 164K COCO images. Tests real-world detection with rare and fine-grained categories.
DINO-X
IDEA Research
67
mask-ap
mask-ap Progress Over Time
Showing 5 breakthroughs from Dec 2021 to Nov 2024
Key Milestones
Mask2Former with Swin-L backbone. 56.1 mask AP on LVIS v1.0 minival. Table 7, arxiv:2112.01527. CVPR 2022.
ViTDet-H with MAE pretraining and Cascade Mask RCNN head. 59.5 mask AP on LVIS v1.0 minival. Table 3, arxiv:2203.16527. ECCV 2022.
InternImage-H (1B params) with DINO detection head. 65.4 mask AP on LVIS v1.0 minival. Table 5, arxiv:2211.05778. CVPR 2023.
APE-Large with EVA-02 ViT-L backbone. 66.4 mask AP on LVIS v1.0 minival in closed-set evaluation. Table 2, arxiv:2312.02153. CVPR 2024.
DINO-X unified model. 67.0 mask AP on LVIS v1.0 minival — SOTA at time of release Nov 2024. Table 3, arxiv:2411.14347.
Top Models Performance Comparison
Top 9 models ranked by mask-ap
box-ap
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | DINO-XOpen Source IDEA Research | 71.4 | Nov 2024 | |
| 2 | APE-LargeOpen Source Tsinghua / MEGVII | 70.3 | Dec 2023 | |
| 3 | Co-DINO (ViT-L)Open Source Sensetime / Sense-X | 68 | Nov 2022 | |
| 4 | ViTDet-H (MAE)Open Source Meta AI | 64 | Mar 2022 |
mask-apPrimary
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | DINO-XOpen Source IDEA Research | 67 | Nov 2024 | |
| 2 | APE-LargeOpen Source Tsinghua / MEGVII | 66.4 | Dec 2023 | |
| 3 | InternImage-HOpen Source Shanghai AI Lab | 65.4 | Nov 2022 | |
| 4 | EVA-02-LOpen Source BAAI / PKU | 62.1 | Mar 2023 | |
| 5 | ViTDet-H (MAE)Open Source Meta AI | 59.5 | Mar 2022 | |
| 6 | Mask2Former (Swin-L)Open Source Meta AI / UIUC | 56.1 | Dec 2021 | |
| 7 | ViTDet-HOpen Source Meta AI | 53.4 | Mar 2026 | |
| 8 | EVA-02-L (LVIS)Open Source BAAI | 50.7 | Mar 2026 | |
| 9 | Mask2Former (Swin-L) LVISOpen Source Meta AI | 44.6 | Mar 2026 |
mask-ap-rare
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | APE-LargeOpen Source Tsinghua / MEGVII | 65.4 | Dec 2023 | |
| 2 | EVA-02-LOpen Source BAAI / PKU | 61 | Mar 2023 | |
| 3 | Mask2Former (Swin-L)Open Source Meta AI / UIUC | 53.5 | Dec 2021 |