Image Classification2012en

ImageNet Large Scale Visual Recognition Challenge 2012

1.28M training images, 50K validation images across 1,000 object classes. The standard benchmark for image classification since 2012.

Samples:1,281,167
Metrics:top-1-accuracy, top-5-accuracy
Paper / WebsiteDownload
Current State of the Art

CoCa (finetuned)

Google

91

top-1-accuracy

top-1-accuracy Progress Over Time

Showing 5 breakthroughs from Dec 2015 to May 2022

77.481.184.888.592.2Dec 2015Jul 2017Feb 2019Sep 2020May 2022top-1-accuracyDate

Key Milestones

Dec 2015
ResNet-152

10-crop evaluation. Original deep residual network.

78.6
May 2019
EfficientNet-B7

8.4x smaller than GPipe. 66M parameters.

84.4
+7.4%
Oct 2020
ViT-H/14

Huge ViT variant. 632M parameters.

88.5
+4.9%
Jun 2021
ViT-G/14

Giant ViT variant. 1.8B parameters.

90.5
+2.1%
May 2022
CoCa (finetuned)Current SOTA

Current SOTA on ImageNet-1K. 2.1B parameters. Contrastive Captioner architecture.

91.0
+0.6%
Total Improvement
15.8%
Time Span
6y 6m
Breakthroughs
5
Current SOTA
91.0

Top Models Performance Comparison

Top 10 models ranked by top-1-accuracy

top-1-accuracy1CoCa (finetuned)91.0100.0%2ViT-G/1490.599.4%3ConvNeXt V2 Huge88.997.7%4ViT-H/1488.597.3%5Swin Transformer Large87.395.9%6EfficientNetV2-L85.794.2%7DeiT-B Distilled85.293.6%8EfficientNet-B784.492.7%9DeiT-B83.191.3%10ConvNeXt V2 Tiny83.091.2%0%25%50%75%100%% of best
Best Score
91.0
Top Model
CoCa (finetuned)
Models Compared
10
Score Range
8.0

top-1-accuracyPrimary

#ModelScorePaper / CodeDate
1
CoCa (finetuned)Open Source
Google
91Dec 2025
2
ViT-G/14Open Source
Google
90.45Dec 2025
3
ConvNeXt V2 HugeOpen Source
Meta
88.9Dec 2025
4
ViT-H/14Open Source
Google
88.55Dec 2025
5
Swin Transformer LargeOpen Source
Microsoft
87.3Dec 2025
6
EfficientNetV2-LOpen Source
Google
85.7Dec 2025
7
DeiT-B DistilledOpen Source
Meta
85.2Dec 2025
8
EfficientNet-B7Open Source
Google
84.4Dec 2025
9
DeiT-BOpen Source
Meta
83.1Dec 2025
10
ConvNeXt V2 TinyOpen Source
Meta
83Dec 2025
11
ViT-L/16Open Source
Google
82.7Dec 2025
12
ViT-B/16Open Source
Google
81.2Dec 2025
13
ResNet-50 (A3 training)Open Source
Timm
80.4Dec 2025
14
ResNet-152Open Source
Microsoft
78.6Dec 2025
15
EfficientNet-B0Open Source
Google
77.1Dec 2025
16
ResNet-50Open Source
Microsoft
76.15Dec 2025

Other Image Classification Datasets