Image Classification
Image classification is the task that launched modern deep learning: AlexNet's 2012 ImageNet win nearly halved the top-5 error rate and triggered the neural network renaissance. The progression from VGGNet to ResNet to Vision Transformers traces the intellectual history of the field itself. Today's frontier models like EVA-02 and SigLIP push top-1 accuracy above 91% on ImageNet, but the real action has shifted to efficiency (MobileNet, EfficientNet) and robustness under distribution shift. It remains the default benchmark for new architectures and the foundation that every other vision task builds on.
Image classification assigns a single label to an entire image. It is the oldest deep learning benchmark and the task that proved neural networks work: ImageNet top-1 accuracy climbed from roughly 63% with hand-crafted features in 2011 to over 91% with SigLIP in 2024. For standard benchmarks the task is largely solved, but domain-specific classification (medical, satellite, industrial) remains the real deployment challenge.
History
2009: ImageNet dataset (Deng et al.) created with 14M images across 21k categories, establishing the benchmark that would define a decade
2012: AlexNet (Krizhevsky et al.) wins ILSVRC with 15.3% top-5 error, more than 10 points ahead of the runner-up (26.2%), proving deep learning works for vision
2014: VGGNet (19 layers) and GoogLeNet (Inception modules) push top-5 error to 6.7%, showing that depth matters
2015: ResNet introduces skip connections enabling 152-layer networks and achieves 3.57% top-5 error, surpassing the estimated human level of 5.1%
2017: SENet wins the final ILSVRC competition with 2.25% top-5 error; channel attention becomes standard
2019: EfficientNet (Tan & Le) combines neural architecture search for its base network with compound scaling of width/depth/resolution, setting a new efficiency frontier
2020: Vision Transformer (ViT) by Dosovitskiy et al. proves transformers work for images when pretrained on large data (JFT-300M)
2021: CLIP (Radford et al.) and ALIGN show that contrastive language-image pretraining produces classification-capable representations without classification labels
2023: DINOv2 (Meta) achieves strong classification via self-supervised learning on 142M curated images, no labels needed
2024: SigLIP-SO400M reaches 91.1% ImageNet top-1 with a sigmoid loss, and open foundation models make linear probing competitive with full fine-tuning
How Image Classification Works
Input Preprocessing
Images are resized (typically 224×224 or 384×384), normalized to dataset statistics, and augmented (random crop, flip, RandAugment, CutMix) during training.
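The normalization step can be sketched in a few lines of plain Python. This is a minimal illustration, not a full pipeline; the mean/std values are the widely used ImageNet statistics, and `normalize_pixel` is a hypothetical helper name.

```python
# Sketch of the standard normalization step: scale pixel values to
# [0, 1], then subtract the per-channel mean and divide by the
# per-channel standard deviation (ImageNet statistics below).
IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)

def normalize_pixel(rgb, mean=IMAGENET_MEAN, std=IMAGENET_STD):
    """Normalize one RGB pixel given as 0-255 integer values."""
    return tuple((c / 255.0 - m) / s for c, m, s in zip(rgb, mean, std))

# A mid-gray pixel lands near zero in every channel after normalization.
print(normalize_pixel((124, 116, 104)))
```

Real pipelines apply the same arithmetic per channel over the whole tensor (e.g. with a framework's normalize transform) after resizing and augmentation.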
Feature Extraction
A backbone network (ResNet, ConvNeXt, ViT, SigLIP encoder) processes the image into a high-dimensional feature map. CNNs use hierarchical convolutions; ViTs split the image into patches and apply self-attention.
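The ViT patch-splitting step can be sketched in plain Python. This is a toy single-channel version; a real ViT works on 3-channel images and adds a learned linear projection plus position embeddings to each flattened patch.

```python
# Minimal sketch of how a ViT-style model splits an image into patches.
# The "image" here is a nested list of shape (H, W).
def split_into_patches(image, patch_size):
    h, w = len(image), len(image[0])
    patches = []
    for top in range(0, h, patch_size):
        for left in range(0, w, patch_size):
            # Flatten each patch_size x patch_size block into one vector.
            patch = [image[top + i][left + j]
                     for i in range(patch_size)
                     for j in range(patch_size)]
            patches.append(patch)
    return patches

# A 224x224 image with 16x16 patches yields (224/16)^2 = 196 tokens,
# the standard ViT-Base sequence length (plus one [CLS] token).
image = [[0] * 224 for _ in range(224)]
print(len(split_into_patches(image, 16)))  # 196
```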
Pooling
Spatial features are collapsed into a single vector — global average pooling for CNNs, the [CLS] token or mean pooling for transformers.
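Global average pooling is simple enough to show directly; this sketch averages each channel of a tiny (C, H, W) feature map represented as nested lists.

```python
# Global average pooling collapses a (C, H, W) feature map into a
# C-dimensional vector by averaging each channel over its spatial grid.
def global_average_pool(feature_map):
    return [sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
            for channel in feature_map]

# Two 2x2 channels -> a 2-dimensional pooled vector.
fmap = [[[1.0, 2.0], [3.0, 4.0]],
        [[0.0, 0.0], [10.0, 10.0]]]
print(global_average_pool(fmap))  # [2.5, 5.0]
```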
Classification Head
A linear layer (or small MLP) projects the pooled features to class logits. Softmax converts logits to probabilities, and cross-entropy loss drives training.
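The head, softmax, and loss fit in a short sketch. The 2-class weights below are made-up toy values; real heads have one row per class and thousands of feature dimensions.

```python
import math

# Sketch of a linear classification head: logits = W @ features + b,
# softmax to probabilities, cross-entropy against the true class.
def linear_head(features, weights, bias):
    return [sum(w * f for w, f in zip(row, features)) + b
            for row, b in zip(weights, bias)]

def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, true_class):
    return -math.log(probs[true_class])

features = [0.5, -1.0, 2.0]
weights = [[1.0, 0.0, 0.0],   # hypothetical 2-class head
           [0.0, 0.0, 1.0]]
bias = [0.0, 0.0]
logits = linear_head(features, weights, bias)   # [0.5, 2.0]
probs = softmax(logits)                         # sums to 1
loss = cross_entropy(probs, true_class=1)
```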
Inference
At test time, optional techniques like test-time augmentation (TTA) and model ensembling can boost accuracy by 0.5-1%. Top-1 and top-5 accuracy are the standard metrics.
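TTA amounts to averaging probabilities over augmented views; a minimal sketch, with made-up per-view probabilities standing in for real model outputs:

```python
# Sketch of test-time augmentation (TTA): run the model on several
# augmented views (e.g. the image and its horizontal flip), then
# average the softmax probabilities before taking the argmax.
def tta_average(prob_lists):
    n = len(prob_lists)
    return [sum(view[i] for view in prob_lists) / n
            for i in range(len(prob_lists[0]))]

def top1(probs):
    return max(range(len(probs)), key=probs.__getitem__)

views = [[0.55, 0.40, 0.05],   # original image
         [0.45, 0.50, 0.05]]   # horizontal flip
avg = tta_average(views)       # ~[0.50, 0.45, 0.05]
print(top1(avg))               # class 0
```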
Current Landscape
The image classification landscape in 2025 is mature and bifurcated. For standard benchmarks, the task is effectively solved — ImageNet top-1 has plateaued above 91%, and gains are measured in tenths of a percent. Vision transformers dominate at scale, while ConvNeXt proved CNNs can match them with modern training recipes. The real action is in foundation model representations: CLIP, SigLIP, DINOv2, and InternVL produce features so good that a linear probe rivals full fine-tuning, making the backbone choice matter more than the classification head. The practical question is no longer 'how accurate can we get on ImageNet' but 'which pretrained features transfer best to my specific domain.'
Key Challenges
Domain shift between training data (ImageNet, web-scraped) and deployment domains (medical imaging, satellite, industrial inspection) — models that hit 90%+ on benchmarks can drop to 60% on new distributions
Long-tail distributions where rare classes have very few training examples, common in real-world datasets like iNaturalist (8k+ species, some with <10 images)
Calibration — models are often overconfident on wrong predictions, which matters critically in medical and safety applications
Computational cost of ViT-Large/Huge models (300M-600M params) vs. deployment constraints on edge devices and mobile phones
Label noise in web-scraped training data (estimated 5-10% noise in ImageNet itself) propagates into learned representations
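For the calibration problem above, the standard post-hoc fix is temperature scaling: divide logits by a temperature T > 1 (fit on a held-out validation set) before the softmax. A minimal sketch with made-up logits:

```python
import math

# Temperature scaling softens overconfident predictions without
# changing the argmax: softmax(logits / T) with T > 1.
def softmax(logits):
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]

def calibrate(logits, temperature):
    return softmax([z / temperature for z in logits])

logits = [4.0, 1.0, 0.0]           # hypothetical model output
print(max(softmax(logits)))        # overconfident, ~0.94
print(max(calibrate(logits, 2.0))) # softened, ~0.74
```

In practice T is chosen to minimize negative log-likelihood on validation data; the predicted class is unchanged, only the confidence shrinks.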
Quick Recommendations
Best accuracy (compute unlimited)
SigLIP-SO400M + linear probe or InternViT-6B
91.1% top-1 on ImageNet with minimal fine-tuning; SigLIP's sigmoid loss handles noisy data better than softmax-based CLIP
Best accuracy/efficiency tradeoff
ConvNeXt V2-Base or EVA-02-Base
85-86% top-1 at ~90M params, strong transfer to downstream tasks, runs well on a single GPU
Edge deployment / mobile
EfficientNet-B0 or MobileNetV3-Large
77-80% accuracy at 4-5M params, optimized for TFLite/ONNX, <10ms on modern phones
Few-shot / low-data regime
DINOv2-ViT-L + k-NN classifier
Self-supervised features generalize with as few as 5 examples per class, no fine-tuning needed
Open-vocabulary / zero-shot
SigLIP or OpenCLIP ViT-G/14
Classify into arbitrary text-described categories without retraining, 80%+ zero-shot on ImageNet
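The few-shot recipe above (frozen features + k-NN) can be sketched in plain Python. The 2-dimensional "features" and labels are toy stand-ins for real DINOv2 embeddings, which would have ~1,024 dimensions.

```python
import math

# Few-shot classification via cosine-similarity k-NN over precomputed
# backbone features: no fine-tuning, just nearest-neighbor voting.
def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def knn_predict(query, support, k=3):
    # support: list of (feature_vector, label) pairs
    neighbors = sorted(support, key=lambda s: cosine(query, s[0]),
                       reverse=True)[:k]
    labels = [label for _, label in neighbors]
    return max(set(labels), key=labels.count)  # majority vote

support = [([1.0, 0.1], "cat"), ([0.9, 0.2], "cat"),
           ([0.1, 1.0], "dog"), ([0.2, 0.9], "dog")]
print(knn_predict([0.95, 0.15], support, k=3))  # cat
```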
What's Next
The frontier is moving toward open-vocabulary classification (classify into any text-described category), continual learning (adapt to new classes without forgetting old ones), and multimodal classification that uses text, metadata, and images jointly. Foundation models pretrained on billions of image-text pairs are making task-specific classifiers obsolete for many applications. The remaining hard problems are fine-grained recognition under distribution shift and calibrated uncertainty estimation for safety-critical deployments.
Benchmarks & SOTA
ImageNet-1K (ImageNet Large Scale Visual Recognition Challenge 2012)
1.28M training images, 50K validation images across 1,000 object classes. The standard benchmark for image classification since 2012.
State of the art: CoCa (finetuned), 91.0% top-1 accuracy
CIFAR-100 (Canadian Institute for Advanced Research 100)
60K 32x32 color images in 100 fine-grained classes grouped into 20 superclasses. More challenging than CIFAR-10.
State of the art: ViT-H/14, 94.55% accuracy
CIFAR-10 (Canadian Institute for Advanced Research 10)
60K 32x32 color images in 10 classes. Classic small-scale image classification benchmark with 50K training and 10K test images.
State of the art: DeiT-B Distilled (Meta), 99.1% accuracy
ImageNet-V2 (Matched Frequency)
10K new test images collected following the original ImageNet protocol. Tests model generalization beyond the original test set.
State of the art: Swin Transformer V2 Large (Microsoft), 84.0% top-1 accuracy