Codesota · Computer Vision · Image generation · ImageNet 256x256Tasks/Computer Vision/Image generation
Image generation · benchmark dataset · EN

ImageNet (ILSVRC2012) 256x256.

ImageNet 256x256 is a commonly used resized variant of the ImageNet ILSVRC2012 (ImageNet-1k) image classification dataset where images have been resized / center-cropped and rescaled to 256x256 pixels. The underlying dataset (ILSVRC2012) contains 1.28M training images, 50K validation images and 100K test images across 1000 classes. The 256x256 variant is provided as a convenience for faster downloads and for workflows that perform random crops (e.g., 224x224 crops from 256). This variant is widely used in image-generation and generative-model evaluation papers (e.g., for computing FID and Inception Score using 50K generated images — 50 samples per class for the 1000 classes). The dataset is not a new collection but a transformed / resized version of the standard ImageNet (ILSVRC2012) split. Representative Hugging Face repacks include evanarlian/imagenet_1k_resized_256 and benjamin-paine/imagenet-1k-256x256 which describe the exact resize/center-crop procedures and point to the original ImageNet/ILSVRC references.

Paper Submit a result
§ 01 · Leaderboard

Best published scores.

No results indexed yet — be the first to submit a score.

No benchmark results indexed yet
§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result Read submission guide
What a submission needs
  • 01A public checkpoint or API endpoint
  • 02A reproduction script with frozen commit + seed
  • 03Declared evaluation environment (Python, deps)
  • 04One row per metric declared by this dataset
  • 05A contact so we can follow up on discrepancies
ImageNet 256x256 — Image generation benchmark · Codesota | CodeSOTA