Codesota · Computer Vision · Image segmentation · Oxford-IIIT PetsTasks/Computer Vision/Image segmentation
Image segmentation · benchmark dataset · EN

Oxford-IIIT Pet Dataset.

The Oxford-IIIT Pet Dataset is a fine-grained image dataset of pets created by the Visual Geometry Group (VGG) at the University of Oxford. It contains images of 37 pet breeds (cats and dogs) with large variations in scale, pose and lighting. The dataset provides per-image annotations including the breed label and species (cat/dog), a tight head ROI (bounding box), and pixel-level trimap segmentation (foreground / background / ignore). Common splits used in ML libraries (e.g., TensorFlow Datasets) have 3,680 training images and 3,669 test images (7,349 images total). Typical uses: breed classification (37-way), binary species classification (cat vs dog), and segmentation/foreground extraction. Licensing: CC BY-SA 4.0 (as listed on several mirrors). Source / references: original VGG dataset page (robots.ox.ac.uk), TensorFlow Datasets entry (oxford_iiit_pet), and Hugging Face dataset mirrors (e.g., timm/oxford-iiit-pet).

Paper Submit a result
§ 01 · Leaderboard

Best published scores.

No results indexed yet — be the first to submit a score.

No benchmark results indexed yet
§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result Read submission guide
What a submission needs
  • 01A public checkpoint or API endpoint
  • 02A reproduction script with frozen commit + seed
  • 03Declared evaluation environment (Python, deps)
  • 04One row per metric declared by this dataset
  • 05A contact so we can follow up on discrepancies