Codesota · General · Retrieval · INRIA Copydays (strong subset)Tasks/General/Retrieval
Retrieval · benchmark dataset · EN

INRIA CopyDays.

INRIA CopyDays (Copydays) is a standard benchmark dataset from INRIA (H. Jégou and collaborators) for image copy-detection / near-duplicate image retrieval. The dataset contains original (unmodified) images and corresponding transformed copies produced with various image distortions; the dataset is provided with named subsets, including a "strong" subset that contains heavily modified copies (examples of strong modifications: cropping, rotation, compression, large photometric/geometric changes). INRIA CopyDays is widely used to evaluate robustness of image-retrieval and copy-detection systems; many works evaluate on the CopyDays strong subset and commonly augment the evaluation by adding distractors from large web collections such as YFCC100M (the paper reports results on the strong subset with 10k YFCC100M distractors). Sources: INRIA dataset page for Jégou's datasets (INRIA Holidays / CopyDays) and the Hugging Face dataset entry (randall-lab/INRIA-CopyDays).

Paper Submit a result
§ 01 · Leaderboard

Best published scores.

No results indexed yet — be the first to submit a score.

No benchmark results indexed yet
§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result Read submission guide
What a submission needs
  • 01A public checkpoint or API endpoint
  • 02A reproduction script with frozen commit + seed
  • 03Declared evaluation environment (Python, deps)
  • 04One row per metric declared by this dataset
  • 05A contact so we can follow up on discrepancies
INRIA Copydays (strong subset) — Retrieval benchmark · Codesota | CodeSOTA