Computer Vision

Research focused on enabling computers to interpret and understand visual information from images and videos, including tasks such as image classification, object detection, segmentation, and visual recognition.

3 tasks97 datasets0 results

Tasks & Benchmarks

Show all datasets and SOTA results

3D generation

No datasets indexed yet. Contribute on GitHub

Few-Shot Image Classification

COCO 2017 Captions
COCO 2017 Panoptic Segmentation
COCO 2017 Stuff
COCO Captions2015
COCO minival2014
COCO test-challenge2014
COCO val2017 (Instance Segmentation)
COCO-Stuff2018
COCO-Text2016
COCO-WholeBody2020
Crossmodal-3600 (XM3600)
DL3DV-Benchmarks (140)
HELMET
HiRoom
IMC (Image Matching Challenge)
ImageNet-Hard
LOFT
LVD-142M
LVD-1689M
Language benchmarks (overall)
MMVP
MRCR
NTIRE 2024 Transparent Surface Challenge (relative)
OCRBench v2
SAT-493M
SciVideoBench
TAP-Vid (RGB-S)
Tanks and Temples (6)

Video generation

No datasets indexed yet. Contribute on GitHub

Get notified when these results update

New models drop weekly. We track them so you don't have to.