Visual Question Answering2019en

GQA: Visual Reasoning in the Real World

22M compositional questions grounded in real images via scene graphs. Tests multi-step visual reasoning, spatial understanding, and attribute comparison.

Metrics:accuracy
Paper / Website

No benchmark results indexed for this dataset yet.

Contribute results on GitHub

Other Visual Question Answering Datasets

GQA Benchmark - Visual Question Answering | CodeSOTA