Visual Question Answering / 2019 / en
GQA: Visual Reasoning in the Real World
22M compositional questions grounded in real images via scene graphs. Tests multi-step visual reasoning, spatial understanding, and attribute comparison.
Metrics: accuracy
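Since the only metric listed is accuracy, a minimal sketch of how it might be computed is given here for illustration. It assumes exact string match between predicted and reference answers after lowercasing and trimming whitespace; that normalization, the function name `accuracy`, and the question ids and answers in the example are all assumptions, not taken from the official GQA evaluation.

```python
def accuracy(predictions, references):
    """Fraction of reference questions answered correctly (exact match).

    Both arguments map a question id to an answer string. A question with
    no prediction counts as wrong. The lowercase/strip normalization is an
    assumption for this sketch, not the official GQA protocol.
    """
    if not references:
        raise ValueError("references must not be empty")
    correct = sum(
        1
        for qid, answer in references.items()
        if predictions.get(qid, "").strip().lower() == answer.strip().lower()
    )
    return correct / len(references)


# Hypothetical question ids and answers, made up for the example.
refs = {"q1": "yes", "q2": "left", "q3": "red", "q4": "no"}
preds = {"q1": "yes", "q2": "right", "q3": "Red"}  # q4 is unanswered

print(accuracy(preds, refs))
```

Dividing by the number of reference questions, rather than the number of predictions, means skipped questions lower the score instead of being ignored.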
No benchmark results indexed for this dataset yet.