GQA contains 22M compositional questions about real images, with each question grounded in the image's scene graph. It tests multi-step visual reasoning, spatial understanding, and attribute comparison.
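To make "compositional" and "grounded via scene graphs" concrete, here is a minimal sketch of how a multi-step question could be answered against a scene graph. Everything in it is an illustrative assumption: the `SceneObject` class, the toy `scene`, and the `select`, `relate`, and `query_attribute` helpers are hypothetical names, not GQA's actual data format or tooling.

```python
from dataclasses import dataclass, field


@dataclass
class SceneObject:
    """One node of a toy scene graph (hypothetical structure, not GQA's schema)."""
    name: str
    attributes: set = field(default_factory=set)
    # Maps a relation name to the ids of the objects it points at.
    relations: dict = field(default_factory=dict)


# Hypothetical scene: a red apple on a wooden table, with a white cup to its left.
scene = {
    "o1": SceneObject("apple", {"red"}, {"on": ["o2"]}),
    "o2": SceneObject("table", {"wooden", "brown"}),
    "o3": SceneObject("cup", {"white"}, {"left of": ["o1"]}),
}

COLORS = {"red", "white", "brown"}


def select(graph, name):
    """Step 1: find the ids of all objects with the given name."""
    return [oid for oid, obj in graph.items() if obj.name == name]


def relate(graph, target_ids, relation):
    """Step 2: find objects that stand in `relation` to any target object."""
    return [
        oid
        for oid, obj in graph.items()
        if any(t in obj.relations.get(relation, []) for t in target_ids)
    ]


def query_attribute(graph, ids, candidates):
    """Step 3: return the first candidate attribute found on the given objects."""
    for oid in ids:
        hits = graph[oid].attributes & candidates
        if hits:
            return sorted(hits)[0]
    return None


def answer_color_of_thing_on(graph, support_name):
    """Answer 'What color is the object on the <support_name>?' in three steps."""
    supports = select(graph, support_name)
    on_top = relate(graph, supports, "on")
    return query_attribute(graph, on_top, COLORS)


if __name__ == "__main__":
    print(answer_color_of_thing_on(scene, "table"))
```

The question "What color is the object on the table?" chains a lookup, a spatial relation, and an attribute query, which is the kind of multi-step reasoning the description refers to.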
No benchmark results available yet for GQA.