Image-Text-to-Text2024en

MMStar

Vision-indispensable multimodal benchmark filtering text-solvable questions

No benchmark results indexed for this dataset yet.

Contribute results on GitHub

Other Image-Text-to-Text Datasets