Codesota · Papers · Multimodal2025-02-19
§ 01 · Benchmark results
3 results reproduced from this paper.
| # | Model | Vendor | Benchmark | Metric | Value | SOTA | Date | Source |
|---|---|---|---|---|---|---|---|---|
| 01 | Qwen2.5-VL 72B | Alibaba | MMBench | accuracy | 90.5% | 2025-02-19 | source ↗ | |
| 02 | Qwen2.5-VL 72B | Alibaba | MMMU | accuracy | 70.2% | 2025-02-19 | source ↗ | |
| 03 | Qwen2.5-VL 72B | Alibaba | TextVQA | accuracy | 85.5% | 2025-02-19 | source ↗ |
§ 04 · Related papers
Other Multimodal papers tracked on Codesota.
- 2025-01-22 · 2 resultsInternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
- 2025-01-15 · 1 resultGemini 2.0 Flash Technical Report
- 2024-10-25 · 4 resultsSWE-bench Verified
- 2024-10-22 · 1 resultClaude 3.5 Sonnet Model Card
- 2024-09-18 · 4 resultsQwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
- 2024-04-25 · 4 resultsInternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks