Codesota · Papers · Multimodal2025-01-22
Paper
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
§ 01 · Benchmark results
2 results reproduced from this paper.
| # | Model | Vendor | Benchmark | Metric | Value | SOTA | Date | Source |
|---|---|---|---|---|---|---|---|---|
| 01 | InternVL3-78B | Shanghai AI Lab | MMBench | accuracy | 90.1% | 2025-01-22 | source ↗ | |
| 02 | InternVL3-78B | Shanghai AI Lab | MMMU | accuracy | 73.3% | 2025-01-22 |
§ 04 · Related papers
Other Multimodal papers tracked on Codesota.
- 2025-02-19 · 3 resultsQwen2.5-VL Technical Report
- 2025-01-15 · 1 resultGemini 2.0 Flash Technical Report
- 2024-10-25 · 4 resultsSWE-bench Verified
- 2024-10-22 · 1 resultClaude 3.5 Sonnet Model Card
- 2024-09-18 · 4 resultsQwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
- 2024-04-25 · 4 resultsInternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks