Recent studyBlind TTS Elo is live. Compare two anonymous voice samples, vote after listening, and help separate real preference signal from noise.Vote in the study ->
Codesota · Models · Kimi-VL-A3B-Instruct5 results · 5 benchmarks
Model card

Kimi-VL-A3B-Instruct.

unknown
§ 02 · Benchmarks

Every benchmark Kimi-VL-A3B-Instruct has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01MMStarMultimodal · Image-Text-to-Textaccuracy61.3%#19/21source ↗
02RealWorldQAMultimodal · Visual Question Answeringaccuracy68.1%#19/23source ↗
03Video-MMEMultimodal · Video Understandingaccuracy67.8%#20/24source ↗
04MMMUMultimodal · Image-Text-to-Textaccuracy57.0%#26/36source ↗
05OSWorldAgentic AI · Web & Desktop Agentssuccess-rate8.2%#27/28source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where Kimi-VL-A3B-Instruct actually performs.

Multimodal
4
benchmarks
avg rank #21.0
Agentic AI
1
benchmark
avg rank #27.0
§ 04 · Papers

1 paper with results for Kimi-VL-A3B-Instruct.

  1. 2025-04-10· 5 results

    Kimi-VL Technical Report

§ 06 · Sources & freshness

Where these numbers come from.

pwc-dump
5
results
0 of 5 rows marked verified.