Recent studyBlind TTS Elo is live. Compare two anonymous voice samples, vote after listening, and help separate real preference signal from noise.Vote in the study ->
Codesota · Benchmark · MME-VideoOCRHome/Leaderboards/MME-VideoOCR
NTU

MME-VideoOCR.

1,464 videos with 2,000 QA pairs across 25 tasks. Tests OCR capabilities in video content.

Paper Leaderboard
§ 01 · Leaderboard

Results by metric.

Found a wrong score or missing run?
Use row edits to send a sourced correction into moderation.
Add / edit result Report issue

Total Accuracy

Total Accuracy is the reported evaluation metric for MME-VideoOCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Total Accuracyverifiedpapervendorcommunityunverified
RankModelTrustScoreYearLinksEdit
01gemini-25-pro
1,464 videos, 2,000 QA pairs, 25 tasks
paper73.72025Source ↗Edit result
02Gemini 2.5 Pro
1,464 videos, 2,000 QA pairs, 25 tasks
unverified73.72025Source ↗Edit result
03Qwen2.5-VL 72Bunverified692025Source ↗Edit result
04qwen25-vl-72bpaper692025Source ↗Edit result
05internvl3-78bpaper67.22025Source ↗Edit result
06gpt-4opaper66.42025Source ↗Edit result
07gemini-15-propaper64.92025Source ↗Edit result
08Gemini 1.5 Prounverified64.92025Source ↗Edit result
09Qwen2.5-VL 32Bunverified612025Source ↗Edit result
10qwen25-vl-32bpaper612025Source ↗Edit result
§ 04 · Submit a result

Add to the leaderboard.

← Back to Leaderboards