Codesota · Benchmark · MME-VideoOCR
NTU

MME-VideoOCR.

1,464 videos with 2,000 QA pairs across 25 tasks. Tests OCR capabilities in video content.

§ 01 · Leaderboard

Results by metric.


Total Accuracy

Total Accuracy is the reported evaluation metric for MME-VideoOCR. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better
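As a rough illustration of how an accuracy figure of this kind could be computed, the sketch below assumes Total Accuracy is the percentage of QA pairs answered correctly, pooled across all tasks. That definition is an assumption rather than something stated on this page, and `total_accuracy` and the example counts are hypothetical.

```python
def total_accuracy(correct_flags: list[bool]) -> float:
    """Percentage of QA pairs answered correctly, pooled over all tasks.

    Assumption: every QA pair counts equally, with no per-task weighting.
    """
    if not correct_flags:
        raise ValueError("need at least one QA pair")
    return 100.0 * sum(correct_flags) / len(correct_flags)


# Hypothetical run over 2,000 QA pairs, 1,474 of them answered correctly.
flags = [True] * 1474 + [False] * 526
accuracy = total_accuracy(flags)
print(round(accuracy, 1))
```

If the benchmark instead averages per-task accuracies, the result would differ whenever tasks have unequal numbers of QA pairs.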

Trust tiers for Total Accuracy: verified, paper, vendor, community, unverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.
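The muting rule above can be sketched as follows. This is a minimal illustration, not Codesota's actual implementation: the `Result` type and `is_muted` function are hypothetical names, ties are assumed to stay unmuted, and the 2024 row is invented to show the "earlier result" case.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    model: str
    score: float  # Total Accuracy, higher is better
    year: int     # publication year


def is_muted(row: Result, rows: list[Result]) -> bool:
    """True if an earlier or same-year result scored strictly higher."""
    return any(
        other.year <= row.year and other.score > row.score
        for other in rows
    )


rows = [
    Result("gemini-25-pro", 73.7, 2025),
    Result("qwen25-vl-72b", 69.0, 2025),
    Result("gpt-4o", 66.4, 2025),
    Result("hypothetical-2024-model", 60.0, 2024),  # not on the leaderboard
]

muted = {r.model: is_muted(r, rows) for r in rows}
print(muted)
```

Under these assumptions the top 2025 score stays unmuted, the lower 2025 scores are muted, and the hypothetical 2024 row stays unmuted because nothing published in or before 2024 beats it.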

| Rank | Model | Trust | Score | Year | Notes |
|------|-------|-------|-------|------|-------|
| 01 | gemini-25-pro | paper | 73.7 | 2025 | 1,464 videos, 2,000 QA pairs, 25 tasks |
| 02 | Gemini 2.5 Pro | unverified | 73.7 | 2025 | 1,464 videos, 2,000 QA pairs, 25 tasks |
| 03 | Qwen2.5-VL 72B | unverified | 69 | 2025 | |
| 04 | qwen25-vl-72b | paper | 69 | 2025 | |
| 05 | internvl3-78b | paper | 67.2 | 2025 | |
| 06 | gpt-4o | paper | 66.4 | 2025 | |
| 07 | gemini-15-pro | paper | 64.9 | 2025 | |
| 08 | Gemini 1.5 Pro | unverified | 64.9 | 2025 | |
| 09 | Qwen2.5-VL 32B | unverified | 61 | 2025 | |
| 10 | qwen25-vl-32b | paper | 61 | 2025 | |

§ 04 · Submit a result

Add to the leaderboard.
