Codesota · Models · PaddleOCR-VLBaidu5 results · 2 benchmarks
Model card

PaddleOCR-VL.

Baiduopen-source0.9B-7B paramsVision-Language ModelApache 2.01 current SOTA

#1 on OmniDocBench

§ 02 · Benchmarks

Every benchmark PaddleOCR-VL has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01OmniDocBenchComputer Vision · Document Parsingtable-teds93.5%#1/4source ↗
02OmniDocBenchComputer Vision · Document Parsingaccuracy92.6%#4/13source ↗
03OmniDocBenchComputer Vision · Document Parsingcomposite92.9%#4/34source ↗
04olmOCR-BenchComputer Vision · Document Parsingpass-rate80.0%#6/21source ↗
05olmOCR-BenchComputer Vision · Document Parsingaccuracy80.0%#9/18source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where PaddleOCR-VL actually performs.

Computer Vision
2
benchmarks
avg rank #4.8 · 1 SOTA
§ 04 · Papers

1 paper with results for PaddleOCR-VL.

  1. 2025-10-16· 2 results

    PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

§ 05 · Related models

Other Baidu models scored on Codesota.

ERNIE 5.0
1 result · 1 SOTA
PP-StructureV3
1 result
PaddleOCR
1 result
PaddleOCR-VL 0.9B
0.9B params · 1 result
PaddleOCR-VL-1.5
1 result
RT-DETRv2-X
Unknown params · 1 result
ERNIE 3.0
0 results
§ 06 · Sources & freshness

Where these numbers come from.

alphaxiv-leaderboard
2
results
pwc-dump
2
results
AlphaXiv
1
result
0 of 5 rows marked verified.