Codesota · Models · PaddleOCR-VLBaidu3 results · 2 benchmarks
Model card

PaddleOCR-VL.

Baiduopen-source0.9B-7B paramsVision-Language ModelApache 2.01 current SOTA

#1 on OmniDocBench

§ 01 · Benchmarks

Every benchmark PaddleOCR-VL has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01OmniDocBenchComputer Vision · Document Parsingtable-teds93.5%#1/4source ↗
02OmniDocBenchComputer Vision · Document Parsingcomposite92.9%#4/33source ↗
03olmOCR-BenchComputer Vision · Document Parsingpass-rate80.0%#6/21source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where PaddleOCR-VL actually performs.

Computer Vision
2
benchmarks
avg rank #3.7 · 1 SOTA
§ 04 · Related models

Other Baidu models scored on Codesota.

ERNIE 5.0
1 result · 1 SOTA
PP-StructureV3
1 result
PaddleOCR
1 result
PaddleOCR-VL 0.9B
0.9B params · 1 result
PaddleOCR-VL-1.5
1 result
RT-DETRv2-X
Unknown params · 1 result
ERNIE 3.0
0 results
§ 05 · Sources & freshness

Where these numbers come from.

alphaxiv-leaderboard
2
results
AlphaXiv
1
result
0 of 3 rows marked verified.