Codesota · Models · Qianfan-OCRBaidu Qianfan12 results · 2 benchmarks
Model card

Qianfan-OCR.

Baidu Qianfanopen-source4B paramsEnd-to-end VLM (4B params)Apache 2.03 current SOTA

Unified end-to-end document intelligence model. Highest overall score among end-to-end models on olmOCR-Bench (79.8).

§ 01 · Benchmarks

Every benchmark Qianfan-OCR has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01OmniDocBenchComputer Vision · Document Parsingformula-cdm92.4%#1/1source ↗
02olmOCR-BenchComputer Vision · Document Parsingmulti-column92.2%#1/4source ↗
03olmOCR-BenchComputer Vision · Document Parsingold-scans73.1%#1/5source ↗
04OmniDocBenchComputer Vision · Document Parsingtable-teds91.0%#2/4source ↗
05OmniDocBenchComputer Vision · Document Parsingcomposite93.1%#3/33source ↗
06OmniDocBenchComputer Vision · Document Parsingtext-edit-distance0.0%#3/3source ↗
07olmOCR-BenchComputer Vision · Document Parsingbase99.6%#3/4source ↗
08olmOCR-BenchComputer Vision · Document Parsinglong-tiny-text80.4%#4/4source ↗
09olmOCR-BenchComputer Vision · Document Parsingheaders-footers42.0%#4/4source ↗
10olmOCR-BenchComputer Vision · Document Parsingarxiv80.1%#5/5source ↗
11olmOCR-BenchComputer Vision · Document Parsingtables81.6%#5/5source ↗
12olmOCR-BenchComputer Vision · Document Parsingpass-rate79.8%#7/21source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where Qianfan-OCR actually performs.

Computer Vision
2
benchmarks
avg rank #3.3 · 3 SOTA
§ 05 · Sources & freshness

Where these numbers come from.

paper
8
results
Hugging Face
4
results
0 of 12 rows marked verified.