Codesota · Models · Qianfan-OCRBaidu Qianfan16 results · 4 benchmarks
Model card

Qianfan-OCR.

Baidu Qianfanopen-source4B paramsEnd-to-end VLM (4B params)Apache 2.03 current SOTA

Unified end-to-end document intelligence model. Highest overall score among end-to-end models on olmOCR-Bench (79.8).

§ 02 · Benchmarks

Every benchmark Qianfan-OCR has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01OmniDocBenchComputer Vision · Document Parsingformula-cdm92.4%#1/1source ↗
02olmOCR-BenchComputer Vision · Document Parsingmulti-column92.2%#1/4source ↗
03olmOCR-BenchComputer Vision · Document Parsingold-scans73.1%#1/5source ↗
04OmniDocBenchComputer Vision · Document Parsingtable-teds91.0%#2/4source ↗
05OmniDocBenchComputer Vision · Document Parsingaccuracy93.1%#2/13source ↗
06OmniDocBenchComputer Vision · Document Parsingtext-edit-distance0.0%#3/3source ↗
07OmniDocBenchComputer Vision · Document Parsingcomposite93.1%#3/34source ↗
08olmOCR-BenchComputer Vision · Document Parsingbase99.6%#3/4source ↗
09olmOCR-BenchComputer Vision · Document Parsinglong-tiny-text80.4%#4/4source ↗
10olmOCR-BenchComputer Vision · Document Parsingheaders-footers42.0%#4/4source ↗
11olmOCR-BenchComputer Vision · Document Parsingarxiv80.1%#5/5source ↗
12olmOCR-BenchComputer Vision · Document Parsingtables81.6%#5/5source ↗
13olmOCR-BenchComputer Vision · Document Parsingpass-rate79.8%#7/21source ↗
14olmOCR-BenchComputer Vision · Document Parsingaccuracy79.8%#10/18source ↗
15DocVQAComputer Vision · Document Understandinganls92.8%#11/21source ↗
16TextVQAMultimodal · Visual Question Answeringaccuracy80.0%#14/23source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where Qianfan-OCR actually performs.

Computer Vision
3
benchmarks
avg rank #4.1 · 3 SOTA
Multimodal
1
benchmark
avg rank #14.0
§ 04 · Papers

1 paper with results for Qianfan-OCR.

  1. 2026-03-11· 4 results

    Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

§ 06 · Sources & freshness

Where these numbers come from.

paper
8
results
Hugging Face
4
results
pwc-dump
4
results
0 of 16 rows marked verified.