CHURRO-DS

Stanford University

A benchmark of historical documents covering 46 languages and 99K pages. It tests handwritten and printed text recognition across diverse scripts.

Benchmark Stats

Models: 6
Papers: 8
Metrics: 2

Handwritten Score

Normalized Levenshtein Similarity on handwritten documents

Higher is better
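
As a rough illustration of the metric, the sketch below computes a normalized Levenshtein similarity between a predicted transcription and a reference. It assumes the common definition (one minus the edit distance divided by the longer string's length) scaled to 0-100 to match the scores listed on this page. The benchmark's exact normalization and any text preprocessing are not stated here, so treat the function names and the scaling as assumptions rather than the official scoring code.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions, and substitutions."""
    # Keep the shorter string on the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,         # deletion
                    current[j - 1] + 1,      # insertion
                    previous[j - 1] + cost,  # substitution or match
                )
            )
        previous = current
    return previous[-1]


def normalized_levenshtein_similarity(prediction: str, reference: str) -> float:
    """Similarity in [0, 100]; assumed normalization by the longer string."""
    longest = max(len(prediction), len(reference))
    if longest == 0:
        # Two empty strings are treated as a perfect match (an assumption).
        return 100.0
    return 100.0 * (1.0 - levenshtein(prediction, reference) / longest)
```

Under this definition, an exact transcription scores 100 and a transcription sharing no characters with an equally long reference scores 0, which is consistent with "higher is better".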

Rank  Model             Code  Score  Paper / Source
1     churro-3b               70.1   AlphaXiv
2     gemini-25-pro     -     63.6   AlphaXiv
3     gemini-25-flash   -     58.7   AlphaXiv
4     qwen25-vl-72b     HF    54.5   AlphaXiv
5     claude-sonnet-4   -     37.1   AlphaXiv
6     gpt-4o            -     34.2   AlphaXiv

churro-3b: Historical handwritten documents, 46 languages, 99K pages

Printed Score

Normalized Levenshtein Similarity on printed documents

Higher is better

Rank  Model             Code  Score  Paper / Source
1     churro-3b               82.3   AlphaXiv
2     gemini-25-pro     -     80.9   AlphaXiv

churro-3b: Historical printed documents