Codesota · Models · HEADoC-Base2 results · 2 benchmarks
Model card

HEADoC-Base.

multimodal27.7M paramsTransformer

HEADoC: Highly Efficient and Accurate Document Classifier Optimized Using Semantic Distances. BASE variant. Hybrid deep attention mechanism fusing visual and textual modalities. DOI:10.1007/s13748-025-00411-x.

§ 01 · Benchmarks

Every benchmark HEADoC-Base has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01tobacco-3482Computer Vision · Document Image Classificationaccuracy96.0%#2/14source ↗
02rvl-cdipComputer Vision · Document Image Classificationaccuracy93.0%#24/35source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where HEADoC-Base actually performs.

Computer Vision
2
benchmarks
avg rank #13.0
§ 05 · Sources & freshness

Where these numbers come from.

HEADoC: Highly Efficient and Accurate Document Classifier Optimized Using Semantic Distances
1
result
springer
1
result
2 of 2 rows marked verified.