Who leads the svtp benchmark?

DTrOCR 105M currently leads svtp with a score of 98.6 on Accuracy.

What is the state-of-the-art score on svtp?

The state-of-the-art result on svtp is 98.6 (Accuracy), achieved by DTrOCR 105M as of 2023.

How many models are tracked on svtp?

Codesota tracks 19 models on svtp.

When was the svtp leaderboard last updated?

The svtp leaderboard on Codesota includes results through 2023, with the earliest tracked result from 2021.

Codesota · Benchmark · svtpHome/Leaderboards/Vision & Documents/Scene Text Recognition/svtp

Unknown

svtp.

Name: svtp Benchmark Results
Creator: Unknown
Published: 2021-01-01
License: https://creativecommons.org/licenses/by/4.0/

svtp is a state-of-the-art machine learning benchmark indexed on Codesota. This page tracks published model results, top scores per metric, and the SOTA timeline for svtp.

Paper ↗Leaderboard ↓

§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

Found a wrong score or missing run?

Use row edits to send a sourced correction into moderation.

Add / edit result ↗Report issue ↗

Accuracy

Accuracy is the reported evaluation metric for svtp. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for Accuracyverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	DTrOCR 105M From paper: DTrOCR: Decoder-only Transformer for Optical Character Recognition	verified	98.6	2023	Paper ↗Code ↗	Looks wrong?
02	MGP-STR From paper: Multi-Granularity Prediction for Scene Text Recognition	verified	98.3	2022	Paper ↗Code ↗	Looks wrong?
03	CLIP4STR-L (DataComp-1B) From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	verified	98.1	2023	Paper ↗Code ↗	Looks wrong?
04	CLIP4STR-L From paper: An Empirical Study of Scaling Law for OCR	verified	97.4	2023	Paper ↗Code ↗Source ↗	Looks wrong?
05	CLIP4STR-B From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	verified	97.2	2023	Paper ↗Code ↗	Looks wrong?
06	PARSeq Lowercase alphanum eval. ECCV 2022.	verified	96.9	2022	Paper ↗	Looks wrong?
07	CPPD From paper: Context Perception Parallel Decoder for Scene Text Recognition	verified	96.7	2023	Paper ↗Code ↗	Looks wrong?
08	CCD-ViT-Base From paper: Self-supervised Character-to-Character Distillation for Text Recognition	verified	96.1	2022	Paper ↗Code ↗	Looks wrong?
09	CCD-ViT-Small From paper: Self-supervised Character-to-Character Distillation for Text Recognition	verified	92.7	2022	Paper ↗Code ↗	Looks wrong?
10	CCD-ViT-Tiny From paper: Self-supervised Character-to-Character Distillation for Text Recognition	verified	91.6	2022	Paper ↗Code ↗	Looks wrong?
11	MATRN From paper: Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features	verified	90.6	2021	Paper ↗Code ↗	Looks wrong?
12	S-GTR From paper: Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition	verified	90.6	2021	Paper ↗Code ↗	Looks wrong?
13	SIGA_T From paper: Self-supervised Implicit Glyph Attention for Text Recognition	verified	90.5	2022	Paper ↗Code ↗	Looks wrong?
14	CDistNet (Ours) From paper: CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition	verified	89.77	2021	Paper ↗Code ↗	Looks wrong?
15	ABINet-LV ABINet Language-Vision variant. CVPR 2021.	verified	89.5	2021	Paper ↗	Looks wrong?
16	DiffusionSTR From paper: DiffusionSTR: Diffusion Model for Scene Text Recognition	verified	89.2	2023	Paper ↗	Looks wrong?
17	DPAN From paper: Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition	verified	89	2021	Paper ↗Code ↗	Looks wrong?
18	TrOCR-large 558M TrOCR-large, Syn+Benchmark training. Table 6. AAAI 2023.	verified	88.1	2021	Paper ↗	Looks wrong?
19	TrOCR-base 334M TrOCR-base, Syn+Benchmark training. Table 6. AAAI 2023.	verified	86.9	2021	Paper ↗	Looks wrong?

§ 04 · Submit a result

Add to the leaderboard.

← Back to Scene Text Recognition