Who leads the icdar2015 benchmark?

JSTR currently leads icdar2015 with a score of 98.70 on accuracy.

What is the state-of-the-art score on icdar2015?

The state-of-the-art result on icdar2015 is 98.70 (accuracy), achieved by JSTR as of 2025.

How many models are tracked on icdar2015?

Codesota tracks 30 models on icdar2015 across 2 metrics.

When was the icdar2015 leaderboard last updated?

The icdar2015 leaderboard on Codesota includes results through 2025, with the earliest tracked result from 2017.

Codesota · Computer Vision · Optical Character Recognition · icdar2015Tasks/Computer Vision/Optical Character Recognition

Optical Character Recognition · benchmark dataset · 2020 · EN

icdar2015.

Name: icdar2015 Benchmark Results
Creator: Codesota
Published: 2017-01-01
License: https://creativecommons.org/licenses/by/4.0/

Dataset from Papers With Code

Submit a result ↵

§ 01 · Leaderboard

Best published scores.

30 results indexed across 2 metrics. Shaded row marks current SOTA; ties broken by submission date.

Primary: accuracy · higher is better
All metrics: accuracy, f-measure

accuracy· primary

29 rows

#	Model	Org	Submitted	Paper / code	accuracy
01	JSTROpen	Fujitake	Apr 2024	JSTR: Judgment Improves Scene Text Recognition	98.70
02	TextBlockV2 (GPT-2)Open	Jiahao Lyu et al., Fudan University	Mar 2024	TextBlockV2: Towards Precise-Detection-Free Scene Text S…	97.70
03	DTrOCR 105M	—	Aug 2023	DTrOCR: Decoder-only Transformer for Optical Character R… · code	93.50
04	CPPD	—	Jul 2023	Context Perception Parallel Decoder for Scene Text Recog… · code	91.70
05	CLIP4STR-L (DataComp-1B)	—	May 2023	CLIP4STR: A Simple Baseline for Scene Text Recognition w… · code	91.40
06	MGP-STR	—	Sep 2022	Multi-Granularity Prediction for Scene Text Recognition · code	90.90
07	CLIP4STR-L	—	May 2023	CLIP4STR: A Simple Baseline for Scene Text Recognition w… · code	90.80
08	CLIP4STR-B	Research	May 2023	CLIP4STR: A Simple Baseline for Scene Text Recognition w… · code	90.60
09	OTSNetOpen	Anonymous / arxiv preprint	Nov 2025	OTSNet: A Unified Observation-Thinking-Spelling Network …	90.20
10	IGTR-AROpen	Yongkun Du et al.	Jan 2024	Instruction-Guided Scene Text Recognition	89.80
11	SIGA_S	—	Mar 2022	Self-supervised Implicit Glyph Attention for Text Recogn… · code	87.60
12	S-GTR	—	Dec 2021	Visual Semantics Allow for Textual Reasoning Better in S… · code	87.30
13	MATRN	Research	Nov 2021	Multi-modal Text Recognition Networks: Interactive Enhan… · code	86.60
14	CDistNet (Ours)	—	Nov 2021	CDistNet: Perceiving Multi-Domain Character Distance for… · code	86.25
15	DiffusionSTR	—	Jun 2023	DiffusionSTR: Diffusion Model for Scene Text Recognition	86
16	DPAN	—	Aug 2021	papers-with-code · code	85.50
17	RCEED	—	Jun 2021	Representation and Correlation Enhanced Encoder-Decoder … · code	82.20
18	CSTR	—	Feb 2021	Revisiting Classification Perspective on Scene Text Reco… · code	81.60
19	Yet Another Text Recognizer	—	Jul 2021	Why You Should Try the Real Data for the Scene Text Reco… · code	80.20
20	SEED	—	May 2020	SEED: Semantics Enhanced Encoder-Decoder Framework for S… · code	80
21	TextScanner	—	Dec 2019	TextScanner: Reading Characters in Order for Robust Scen…	79.40
22	SATRN	—	Oct 2019	On Recognizing Texts of Arbitrary Shapes with 2D Self-At… · code	79
23	SAFL	—	Jan 2022	SAFL: A Self-Attention Scene Text Recognizer with Focal … · code	77.50
24	ASTER	—	Jun 2018	papers-with-code · code	76.10
25	DAN	—	Dec 2019	Decoupled Attention Network for Text Recognition · code	74.50
26	AON	—	Nov 2017	AON: Towards Arbitrarily-Oriented Text Recognition · code	73
27	ViTSTR	—	May 2021	Vision Transformer for Fast and Efficient Scene Text Rec… · code	72.60
28	Baek et al.	—	Apr 2019	What Is Wrong With Scene Text Recognition Model Comparis… · code	71.80
29	SAR	—	Nov 2018	Show, Attend and Read: A Simple and Strong Baseline for … · code	69.20

f-measure

1 row

#	Model	Org	Submitted	Paper / code	f-measure
01	DAL	—	Dec 2020	Dynamic Anchor Learning for Arbitrary-Oriented Object De… · code	82.40

Fig 2 · Rows sorted by score within each metric. Shaded row marks SOTA. Dates reflect model or paper release where available, otherwise the date Codesota accessed the source.

§ 03 · Progress

18 steps
of state of the art.

Each row below marks a model that broke the previous record on accuracy. Intermediate submissions are kept in the leaderboard above; only SOTA-setting entries are re-listed here.

Higher scores win. Each subsequent entry improved upon the previous best.

SOTA line · accuracy

Nov 12, 2017AON73
Jun 25, 2018ASTER76.10
Oct 10, 2019SATRN79
Dec 28, 2019TextScanner79.40
May 22, 2020SEED80
Feb 22, 2021CSTR81.60
Jun 13, 2021RCEED82.20
Aug 1, 2021DPAN85.50
Nov 22, 2021CDistNet (Ours)86.25
Nov 30, 2021MATRNResearch86.60
Dec 24, 2021S-GTR87.30
Mar 7, 2022SIGA_S87.60
Sep 8, 2022MGP-STR90.90
May 23, 2023CLIP4STR-L (DataComp-1B)91.40
Jul 23, 2023CPPD91.70
Aug 30, 2023DTrOCR 105M93.50
Mar 15, 2024TextBlockV2 (GPT-2)Jiahao Lyu et al., Fudan University97.70
Apr 9, 2024JSTRFujitake98.70

Fig 3 · SOTA-setting models only. 18 entries span Nov 2017 → Apr 2024.

§ 04 · Literature

26 papers
tied to this benchmark.

Every paper below corresponds to at least one row in the leaderboard above. Click through for the arXiv preprint and, when available, the reference implementation.

OTSNet: A Unified Observation-Thinking-Spelling Network for Scene Text Recognition
Nov 2025·OTSNet
arXiv ↗
JSTR: Judgment Improves Scene Text Recognition
Apr 2024·JSTR
arXiv ↗
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Mar 2024·TextBlockV2 (GPT-2)
arXiv ↗
Instruction-Guided Scene Text Recognition
Jan 2024·IGTR-AR
arXiv ↗
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Aug 2023·DTrOCR 105M
arXiv ↗Code
Context Perception Parallel Decoder for Scene Text Recognition
Jul 2023·CPPD
arXiv ↗Code
DiffusionSTR: Diffusion Model for Scene Text Recognition
Jun 2023·DiffusionSTR
arXiv ↗
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
May 2023·CLIP4STR-L (DataComp-1B), CLIP4STR-L, CLIP4STR-B
arXiv ↗Code
Multi-Granularity Prediction for Scene Text Recognition
Sep 2022·MGP-STR
arXiv ↗Code
Self-supervised Implicit Glyph Attention for Text Recognition
Mar 2022·SIGA_S
arXiv ↗Code
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss
Jan 2022·SAFL
arXiv ↗Code
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
Dec 2021·S-GTR
arXiv ↗Code
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
Nov 2021·MATRN
arXiv ↗Code
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition
Nov 2021·CDistNet (Ours)
arXiv ↗Code
Why You Should Try the Real Data for the Scene Text Recognition
Jul 2021·Yet Another Text Recognizer
arXiv ↗Code
Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition
Jun 2021·RCEED
arXiv ↗Code
Vision Transformer for Fast and Efficient Scene Text Recognition
May 2021·ViTSTR
arXiv ↗Code
Revisiting Classification Perspective on Scene Text Recognition
Feb 2021·CSTR
arXiv ↗Code
Dynamic Anchor Learning for Arbitrary-Oriented Object Detection
Dec 2020·DAL
arXiv ↗Code
SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition
May 2020·SEED
arXiv ↗Code
TextScanner: Reading Characters in Order for Robust Scene Text Recognition
Dec 2019·TextScanner
arXiv ↗
Decoupled Attention Network for Text Recognition
Dec 2019·DAN
arXiv ↗Code
On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention
Oct 2019·SATRN
arXiv ↗Code
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis
Apr 2019·Baek et al.
arXiv ↗Code
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition
Nov 2018·SAR
arXiv ↗Code
AON: Towards Arbitrarily-Oriented Text Recognition
Nov 2017·AON
arXiv ↗Code

§ 06 · Contribute

Have a score that beats
this table?

Submit a checkpoint and a reproduction script. We will run it, publish the score, and — if it takes the top — annotate the step on the progress chart with your name.

Submit a result ↵Read submission guide

What a submission needs

01A public checkpoint or API endpoint
02A reproduction script with frozen commit + seed
03Declared evaluation environment (Python, deps)
04One row per metric declared by this dataset
05A contact so we can follow up on discrepancies

icdar2015.

Best published scores.

18 stepsof state of the art.

26 paperstied to this benchmark.

Neighbouring benchmarks.

Have a score that beatsthis table?

18 steps
of state of the art.

26 papers
tied to this benchmark.

Have a score that beats
this table?