Handwriting Recognition | 1999 | en
IAM Handwriting Database
13,353 handwritten text lines from 657 writers. Standard handwriting benchmark.
Current State of the Art
DTrOCR 105M
Unknown
2.38
cer
cer Progress Over Time
Showing 5 breakthroughs from Sep 2018 to Aug 2023
Key Milestones
Sep 2018
Start, Follow, Read
From paper: Start, Follow, Read: End-to-End Full-Page Handwriting Recognition
6.4
May 2020
Transformer w/ CNN (+synth)
From paper: Pay Attention to What You Read: Non-recurrent Handwritten Text-Line Recognition
4.7
-27.0%
Dec 2020
VAN
From paper: End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network
4.3
-7.5%
Apr 2021
Self-Attention + CTC + language model
From paper: Rethinking Text Line Recognition Models
2.8
-36.3%
Aug 2023
DTrOCR 105M (Current SOTA)
From paper: DTrOCR: Decoder-only Transformer for Optical Character Recognition
2.4
-13.5%
Total Improvement
62.8%
Time Span
5y
Breakthroughs
5
Current SOTA
2.4
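The step-by-step percentages and the total improvement above can be reproduced from the milestone scores. The sketch below is a reading of the page, not its actual code: it assumes each percentage is the relative change in cer from the previous milestone, and it uses the unrounded scores from the ranked cer table (4.67, 4.32, 2.75, 2.38) rather than the rounded values shown beside each milestone, since only the unrounded scores reproduce the displayed figures. The names `milestones` and `relative_change` are illustrative.

```python
# Milestone cer values (percent), using the unrounded scores from the ranked table.
milestones = [
    ("Sep 2018", "Start, Follow, Read", 6.4),
    ("May 2020", "Transformer w/ CNN (+synth)", 4.67),
    ("Dec 2020", "VAN", 4.32),
    ("Apr 2021", "Self-Attention + CTC + language model", 2.75),
    ("Aug 2023", "DTrOCR 105M", 2.38),
]


def relative_change(previous: float, current: float) -> float:
    """Percent change from previous to current; negative means lower error."""
    return (current - previous) / previous * 100.0


# Change at each milestone relative to the one before it.
step_changes = [
    round(relative_change(prev[2], cur[2]), 1)
    for prev, cur in zip(milestones, milestones[1:])
]

# Overall reduction from the first milestone to the current best score.
total_improvement = round(-relative_change(milestones[0][2], milestones[-1][2]), 1)

for (date, name, _), change in zip(milestones[1:], step_changes):
    print(f"{date}  {name}: {change:+.1f}%")
print(f"Total improvement: {total_improvement:.1f}%")
```

Under these assumptions the computed values agree with the figures listed for each milestone and with the 62.8% total.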
Top Models Performance Comparison
Top 10 models ranked by cer (lower is better)
Best Score
2.4
Top Model
DTrOCR 105M
Models Compared
10
Score Range
3.8
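The summary figures above appear to cover only the ten best cer entries, while the ranked table lists 17. As a rough check, the sketch below recomputes them from the table scores, assuming "Score Range" means the gap between the best and the tenth-best cer. The variable names are illustrative.

```python
# cer scores (percent) copied from the ranked table on this page.
cer_table = [
    ("DTrOCR 105M", 2.38),
    ("Self-Attention + CTC + language model", 2.75),
    ("TrOCR-large 558M", 2.89),
    ("Transformer + CNN", 2.96),
    ("TrOCR-base 334M", 3.42),
    ("TrOCR-small 62M", 4.22),
    ("VAN", 4.32),
    ("Transformer w/ CNN (+synth)", 4.67),
    ("HTR-VT(line-level)", 4.7),
    ("Easter2.0", 6.21),
    ("FPHR+Aug Paragraph Level (~145 dpi)", 6.3),
    ("Start, Follow, Read", 6.4),
    ("Decouple Attention Network", 6.4),
    ("FPHR+Aug Line Level (~145 dpi)", 6.5),
    ("Leaky LP Cell", 6.6),
    ("FPHR Paragraph Level (~145 dpi)", 6.7),
    ("Transformer w/ CNN", 7.62),
]

# Lower cer is better, so sort ascending and keep the first ten rows.
top10 = sorted(cer_table, key=lambda row: row[1])[:10]

best_model, best_score = top10[0]
score_range = round(top10[-1][1] - top10[0][1], 1)

print(f"Top model: {best_model} ({best_score})")
print(f"Models compared: {len(top10)}")
print(f"Score range: {score_range}")
```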
cer (Primary)
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | DTrOCR 105M | 2.38 | | Aug 2023 |
| 2 | Self-Attention + CTC + language model | 2.75 | | Apr 2021 |
| 3 | TrOCR-large 558M | 2.89 | | Sep 2021 |
| 4 | Transformer + CNN | 2.96 | | Apr 2021 |
| 5 | TrOCR-base 334M | 3.42 | | Sep 2021 |
| 6 | TrOCR-small 62M | 4.22 | | Sep 2021 |
| 7 | VAN | 4.32 | | Dec 2020 |
| 8 | Transformer w/ CNN (+synth) | 4.67 | | May 2020 |
| 9 | HTR-VT(line-level) | 4.7 | | Sep 2024 |
| 10 | Easter2.0 | 6.21 | | May 2022 |
| 11 | FPHR+Aug Paragraph Level (~145 dpi) | 6.3 | | Mar 2021 |
| 12 | Start, Follow, Read | 6.4 | Start, Follow, Read: End-to-End Full-Page Handwriting Recognition (Code) | Sep 2018 |
| 13 | Decouple Attention Network | 6.4 | | Dec 2019 |
| 14 | FPHR+Aug Line Level (~145 dpi) | 6.5 | | Mar 2021 |
| 15 | Leaky LP Cell | 6.6 | | Feb 2019 |
| 16 | FPHR Paragraph Level (~145 dpi) | 6.7 | | Mar 2021 |
| 17 | Transformer w/ CNN | 7.62 | | May 2020 |
wer
| # | Model | Score | Paper / Code | Date |
|---|---|---|---|---|
| 1 | HTR-VT(line-level) | 14.9 | | Sep 2024 |
| 2 | Leaky LP Cell | 15.9 | | Feb 2019 |
| 3 | VAN | 16.24 | | Dec 2020 |
| 4 | Decouple Attention Network | 19.6 | | Dec 2019 |
| 5 | Start, Follow, Read | 23.2 | Start, Follow, Read: End-to-End Full-Page Handwriting Recognition (Code) | Sep 2018 |
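The cer and wer scores in the tables above are character and word error rates, both reported as percentages where lower is better. The page does not say how each paper computed them, so the following is only a minimal sketch of the usual definition: Levenshtein edit distance between the recognized text and the reference, divided by the reference length. Text normalization (case, punctuation, whitespace) and whether scores are averaged per line or pooled over the whole test set vary between papers and are not modeled here. The function names are illustrative.

```python
from typing import Sequence


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance: minimum insertions, deletions, and substitutions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            cur.append(min(prev[j] + 1,          # delete r from the reference
                           cur[j - 1] + 1,       # insert h into the reference
                           prev[j - 1] + cost))  # match or substitute
        prev = cur
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate in percent, normalized by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return 100.0 * edit_distance(reference, hypothesis) / len(reference)


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate in percent, using whitespace tokenization."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return 100.0 * edit_distance(ref_words, hypothesis.split()) / len(ref_words)


reference = "the quick brown fox"
hypothesis = "the quick brwn fox"
print(f"cer = {cer(reference, hypothesis):.2f}")
print(f"wer = {wer(reference, hypothesis):.2f}")
```

A single dropped character in this example costs one of 19 characters but one of 4 words, which illustrates why wer scores in the table are several times larger than cer scores for the same model.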
Related Papers (10)
HTR-VT: Handwritten Text Recognition with Vision Transformer
Sep 2024 | Models: HTR-VT(line-level)
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Aug 2023 | Models: DTrOCR 105M
Easter2.0: Improving convolutional models for handwritten text recognition
May 2022 | Models: Easter2.0
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Sep 2021 | Models: TrOCR-small 62M, TrOCR-base 334M, TrOCR-large 558M
Rethinking Text Line Recognition Models
Apr 2021 | Models: Transformer + CNN, Self-Attention + CTC + language model
Full Page Handwriting Recognition via Image to Sequence Extraction
Mar 2021 | Models: FPHR Paragraph Level (~145 dpi), FPHR+Aug Line Level (~145 dpi), FPHR+Aug Paragraph Level (~145 dpi)
Pay Attention to What You Read: Non-recurrent Handwritten Text-Line Recognition
May 2020 | Models: Transformer w/ CNN, Transformer w/ CNN (+synth)
Decoupled Attention Network for Text Recognition
Dec 2019 | Models: Decouple Attention Network
No Padding Please: Efficient Neural Handwriting Recognition
Feb 2019 | Models: Leaky LP Cell