Recent studyBlind TTS Elo is live. Compare two anonymous voice samples, vote after listening, and help separate real preference signal from noise.Vote in the study ->
Codesota · Tasks · Scene Text DetectionHome/Tasks/Computer Vision/Scene Text Detection

Scene Text Detection.

Detecting text regions in natural scene images

11
Datasets
581
Results
accuracy
Canonical metric
§ 02 · Canonical benchmark

The reference dataset.

coco-text

Dataset from Papers With Code

Primary metric: accuracy
View full leaderboard →
§ 03 · Top 10

Leading models.

Leading models on coco-text.

#Model1-1-accuracyYearSource
CLIP4STR-L81.92023paper ↗
2MGP-STR81.72022paper ↗
3CLIP4STR-B81.12023paper ↗
4TCM65.92026paper ↗
5PANet (Joint)64.52026paper ↗
6Corner-based Region Proposals63.32018paper ↗
7LRANet61.72026paper ↗
8DPText-DETR61.62026paper ↗
9TextBoxes++_MS60.92018paper ↗
10MAEDet60.62026paper ↗

What were you looking for on Scene Text Detection?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

11 datasets tracked for this task.

coco-text
CANONICAL
33 results · accuracy
Top: CLIP4STR-L 81.9
ICDAR 2015
188 results · f1
Top: TextFuseNet (ResNeXt-101) 94.0
Total-Text
126 results · f1
Top: FAST-T-448 153
msra-td500
79 results · accuracy
Top: FAST-T-512 137
icdar-2013
59 results · accuracy
Top: JSTR 99.2
icdar-2017-mlt
54 results · accuracy
Top: PMTD* 84.4
CTW1500
18 results · f1
Top: DBNet++ (ResNet-50) (1024) 88.5
ic19-art
11 results · accuracy
Top: CLIP4STR-L (DataComp-1B) 86.4
Union14M
8 results · accuracy
Top: CLIP4STR-B 70.8
ICDAR 2019 ArT
4 results · f1
Top: pil_maskrcnn 82.7
ic19-rects
1 result · accuracy
Top: BDN 93.4
§ 05 · Related tasks

Other tasks in Computer Vision.

Document Image ClassificationDocument Layout AnalysisDocument ParsingDocument UnderstandingGeneral OCR CapabilitiesHandwriting RecognitionImage Feature ExtractionImage-to-3D
Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Scene Text Detection? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.