Who leads the wost benchmark?

CLIP4STR-H (DFN-5B) currently leads wost with a score of 90.9 on 1 1 Accuracy.

What is the state-of-the-art score on wost?

The state-of-the-art result on wost is 90.9 (1 1 Accuracy), achieved by CLIP4STR-H (DFN-5B) as of 2023.

How many models are tracked on wost?

Codesota tracks 5 models on wost.

When was the wost leaderboard last updated?

The wost leaderboard on Codesota includes results through 2023, with the earliest tracked result from 2022.

Codesota · Benchmark · wostHome/Leaderboards/Vision & Documents/Scene Text Recognition/wost

Unknown

wost.

Name: wost Benchmark Results
Creator: Unknown
Published: 2022-01-01
License: https://creativecommons.org/licenses/by/4.0/

wost is a state-of-the-art machine learning benchmark indexed on Codesota. This page tracks published model results, top scores per metric, and the SOTA timeline for wost.

Paper ↗Leaderboard ↓

§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

Found a wrong score or missing run?

Use row edits to send a sourced correction into moderation.

Add / edit result ↗Report issue ↗

1 1 Accuracy

1 1 Accuracy is the reported evaluation metric for wost. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for 1 1 Accuracyverifiedpapervendorcommunityunverified

Muted rows were not state of the art when published — an earlier or same-year result already scored better.

Rank	Model	Trust	Score	Year	Links	Fix
01	CLIP4STR-H (DFN-5B) From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	verified	90.9	2023	Paper ↗Code ↗	Looks wrong?
02	CLIP4STR-L (DataComp-1B) From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	verified	90.6	2023	Paper ↗Code ↗	Looks wrong?
03	CLIP4STR-L From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	verified	88.8	2023	Paper ↗Code ↗	Looks wrong?
04	CLIP4STR-B From paper: CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model	verified	87	2023	Paper ↗Code ↗	Looks wrong?
05	CCD-ViT-Base From paper: Self-supervised Character-to-Character Distillation for Text Recognition	verified	86	2022	Paper ↗Code ↗	Looks wrong?

§ 04 · Submit a result

Add to the leaderboard.

← Back to Scene Text Recognition