Codesota · Benchmark · wost

wost.

wost is a scene text recognition benchmark indexed on Codesota. This page tracks published model results, top scores per metric, and the SOTA timeline for wost.

§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

1 1 Accuracy

1 1 Accuracy is the reported evaluation metric for wost. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Higher is better

Trust tiers for 1 1 Accuracy: verified, paper, vendor, community, unverified
Rank | Model | From paper | Trust | Score | Year
01 | CLIP4STR-H (DFN-5B) | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | verified | 90.9 | 2023
02 | CLIP4STR-L (DataComp-1B) | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | verified | 90.6 | 2023
03 | CLIP4STR-L | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | verified | 88.8 | 2023
04 | CLIP4STR-B | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | verified | 87 | 2023
05 | CCD-ViT-Base | Self-supervised Character-to-Character Distillation for Text Recognition | verified | 86 | 2022
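
For readers who want to work with these numbers offline, the ranking rule stated above (higher is better) can be sketched in a few lines of Python. This is only an illustrative sketch, not Codesota's implementation: the `Result` class and the `rank` and `best_per_year` helpers are hypothetical names, and taking the best score per year is an assumed reading of what a SOTA timeline shows. The rows are copied from the leaderboard entries listed above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    """One leaderboard row (hypothetical structure, for illustration only)."""
    model: str
    score: float  # accuracy as listed on this page
    year: int


# Rows copied from the leaderboard entries on this page, deliberately unsorted.
RESULTS = [
    Result("CCD-ViT-Base", 86.0, 2022),
    Result("CLIP4STR-B", 87.0, 2023),
    Result("CLIP4STR-L", 88.8, 2023),
    Result("CLIP4STR-L (DataComp-1B)", 90.6, 2023),
    Result("CLIP4STR-H (DFN-5B)", 90.9, 2023),
]


def rank(results):
    """Order results best-first, since a higher score is better."""
    return sorted(results, key=lambda r: r.score, reverse=True)


def best_per_year(results):
    """Keep the top-scoring result for each year (assumed SOTA-timeline rule)."""
    best = {}
    for r in results:
        if r.year not in best or r.score > best[r.year].score:
            best[r.year] = r
    return dict(sorted(best.items()))


if __name__ == "__main__":
    for position, r in enumerate(rank(RESULTS), start=1):
        print(f"{position:02d} {r.model} {r.score} {r.year}")
```

Sorting on the score alone is enough here because no two listed entries share a score; a real leaderboard would need an explicit tie-breaking rule, which this page does not specify.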