Optical Character Recognition2020en

mldoc-zero-shot-english-to-chinese

Dataset from Papers With Code

Metrics:accuracy, cer, wer, f1
Current State of the Art

XLMft UDA

Unknown

93.32

accuracy

accuracy Progress Over Time

Showing 2 breakthroughs from May 2018 to Sep 2019

72.978.484.089.695.2May 2018Sep 2019accuracyDate

Key Milestones

May 2018
MultiCCA + CNN

From paper: A Corpus for Multilingual Document Classification in Eight Languages

74.7
Sep 2019
XLMft UDACurrent SOTA

From paper: Bridging the domain gap in cross-lingual document classification

93.3
+24.9%
Total Improvement
24.9%
Time Span
1y 4m
Breakthroughs
2
Current SOTA
93.3

Top Models Performance Comparison

Top 5 models ranked by accuracy

accuracy1XLMft UDA93.3100.0%2MultiFiT, pseudo82.588.4%3MultiCCA + CNN74.780.1%4BiLSTM (UN)72.077.1%5Massively Multilingual Se...71.977.1%0%25%50%75%100%% of best
Best Score
93.3
Top Model
XLMft UDA
Models Compared
5
Score Range
21.4

accuracyPrimary

Related Papers4

Other Optical Character Recognition Datasets