HuggingFace ↔ CodeSOTA

Every HuggingFace pipeline task mapped to CodeSOTA benchmarks. Find which benchmarks evaluate models for any HF task.

52HF tasks
52Mapped
52With benchmarks
59CodeSOTA-only

Multimodal

Computer Vision

Natural Language Processing

Audio

Tabular

Reinforcement Learning

HF Pipeline Tag
CodeSOTA Task
Area
Benchmarks
Results

Other

HF Pipeline Tag
CodeSOTA Task
Area
Benchmarks
Results

CodeSOTA-only tasks

Tasks tracked by CodeSOTA that don't have a direct HuggingFace pipeline equivalent.

Agentic AI

HCAST1 benchmarks6 results
RE-Bench1 benchmarks5 results
SWE-bench1 benchmarks15 results
Time Horizon1 benchmarks5 results
Web & Desktop Agents2 benchmarks11 results

Computer Code

Bug Detection1 benchmarks6 results
Code Completion1 benchmarks6 results
Code Generation9 benchmarks112 results
Code Translation1 benchmarks7 results
Program Repair1 benchmarks5 results

Computer Vision

Document Image Classification7 benchmarks54 results
Document Layout Analysis5 benchmarks126 results
Document Parsing2 benchmarks56 results
General OCR Capabilities4 benchmarks50 results
Handwriting Recognition7 benchmarks38 results
Optical Character Recognition114 benchmarks696 results
Scene Text Detection11 benchmarks520 results
Scene Text Recognition11 benchmarks127 results
Table Recognition5 benchmarks38 results

Medical

Natural Language Processing

Reasoning

Arithmetic Reasoning2 benchmarks6 results
Commonsense Reasoning5 benchmarks45 results
Logical Reasoning4 benchmarks12 results
Mathematical Reasoning4 benchmarks62 results
Multi-step Reasoning4 benchmarks33 results

Reinforcement Learning

Continuous Control1 benchmarks9 results