General
A broad category encompassing machine learning research and tasks that don't fit specifically into vision or language domains, including general ML methods, optimization, and cross-domain approaches.
11 tasks87 datasets8 results
Tasks & Benchmarks
Video-Language Models
19 benchmarks4 results
Coding Agents
7 benchmarks4 results
Embedding models
0 benchmarks0 results
General
1 benchmarks0 results
Omni models
2 benchmarks0 results
Reasoning
0 benchmarks0 results
Reinforcement Learning
0 benchmarks0 results
Retrieval
7 benchmarks0 results
Vision-Language Models
40 benchmarks0 results
World Models
0 benchmarks0 results
Computer Use Agents
11 benchmarks0 results
Show all datasets and SOTA results
Video-Language Models
Coding Agents
87.8(Pass@1)Qwen2.5-Plus
55.5(Pass@1)Qwen2.5-72B-Instruct
88.2(Pass@1)Qwen2.5-72B-Instruct
77(Pass@1)Qwen2.5-Plus
Embedding models
No datasets indexed yet. Contribute on GitHub
General
Omni models
Reasoning
No datasets indexed yet. Contribute on GitHub
Reinforcement Learning
No datasets indexed yet. Contribute on GitHub
Retrieval
Vision-Language Models
RefCOCO2016
World Models
No datasets indexed yet. Contribute on GitHub
Get notified when these results update
New models drop weekly. We track them so you don't have to.