Recent studyBlind TTS Elo is live. Compare two anonymous voice samples, vote after listening, and help separate real preference signal from noise.Vote in the study ->
Codesota · Tasks · Keypoint DetectionHome/Tasks/Computer Vision/Keypoint Detection
Computer Vision· keypoint-detection

Keypoint Detection.

Keypoint detection localizes specific anatomical or structural landmarks — body joints, facial features, hand articulations — enabling pose estimation, gesture recognition, and motion capture. OpenPose (2017) first demonstrated real-time multi-person pose estimation, and the field has since progressed through HRNet, ViTPose, and RTMPose pushing both accuracy and speed. Modern systems detect 133 whole-body keypoints (body + hands + face) in real-time on mobile devices. The applications span from sports biomechanics (analyzing an athlete's form frame-by-frame) to sign language recognition and AR avatar puppeteering.

2
Datasets
1
Results
map
Canonical metric
§ 02 · Canonical benchmark

The reference dataset.

COCO Keypoints

Human pose estimation on COCO with 17 body keypoints

Primary metric: map
View full leaderboard →
§ 03 · Top 10

Leading models.

Leading models on COCO Keypoints.

#ModelmapYearSource
ViTPose-G80.92022paper ↗

What were you looking for on Keypoint Detection?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

2 datasets tracked for this task.

COCO Keypoints
CANONICAL
1 result · map
Top: ViTPose-G 80.9
MPII Human Pose
0 results · accuracy
§ 05 · Related tasks

Other tasks in Computer Vision.

Document Image ClassificationDocument Layout AnalysisDocument ParsingDocument UnderstandingGeneral OCR CapabilitiesHandwriting RecognitionImage Feature ExtractionImage-to-3D
Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Keypoint Detection? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.