Codesota · Tasks · Keypoint DetectionHome/Tasks/Computer Vision/Keypoint Detection

Computer Vision· keypoint-detection

Keypoint Detection.

Keypoint detection localizes specific anatomical or structural landmarks — body joints, facial features, hand articulations — enabling pose estimation, gesture recognition, and motion capture. OpenPose (2017) first demonstrated real-time multi-person pose estimation, and the field has since progressed through HRNet, ViTPose, and RTMPose pushing both accuracy and speed. Modern systems detect 133 whole-body keypoints (body + hands + face) in real-time on mobile devices. The applications span from sports biomechanics (analyzing an athlete's form frame-by-frame) to sign language recognition and AR avatar puppeteering.

2

Datasets

1

Results

map

Canonical metric

§ 02 · Canonical benchmark

The reference dataset.

COCO Keypoints

Human pose estimation on COCO with 17 body keypoints

Primary metric: map

View full leaderboard →

§ 03 · Top 10

Leading models.

Leading models on COCO Keypoints.

#	Model	map	Year	Source
★	ViTPose-G	80.9	2022	paper ↗

What were you looking for on Keypoint Detection?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

2 datasets tracked for this task.

1 result · map

Top: ViTPose-G — 80.9

MPII Human Pose

0 results · accuracy

§ 05 · Related tasks

Other tasks in Computer Vision.

3D Understanding Depth estimation Document Image Classification Document Layout Analysis Document Parsing Document Understanding General OCR Capabilities Handwriting Recognition

Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Keypoint Detection? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.