Learn From VideoHumans

Human Action to Robot Learning

CURDIE3 →

A large-scale synthetic pose estimation dataset with 1.4M+ photorealistic images rendered in Unreal Engine. Features 100 actor models across 1,400 movements from real humans with pixel-perfect ground truth including COCO-17 body keypoints, MediaPipe hand keypoints, metric depth maps, and instance segmentation masks.

Apache 2.0 1.4M Images 100 Actors 3 Environments 30 FPS

More datasets coming soon!