Human Action to Robot Learning
A large-scale synthetic pose estimation dataset of more than 1.4M photorealistic images rendered in Unreal Engine. It features 100 actor models performing 1,400 movements sourced from real humans. Every image comes with pixel-perfect ground truth: COCO-17 body keypoints, MediaPipe hand keypoints, metric depth maps, and instance segmentation masks.
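For readers unfamiliar with the COCO-17 body keypoint layout mentioned above, here is a minimal Python sketch of how such keypoints are conventionally decoded. It assumes the standard COCO annotation convention of a flat list of 17 `(x, y, visibility)` triplets; the dataset's actual file layout is not described here, so the function name `decode_coco17` and the input shape are illustrative assumptions, not the dataset's API.

```python
from typing import Dict, List, Tuple

# Standard COCO-17 body keypoint order, as defined by the COCO keypoint format.
COCO17_NAMES: List[str] = [
    "nose", "left_eye", "right_eye", "left_ear", "right_ear",
    "left_shoulder", "right_shoulder", "left_elbow", "right_elbow",
    "left_wrist", "right_wrist", "left_hip", "right_hip",
    "left_knee", "right_knee", "left_ankle", "right_ankle",
]


def decode_coco17(flat: List[float]) -> Dict[str, Tuple[float, float, int]]:
    """Map a flat COCO-style keypoint list to {name: (x, y, visibility)}.

    Hypothetical helper: assumes `flat` holds 17 consecutive (x, y, v)
    triplets, which is the usual COCO convention but is not confirmed
    for this dataset.
    """
    expected = 3 * len(COCO17_NAMES)
    if len(flat) != expected:
        raise ValueError(f"expected {expected} values, got {len(flat)}")
    return {
        name: (flat[3 * i], flat[3 * i + 1], int(flat[3 * i + 2]))
        for i, name in enumerate(COCO17_NAMES)
    }
```

In the COCO convention the visibility flag is 0 for unlabeled, 1 for labeled but occluded, and 2 for labeled and visible. Whether this dataset stores the same flag is an assumption to verify against its own documentation.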
More datasets coming soon!