Abstract: This paper proposes an autonomous hand pose estimation method for in-the-wild datasets by cascading a pretrained YOLO image segmentation module with Keypoint Transformer. Precise ...