Temporal-aware self-supervised learning for 3D hand pose and mesh estimation in videos

Liangjian Chen, Shih Yao Lin, Yusheng Xie, Yen-Yu Lin, Xiaohui Xie

研究成果: Conference contribution同行評審

22 引文 斯高帕斯(Scopus)

摘要

Estimating 3D hand pose directly from RGB images is challenging but has gained steady progress recently by training deep models with annotated 3D poses. However annotating 3D poses is difficult and as such only a few 3D hand pose datasets are available, all with limited sample sizes. In this study, we propose a new framework of training 3D pose estimation models from RGB images without using explicit 3D annotations, i.e., trained with only 2D information. Our framework is motivated by two observations: 1) Videos provide richer information for estimating 3D poses as opposed to static images; 2) Estimated 3D poses ought to be consistent whether the videos are viewed in the forward order or reverse order. We leverage these two observations to develop a self-supervised learning model called temporal-aware self-supervised network (TASSN). By enforcing temporal consistency constraints, TASSN learns 3D hand poses and meshes from videos with only 2D keypoint position annotations. Experiments show that our model achieves surprisingly good results, with 3D estimation accuracy on par with the state-of-the-art models trained with 3D annotations, highlighting the benefit of the temporal consistency in constraining 3D prediction models.

原文American English
主出版物標題Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021
發行者Institute of Electrical and Electronics Engineers Inc.
頁面1049-1058
頁數10
ISBN(電子)9780738142661
DOIs
出版狀態Published - 1月 2021
事件2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021 - Virtual, Online, 美國
持續時間: 5 1月 20219 1月 2021

出版系列

名字Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021

Conference

Conference2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021
國家/地區美國
城市Virtual, Online
期間5/01/219/01/21

指紋

深入研究「Temporal-aware self-supervised learning for 3D hand pose and mesh estimation in videos」主題。共同形成了獨特的指紋。

引用此