DGGAN: Depth-image guided generative adversarial networks for disentangling RGB and depth images in 3D hand pose estimation

Liangjian Chen, Shih Yao Lin, Yusheng Xie, Yen-Yu Lin, Wei Fan, Xiaohui Xie

研究成果: Conference contribution同行評審

30 引文 斯高帕斯(Scopus)

摘要

Estimating 3D hand poses from RGB images is essential to a wide range of potential applications, but is challenging owing to substantial ambiguity in the inference of depth information from RGB images. State-of-the-art estimators address this problem by regularizing 3D hand pose estimation models during training to enforce the consistency between the predicted 3D poses and the ground-truth depth maps. However, these estimators rely on both RGB images and the paired depth maps during training. In this study, we propose a conditional generative adversarial network (GAN) model, called Depth-image Guided GAN (DGGAN), to generate realistic depth maps conditioned on the input RGB image, and use the synthesized depth maps to regularize the 3D hand pose estimation model, therefore eliminating the need for ground-truth depth maps. Experimental results on multiple benchmark datasets show that the synthesized depth maps produced by DGGAN are quite effective in regularizing the pose estimation model, yielding new state-of-the-art results in estimation accuracy, notably reducing the mean 3D endpoint errors (EPE) by 4.7%, 16.5%, and 6.8% on the RHD, STB and MHP datasets, respectively.

原文American English
主出版物標題Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020
發行者Institute of Electrical and Electronics Engineers Inc.
頁面400-408
頁數9
ISBN(電子)9781728165530
DOIs
出版狀態Published - 3月 2020
事件2020 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2020 - Snowmass Village, 美國
持續時間: 1 3月 20205 3月 2020

出版系列

名字Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020

Conference

Conference2020 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2020
國家/地區美國
城市Snowmass Village
期間1/03/205/03/20

指紋

深入研究「DGGAN: Depth-image guided generative adversarial networks for disentangling RGB and depth images in 3D hand pose estimation」主題。共同形成了獨特的指紋。

引用此