Quatnet: Quaternion-based head pose estimation with multiregression loss

Heng Wei Hsu*, Tung Yu Wu, Sheng Wan, Wing Hung Wong, Chen-Yi Lee

*此作品的通信作者

研究成果: Article同行評審

120 引文 斯高帕斯(Scopus)

摘要

Head pose estimation has attracted immense research interest recently, as its inherent information significantly improves the performance of face-related applications such as face alignment and face recognition. In this paper, we conduct an in-depth study of head pose estimation and present a multiregression loss function, an L2 regression loss combined with an ordinal regression loss, to train a convolutional neural network (CNN) that is dedicated to estimating head poses from RGB images without depth information. The ordinal regression loss is utilized to address the nonstationary property observed as the facial features change with respect to different head pose angles and learn robust features. The L2 regression loss leverages these features to provide precise angle predictions for input images. To avoid the ambiguity problem in the commonly used Euler angle representation, we further formulate the head pose estimation problem in quaternions. Our quaternion-based multiregression loss method achieves state-of-The-Art performance on the AFLW2000, AFLW test set, and AFW datasets and is closing the gap with methods that utilize depth information on the BIWI dataset.

原文English
文章編號8444061
頁(從 - 到)1035-1046
頁數12
期刊IEEE Transactions on Multimedia
21
發行號4
DOIs
出版狀態Published - 4月 2019

指紋

深入研究「Quatnet: Quaternion-based head pose estimation with multiregression loss」主題。共同形成了獨特的指紋。

引用此