Quatnet: Quaternion-based head pose estimation with multiregression loss

Heng Wei Hsu*, Tung Yu Wu, Sheng Wan, Wing Hung Wong, Chen-Yi Lee


Research output: Article, peer-reviewed

120 citations (Scopus)


Head pose estimation has attracted immense research interest recently, as its inherent information significantly improves the performance of face-related applications such as face alignment and face recognition. In this paper, we conduct an in-depth study of head pose estimation and present a multiregression loss function, an L2 regression loss combined with an ordinal regression loss, to train a convolutional neural network (CNN) dedicated to estimating head poses from RGB images without depth information. The ordinal regression loss addresses the nonstationary property observed as facial features change with head pose angle, and helps the network learn robust features. The L2 regression loss leverages these features to provide precise angle predictions for input images. To avoid the ambiguity problem in the commonly used Euler angle representation, we further formulate the head pose estimation problem in quaternions. Our quaternion-based multiregression loss method achieves state-of-the-art performance on the AFLW2000, AFLW test set, and AFW datasets and is closing the gap with methods that utilize depth information on the BIWI dataset.
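
The Euler-angle ambiguity mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the function name `euler_to_quaternion`, the ZYX (yaw, pitch, roll) rotation order, and the (w, x, y, z) component order are all assumptions made here for illustration, and the authors' conventions may differ. The example shows two different Euler triples at a pitch of 90 degrees that map to the same unit quaternion, that is, the same rotation.

```python
import math


def euler_to_quaternion(yaw, pitch, roll):
    """Convert Euler angles in radians to a unit quaternion (w, x, y, z).

    Assumed convention (hypothetical, not confirmed by the source):
    intrinsic ZYX order, i.e. yaw about z, then pitch about y, then roll about x.
    """
    cy, sy = math.cos(yaw / 2), math.sin(yaw / 2)
    cp, sp = math.cos(pitch / 2), math.sin(pitch / 2)
    cr, sr = math.cos(roll / 2), math.sin(roll / 2)
    w = cr * cp * cy + sr * sp * sy
    x = sr * cp * cy - cr * sp * sy
    y = cr * sp * cy + sr * cp * sy
    z = cr * cp * sy - sr * sp * cy
    return (w, x, y, z)


# Two distinct Euler triples with pitch = 90 degrees (gimbal lock).
# Under the assumed convention only the difference (roll - yaw) matters here,
# so both triples describe the same rotation.
q_a = euler_to_quaternion(math.radians(30), math.radians(90), math.radians(10))
q_b = euler_to_quaternion(math.radians(50), math.radians(90), math.radians(30))

# True when every component of the two quaternions agrees.
same_rotation = all(math.isclose(a, b, abs_tol=1e-12) for a, b in zip(q_a, q_b))
print(same_rotation)
```

A regression target expressed in Euler angles would treat these two labels as 20 degrees apart in both yaw and roll, even though the head orientation is identical. The quaternion target does not have this problem, although a quaternion and its negation still encode the same rotation, which a loss may need to account for.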

Pages (from-to): 1035-1046
Journal: IEEE Transactions on Multimedia
Publication status: Published - Apr 2019

