TY - GEN
T1 - Multi-task Cascaded and Densely Connected Convolutional Networks Applied to Human Face Detection and Facial Expression Recognition System
AU - Chou, Kuan Yu
AU - Cheng, Yi Wen
AU - Chen, Wei Ren
AU - Chen, Yon-Ping
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - Face detection and recognition is an important issue and a difficult task in computer vision and human-computer interaction. Recently, with the development of deep learning, several related technologies have been proposed for face detection and facial expression recognition (FER), and the outstanding convolutional neural networks are the most common used in this field. This thesis applies the multi-task cascade convolutional neural network to face detection, and then designs the real-time FER system based on densely connected convolution network (DenseNet). The system first scales the input image to an image pyramid, and then uses the hierarchical network to determine whether a candidate window includes a human face. If a face exists, then send the candidate window to the FER system. Since DenseNet possesses the property of feature reuse, it can effectively reduce the amount of parameters and computation efforts, beneficial to develop the real-time system. In order to capture the variation of facial muscle in different expressions, this architecture adopts convolution operations with a stride 1 and tries different numbers of dense blocks. Through experiments, the proposed system can achieve real-time recognition in 30FPS and with recognition accuracy better than human eyes.
AB - Face detection and recognition is an important issue and a difficult task in computer vision and human-computer interaction. Recently, with the development of deep learning, several related technologies have been proposed for face detection and facial expression recognition (FER), and the outstanding convolutional neural networks are the most common used in this field. This thesis applies the multi-task cascade convolutional neural network to face detection, and then designs the real-time FER system based on densely connected convolution network (DenseNet). The system first scales the input image to an image pyramid, and then uses the hierarchical network to determine whether a candidate window includes a human face. If a face exists, then send the candidate window to the FER system. Since DenseNet possesses the property of feature reuse, it can effectively reduce the amount of parameters and computation efforts, beneficial to develop the real-time system. In order to capture the variation of facial muscle in different expressions, this architecture adopts convolution operations with a stride 1 and tries different numbers of dense blocks. Through experiments, the proposed system can achieve real-time recognition in 30FPS and with recognition accuracy better than human eyes.
KW - Densely Connected Convolutional Networks(DenseNet)
KW - Facial Expression Recognition(FER)
KW - Multi-task Cascaded Convolutional Networks(MTCNN)
UR - http://www.scopus.com/inward/record.url?scp=85082984953&partnerID=8YFLogxK
U2 - 10.1109/CACS47674.2019.9024357
DO - 10.1109/CACS47674.2019.9024357
M3 - Conference contribution
AN - SCOPUS:85082984953
T3 - 2019 International Automatic Control Conference, CACS 2019
BT - 2019 International Automatic Control Conference, CACS 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 International Automatic Control Conference, CACS 2019
Y2 - 13 November 2019 through 16 November 2019
ER -