TY - GEN
T1 - Refining Valence-Arousal Estimation with Dual-Stream Label Density Smoothing
AU - Xie, Hongxia
AU - Li, I. Hsuan
AU - Lo, Ling
AU - Shuai, Hong Han
AU - Cheng, Wen Huang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Emotion recognition through facial expressions remains a long-standing research pursuit, yet the challenges persist, particularly in dynamic real-world scenarios. In-The-wild datasets are hampered by limited emotion annotations due to resource constraints, hindering multi-Task methodology advancements. Recent years have witnessed a surge of approaches addressing the valence-Arousal problem. However, data imbalance, especially in valence-Arousal annotation, persists. This work proposes a novel two-stream valence-Arousal estimation network, inspired by MIMAMO Net, leveraging spatial and temporal learning to enhance emotion recognition. Label Density Smoothing (LDS) is introduced to counter skewed distributions. Experimental results showcase the approach's efficacy, achieving a Concordance Correlation Coefficient (CCC) of 0.591 for valence and 0.617 for arousal on the Aff-Wild2 validation set. This work contributes to the advancement of valence-Arousal modeling in facial expression recognition.
AB - Emotion recognition through facial expressions remains a long-standing research pursuit, yet the challenges persist, particularly in dynamic real-world scenarios. In-The-wild datasets are hampered by limited emotion annotations due to resource constraints, hindering multi-Task methodology advancements. Recent years have witnessed a surge of approaches addressing the valence-Arousal problem. However, data imbalance, especially in valence-Arousal annotation, persists. This work proposes a novel two-stream valence-Arousal estimation network, inspired by MIMAMO Net, leveraging spatial and temporal learning to enhance emotion recognition. Label Density Smoothing (LDS) is introduced to counter skewed distributions. Experimental results showcase the approach's efficacy, achieving a Concordance Correlation Coefficient (CCC) of 0.591 for valence and 0.617 for arousal on the Aff-Wild2 validation set. This work contributes to the advancement of valence-Arousal modeling in facial expression recognition.
UR - http://www.scopus.com/inward/record.url?scp=85187004508&partnerID=8YFLogxK
U2 - 10.1109/ICCE59016.2024.10444180
DO - 10.1109/ICCE59016.2024.10444180
M3 - Conference contribution
AN - SCOPUS:85187004508
T3 - Digest of Technical Papers - IEEE International Conference on Consumer Electronics
BT - 2024 IEEE International Conference on Consumer Electronics, ICCE 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Consumer Electronics, ICCE 2024
Y2 - 6 January 2024 through 8 January 2024
ER -