Abstract
Although facial expression recognition based on deep learning has become a major trend, existing methods have been found to prefer learning spurious statistical correlations and non-robust features during training. This degenerates the model's generalizability in practical situations. One of the research fields mitigating such misperception of correlations as causality is causal reasoning. In this article, we propose a learnable counterfactual attention mechanism, CA-FER, that uses causal reasoning to simultaneously optimize feature discrimination and diversity to mitigate spurious correlations in expression datasets. To the best of our knowledge, this is the first work to study the spurious correlations in facial expression recognition from a counterfactual attention perspective. Extensive experiments on a synthetic dataset and four public datasets demonstrate that our method outperforms previous methods, which shows the effectiveness and generalizability of our learnable counterfactual attention mechanism for the expression recognition task.
Original language | English |
---|---|
Pages (from-to) | 977-989 |
Number of pages | 13 |
Journal | IEEE Transactions on Affective Computing |
Volume | 15 |
Issue number | 3 |
DOIs | |
State | Published - 2024 |
Keywords
- Causal learning
- facial expression recognition
- spurious correlation