TY - GEN
T1 - Learning Outlier-Aware Representation with Synthetic Boundary Samples
AU - Chien, Jen Tzung
AU - Chen, Kuan
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Out-of-distribution (OOD) detection provides an essential scheme to build a reliable deep learning system. Some of the previous works additionally acquired OOD samples for model training or hyperparameter tuning. However, OOD samples are usually missing in real-world scenarios and the prior knowledge of these samples is commonly unavailable. To cope with these issues, this paper presents a novel approach to synthesizing the informative samples near the boundary between in-distribution and OOD which sufficiently reflect OOD samples in training stage. These synthetic boundary samples are located right outside in-distribution (ID) where the latent variable is distributed by a multivariate Gaussian. Accordingly, this paper presents the outlier-aware representation learning which utilizes the synthetic boundary samples to train an OOD detector by only using the unlabeled ID data. The model is then trained to learn a compact decision boundary between ID and OOD samples. The experiments demonstrate that the proposed methods outperform state-of-the-art performance in presence of different OOD data.
AB - Out-of-distribution (OOD) detection provides an essential scheme to build a reliable deep learning system. Some of the previous works additionally acquired OOD samples for model training or hyperparameter tuning. However, OOD samples are usually missing in real-world scenarios and the prior knowledge of these samples is commonly unavailable. To cope with these issues, this paper presents a novel approach to synthesizing the informative samples near the boundary between in-distribution and OOD which sufficiently reflect OOD samples in training stage. These synthetic boundary samples are located right outside in-distribution (ID) where the latent variable is distributed by a multivariate Gaussian. Accordingly, this paper presents the outlier-aware representation learning which utilizes the synthetic boundary samples to train an OOD detector by only using the unlabeled ID data. The model is then trained to learn a compact decision boundary between ID and OOD samples. The experiments demonstrate that the proposed methods outperform state-of-the-art performance in presence of different OOD data.
KW - Out-of-distribution detection
KW - computer vision
KW - contrastive learning
KW - natural language processing
UR - http://www.scopus.com/inward/record.url?scp=85177187267&partnerID=8YFLogxK
U2 - 10.1109/MLSP55844.2023.10285976
DO - 10.1109/MLSP55844.2023.10285976
M3 - Conference contribution
AN - SCOPUS:85177187267
T3 - IEEE International Workshop on Machine Learning for Signal Processing, MLSP
BT - Proceedings of the 2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing, MLSP 2023
A2 - Comminiello, Danilo
A2 - Scarpiniti, Michele
PB - IEEE Computer Society
T2 - 33rd IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2023
Y2 - 17 September 2023 through 20 September 2023
ER -