TY - GEN
T1 - Revealing Hidden Context in Camouflage Instance Segmentation
AU - Phung, Thanh Hai
AU - Shuai, Hong Han
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - Predicting the instance-level masks of objects hidden in complex contexts is the goal of Camouflage Instance Segmentation (CIS), a task complicated by the striking similarities between camouflaged objects and their backgrounds. The diverse appearances of camouflage objects, including varying angles, partial visibilities, and ambiguous morphologies, further heighten this challenge. Prior works considered classifying pixels in a high uncertainty area without considering their contextual semantics, leading to numerous false positives. We proposed a novel method called Mask2Camouflage, which simultaneously enhances the modeling of contextual features and refines instance-level predicted maps. Mask2Camouflage leverages multi-scale features to integrate the extracted features from the backbone. Then, a Global Refinement Cross-Attention Module (GCA) is introduced to complement the foreground mask and background mask each other to reduce the false positive. Furthermore, by simulating a global shift clustering process, we present the Global-Shift Multi-Head Self-Attention (GSA), which enables the object query to capture not only information from earlier features but also their structural concepts, thereby reducing intra-class issues in the camouflage object detection task when validated with evaluated data. Compared with 15 state-of-the-art approaches, our Mask2Camouflage significantly improves the performance of camouflage instance segmentation. Our code is available at https://github.com/underlmao/Mask2Camouflage.
AB - Predicting the instance-level masks of objects hidden in complex contexts is the goal of Camouflage Instance Segmentation (CIS), a task complicated by the striking similarities between camouflaged objects and their backgrounds. The diverse appearances of camouflage objects, including varying angles, partial visibilities, and ambiguous morphologies, further heighten this challenge. Prior works considered classifying pixels in a high uncertainty area without considering their contextual semantics, leading to numerous false positives. We proposed a novel method called Mask2Camouflage, which simultaneously enhances the modeling of contextual features and refines instance-level predicted maps. Mask2Camouflage leverages multi-scale features to integrate the extracted features from the backbone. Then, a Global Refinement Cross-Attention Module (GCA) is introduced to complement the foreground mask and background mask each other to reduce the false positive. Furthermore, by simulating a global shift clustering process, we present the Global-Shift Multi-Head Self-Attention (GSA), which enables the object query to capture not only information from earlier features but also their structural concepts, thereby reducing intra-class issues in the camouflage object detection task when validated with evaluated data. Compared with 15 state-of-the-art approaches, our Mask2Camouflage significantly improves the performance of camouflage instance segmentation. Our code is available at https://github.com/underlmao/Mask2Camouflage.
KW - Camouflage Instance Segmentation
KW - Global-to-Local Refinement
UR - http://www.scopus.com/inward/record.url?scp=85212968647&partnerID=8YFLogxK
U2 - 10.1007/978-981-96-0966-6_1
DO - 10.1007/978-981-96-0966-6_1
M3 - Conference contribution
AN - SCOPUS:85212968647
SN - 9789819609659
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 3
EP - 20
BT - Computer Vision – ACCV 2024 - 17th Asian Conference on Computer Vision, Proceedings
A2 - Cho, Minsu
A2 - Laptev, Ivan
A2 - Tran, Du
A2 - Yao, Angela
A2 - Zha, Hongbin
PB - Springer Science and Business Media Deutschland GmbH
T2 - 17th Asian Conference on Computer Vision, ACCV 2024
Y2 - 8 December 2024 through 12 December 2024
ER -