Language-Guided Negative Sample Mining for Open-Vocabulary Object Detection

Yu Wen Tseng, Hong Han Shuai, Ching Chun Huang, Yung Hui Li, Wen Huang Cheng

研究成果: Conference contribution同行評審

摘要

In the domain of computer vision, object detection serves as a fundamental perceptual task with critical implications. Traditional object detection frameworks are limited by their inability to recognize object classes not present in their training datasets, a significant drawback for practical applications where encountering novel objects is commonplace. To address the inherent lack of adaptability, more sophisticated paradigms such as zero-shot and open-vocabulary object detection have been introduced. Open-vocabulary object detection, in particular, often necessitates auxiliary image-text paired data to enhance model training. Our research proposes an innovative approach that refines the training process by mining potential unlabeled objects from negative sample pools. Leveraging a large-scale vision-language model, we harness the entropy of classification scores to selectively identify and annotate previously unlabeled samples, subsequently incorporating them into the training regimen. This novel methodology empowers our model to attain competitive performance benchmarks on the challenging MSCOCO dataset, matching state-of-the-art outcomes, while obviating the need for additional data or supplementary training procedures.

原文English
主出版物標題2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9798350371888
DOIs
出版狀態Published - 2024
事件2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 - Taipei, 台灣
持續時間: 28 1月 202431 1月 2024

出版系列

名字2024 International Conference on Electronics, Information, and Communication, ICEIC 2024

Conference

Conference2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
國家/地區台灣
城市Taipei
期間28/01/2431/01/24

指紋

深入研究「Language-Guided Negative Sample Mining for Open-Vocabulary Object Detection」主題。共同形成了獨特的指紋。

引用此