Cascade Meta-RCNN for Few-shot Object Detection

Shuting Li*, Qian Jiang*, Xin Jin, Nanqing Liu, Shiyu Chen, Shin Jye Lee

*此作品的通信作者

研究成果: Conference contribution同行評審

摘要

Data annotation is often a labor-intensive and time-consuming task, and in some cases, it is even challenging to collect certain data. Few-shot object detection (FSOD) addresses this challenge by recognizing objects with very few examples. Meta-learning-based FSOD usually trains models on a limited set of samples, enabling them to learn and generalize to new samples. Currently, meta-learning is widely utilized in the field of two-stage object detection, which aggregates query features and support features to obtain the final classification scores. Hence, there is a higher demand for accurate regions of interest (RoI). However, these methods only consider training samples with a single Intersection over Union (IoU) threshold. In this paper, we incorporate cascade structure into the existing Meta-RCNN model named Cascade Meta-RCNN. Specifically, the query feature is aggregated with prototypes of the support set. Then, the aggregated features are sequentially input into different RoI-Heads, which are trained with progressively increasing IoU thresholds. This method compels the network to generate more accurate query RoI features for matching with support prototypes. Additionally, we integrated the Channel and Spatial Attention (CSA) module into the model's backbone, enhancing the network's discriminative ability and further boosting its performance. To validate the effectiveness of our approach, we conducted a series of experiments on the PASCAL VOC dataset. The results demonstrate that our method outperforms the current state-of-the-art methods.

原文English
主出版物標題Proceedings - 2023 IEEE International Conference on Parallel and Distributed Processing with Applications, Big Data and Cloud Computing, Sustainable Computing and Communications, Social Computing and Networking, ISPA/BDCloud/SocialCom/SustainCom 2023
發行者Institute of Electrical and Electronics Engineers Inc.
頁面466-473
頁數8
ISBN(電子)9798350329223
DOIs
出版狀態Published - 2023
事件21st IEEE International Symposium on Parallel and Distributed Processing with Applications, 13th IEEE International Conference on Big Data and Cloud Computing, 16th IEEE International Conference on Social Computing and Networking and 13th International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2023 - Wuhan, 中國
持續時間: 21 12月 202324 12月 2023

出版系列

名字Proceedings - 2023 IEEE International Conference on Parallel and Distributed Processing with Applications, Big Data and Cloud Computing, Sustainable Computing and Communications, Social Computing and Networking, ISPA/BDCloud/SocialCom/SustainCom 2023

Conference

Conference21st IEEE International Symposium on Parallel and Distributed Processing with Applications, 13th IEEE International Conference on Big Data and Cloud Computing, 16th IEEE International Conference on Social Computing and Networking and 13th International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2023
國家/地區中國
城市Wuhan
期間21/12/2324/12/23

指紋

深入研究「Cascade Meta-RCNN for Few-shot Object Detection」主題。共同形成了獨特的指紋。

引用此