A Recipe for CAC: Mosaic-Based Generalized Loss for Improved Class-Agnostic Counting

Tsung Han Chou*, Brian Wang, Wei Chen Chiu, Jun Cheng Chen

*此作品的通信作者

研究成果: Conference contribution同行評審

摘要

Class agnostic counting (CAC) is a vision task that can be used to count the total occurrence number of any given reference objects in the query image. The task is usually formulated as a density map estimation problem through similarity computation among a few image samples of the reference object and the query image. In this paper, we point out a severe issue of the existing CAC framework: Given a multi-class setting, models don’t consider reference images and instead blindly match all dominant objects in the query image. Moreover, the current evaluation metrics and dataset cannot be used to faithfully assess the model’s generalization performance and robustness. To this end, we discover that the combination of mosaic augmentation with generalized loss is essential for addressing the aforementioned issue of CAC models to count objects of majority (i.e. dominant objects) regardless of the references. Furthermore, we introduce a new evaluation protocol and metrics for resolving the problem behind the existing CAC evaluation scheme and better benchmarking CAC models in a more fair manner. Besides, extensive evaluation results demonstrate that our proposed recipe can consistently improve the performance of different CAC models. The code is available at https://github.com/littlepenguin89106/MGCAC.

原文English
主出版物標題Computer Vision – ACCV 2024 - 17th Asian Conference on Computer Vision, Proceedings
編輯Minsu Cho, Ivan Laptev, Du Tran, Angela Yao, Hongbin Zha
發行者Springer Science and Business Media Deutschland GmbH
頁面413-428
頁數16
ISBN(列印)9789819609598
DOIs
出版狀態Published - 2025
事件17th Asian Conference on Computer Vision, ACCV 2024 - Hanoi, 越南
持續時間: 8 12月 202412 12月 2024

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
15477 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference17th Asian Conference on Computer Vision, ACCV 2024
國家/地區越南
城市Hanoi
期間8/12/2412/12/24

指紋

深入研究「A Recipe for CAC: Mosaic-Based Generalized Loss for Improved Class-Agnostic Counting」主題。共同形成了獨特的指紋。

引用此