Distance-dependent Feature Alignment and Selection for Imbalance 3D Point Cloud Object Detection

Ming Jen Chang, Chih Jen Cheng, Ching Chun Hsiao, I. Fan Chou, Ching Chun Huang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Although pillar-based 3D object detection methods can balance the performance and inference speed, the inconsistent object features caused by dramatic sparsity drops of LiDAR point clouds sabotage the detection accuracy. We present a novel and efficient plug-in method, SVDnet, to improve the state-of-the-art pillar-based models. First, a novel low-rank objective loss is introduced to extract distance-aware vehicle features and suppress the other variations. Next, we alleviated the remaining feature inconsistency caused by object positions with two strategies. One is a Distance Alignment Ratio-generation Network (DARN), which fuses multi-scale features by distance-adaptive ratios. The other is a position attention network that modulates features based on positions. Our results on the KITTI dataset show that SVDnet improves the pillar methods and outperforms the other plug-in strategies in accuracy and speed.

Original languageEnglish
Title of host publicationAVSS 2022 - 18th IEEE International Conference on Advanced Video and Signal-Based Surveillance
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665463829
DOIs
StatePublished - 2022
Event18th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2022 - Virtual, Online, Spain
Duration: 29 Nov 20222 Dec 2022

Publication series

NameAVSS 2022 - 18th IEEE International Conference on Advanced Video and Signal-Based Surveillance

Conference

Conference18th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2022
Country/TerritorySpain
CityVirtual, Online
Period29/11/222/12/22

Fingerprint

Dive into the research topics of 'Distance-dependent Feature Alignment and Selection for Imbalance 3D Point Cloud Object Detection'. Together they form a unique fingerprint.

Cite this