TY - GEN
T1 - Distance-dependent Feature Alignment and Selection for Imbalance 3D Point Cloud Object Detection
AU - Chang, Ming Jen
AU - Cheng, Chih Jen
AU - Hsiao, Ching Chun
AU - Chou, I. Fan
AU - Huang, Ching Chun
N1 - Publisher Copyright: © 2022 IEEE.
PY - 2022
Y1 - 2022
AB - Although pillar-based 3D object detection methods can balance the performance and inference speed, the inconsistent object features caused by dramatic sparsity drops of LiDAR point clouds sabotage the detection accuracy. We present a novel and efficient plug-in method, SVDnet, to improve the state-of-the-art pillar-based models. First, a novel low-rank objective loss is introduced to extract distance-aware vehicle features and suppress the other variations. Next, we alleviated the remaining feature inconsistency caused by object positions with two strategies. One is a Distance Alignment Ratio-generation Network (DARN), which fuses multi-scale features by distance-adaptive ratios. The other is a position attention network that modulates features based on positions. Our results on the KITTI dataset show that SVDnet improves the pillar methods and outperforms the other plug-in strategies in accuracy and speed.
UR - http://www.scopus.com/inward/record.url?scp=85143904207&partnerID=8YFLogxK
U2 - 10.1109/AVSS56176.2022.9959572
DO - 10.1109/AVSS56176.2022.9959572
M3 - Conference contribution
AN - SCOPUS:85143904207
T3 - AVSS 2022 - 18th IEEE International Conference on Advanced Video and Signal-Based Surveillance
BT - AVSS 2022 - 18th IEEE International Conference on Advanced Video and Signal-Based Surveillance
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 18th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2022
Y2 - 29 November 2022 through 2 December 2022
ER -