Distance-dependent Feature Alignment and Selection for Imbalance 3D Point Cloud Object Detection

Ming Jen Chang, Chih Jen Cheng, Ching Chun Hsiao, I. Fan Chou, Ching Chun Huang

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Although pillar-based 3D object detection methods can balance the performance and inference speed, the inconsistent object features caused by dramatic sparsity drops of LiDAR point clouds sabotage the detection accuracy. We present a novel and efficient plug-in method, SVDnet, to improve the state-of-the-art pillar-based models. First, a novel low-rank objective loss is introduced to extract distance-aware vehicle features and suppress the other variations. Next, we alleviated the remaining feature inconsistency caused by object positions with two strategies. One is a Distance Alignment Ratio-generation Network (DARN), which fuses multi-scale features by distance-adaptive ratios. The other is a position attention network that modulates features based on positions. Our results on the KITTI dataset show that SVDnet improves the pillar methods and outperforms the other plug-in strategies in accuracy and speed.

    Original languageEnglish
    Title of host publicationAVSS 2022 - 18th IEEE International Conference on Advanced Video and Signal-Based Surveillance
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    ISBN (Electronic)9781665463829
    DOIs
    StatePublished - 2022
    Event18th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2022 - Virtual, Online, Spain
    Duration: 29 Nov 20222 Dec 2022

    Publication series

    NameAVSS 2022 - 18th IEEE International Conference on Advanced Video and Signal-Based Surveillance

    Conference

    Conference18th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2022
    Country/TerritorySpain
    CityVirtual, Online
    Period29/11/222/12/22

    Fingerprint

    Dive into the research topics of 'Distance-dependent Feature Alignment and Selection for Imbalance 3D Point Cloud Object Detection'. Together they form a unique fingerprint.

    Cite this