TY - GEN
T1 - Improving LiDAR Semantic Segmentation on Minority Classes and Generalization Capability for Autonomous Driving
AU - Tseng, Chiao Hua
AU - Lin, Yu Ting
AU - Lin, Wen Chieh
AU - Wang, Chieh Chih
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - LiDARs have emerged as an important sensor in autonomous driving systems because they offer more accurate geometric measurements than cameras and radars. Therefore, LiDARs have been commonly combined with cameras or radars to tackle many perception problems in autonomous driving, such as object detection, semantic segmentation, or navigation. For semantic segmentation of LiDAR data, due to the class imbalance issue of large-scale scene, there is a performance gap between majority classes and minority classes of large-scale dataset. The minority classes usually include the crucial classes to the autonomous driving, such as 'person', 'motorcyclist', 'traffic-sign'. To improve the performance of minority classes, we adopt U-Net++ as the architecture, KPConv as convolution operator, and use both dice loss and cross entropy as loss functions. We get 5.1% mIoU improvement on SemanticKITTI of all classes and 9.5% mIoU improvement of minority classes. Moreover, due to the different resolution of LiDAR sensors, we show the generalization capability of our model by training it on 64-beam dataset and testing on 32-beam and 128-beam dataset. We get 3.3% mIoU improvement on 128-beam dataset and 1.9% mIoU improvement on 32-beam dataset.
AB - LiDARs have emerged as an important sensor in autonomous driving systems because they offer more accurate geometric measurements than cameras and radars. Therefore, LiDARs have been commonly combined with cameras or radars to tackle many perception problems in autonomous driving, such as object detection, semantic segmentation, or navigation. For semantic segmentation of LiDAR data, due to the class imbalance issue of large-scale scene, there is a performance gap between majority classes and minority classes of large-scale dataset. The minority classes usually include the crucial classes to the autonomous driving, such as 'person', 'motorcyclist', 'traffic-sign'. To improve the performance of minority classes, we adopt U-Net++ as the architecture, KPConv as convolution operator, and use both dice loss and cross entropy as loss functions. We get 5.1% mIoU improvement on SemanticKITTI of all classes and 9.5% mIoU improvement of minority classes. Moreover, due to the different resolution of LiDAR sensors, we show the generalization capability of our model by training it on 64-beam dataset and testing on 32-beam and 128-beam dataset. We get 3.3% mIoU improvement on 128-beam dataset and 1.9% mIoU improvement on 32-beam dataset.
KW - LiDAR semantic segmentation
KW - autonomous driving
KW - deep learning
KW - generalization capability
KW - minority class
UR - http://www.scopus.com/inward/record.url?scp=85150034481&partnerID=8YFLogxK
U2 - 10.1109/TAAI57707.2022.00032
DO - 10.1109/TAAI57707.2022.00032
M3 - Conference contribution
AN - SCOPUS:85150034481
T3 - Proceedings - 2022 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2022
SP - 131
EP - 136
BT - Proceedings - 2022 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 27th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2022
Y2 - 1 December 2022 through 3 December 2022
ER -