Monocular 3D Localization of Vehicles in Road Scenes

Haotian Zhang, Haorui Ji, Aotian Zheng, Jenq Neng Hwang, Ren Hung Hwang

研究成果: Conference contribution同行評審

6 引文 斯高帕斯(Scopus)

摘要

Sensing and perception systems for autonomous driving vehicles in road scenes are composed of three crucial components: 3D-based object detection, tracking, and localization. While all three components are important, most relevant papers tend to only focus on one single component. We propose a monocular vision-based framework for 3D-based detection, tracking, and localization by effectively integrating all three tasks in a complementary manner. Our system contains an RCNN-based Localization Network (LOCNet), which works in concert with fitness evaluation score (FES) based single-frame optimization, to get more accurate and refined 3D vehicle localization. To better utilize the temporal information, we further use a multi-frame optimization technique, taking advantage of camera ego-motion and a 3D TrackletNet Tracker (3D TNT), to improve both accuracy and consistency in our 3D localization results. Our system outperforms state-of-the-art image-based solutions in diverse scenarios and is even comparable with LiDAR-based methods.

原文English
主出版物標題Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021
發行者Institute of Electrical and Electronics Engineers Inc.
頁面2855-2864
頁數10
ISBN(電子)9781665401913
DOIs
出版狀態Published - 2021
事件18th IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021 - Virtual, Online, 加拿大
持續時間: 11 10月 202117 10月 2021

出版系列

名字Proceedings of the IEEE International Conference on Computer Vision
2021-October
ISSN(列印)1550-5499

Conference

Conference18th IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021
國家/地區加拿大
城市Virtual, Online
期間11/10/2117/10/21

指紋

深入研究「Monocular 3D Localization of Vehicles in Road Scenes」主題。共同形成了獨特的指紋。

引用此