Parking Space Status Inference Upon a Deep CNN and Multi-Task Contrastive Network with Spatial Transform

Hoang Tran Vu, Ching-Chun Huang*

*此作品的通信作者

研究成果: Article同行評審

22 引文 斯高帕斯(Scopus)

摘要

Deep learning methods, especially CNNs, have achieved many promising results in a wide range of computer vision applications. However, few studies focused on designing suitable deep learning methods for parking space status inference. As we have known, it is challenging to detect parking spaces in an outdoor environment due to dynamic lighting variations, weather changes, and perspective distortion. By off-the-shelf CNNs, lighting variations might be handled well. However, to realize a practical and robust inference system, we also need to address troublesome problems, such as parking displacements, non-unified car sizes, inter-object occlusion, and perspective distortion. These problems may become even challenging if also considering the difference of space sizes. To overcome the problems, we proposed a custom-tailored deep convolutional and contrastive network with three contributions. First, we introduced a Siamese architecture to learn the contrastive and robust feature descriptor. This helps to reduce the effects owing to the variety of inter-object occlusion. Second, we integrated a convolutional Spatial Transformer Network (STN) to adaptively transform a 3-space input patch according to vehicle sizes and parking displacement. STN also helps to overcome the perspective distortion problem. Third, a multi-task loss function was designed to train the network by simultaneously considering the accuracy of inferring the status of the target space and the semantic smoothness of high-level features. Thereby, the errors caused by inter-object occlusion could be alleviated. To verify the proposed network, we visualized the learned features and analyzed their functionality. Experiments and evaluations have shown the robustness of our system in parking status inference. The real-time system currently running in public parking lots also demonstrates the effectiveness of the proposed deep network.

原文English
文章編號8337011
頁(從 - 到)1194-1208
頁數15
期刊IEEE Transactions on Circuits and Systems for Video Technology
29
發行號4
DOIs
出版狀態Published - 4月 2019

指紋

深入研究「Parking Space Status Inference Upon a Deep CNN and Multi-Task Contrastive Network with Spatial Transform」主題。共同形成了獨特的指紋。

引用此