TY - GEN
T1 - A multi-task convolutional neural network with spatial transform for parking space detection
AU - Vu, Hoang Tran
AU - Huang, Ching-Chun
PY - 2018/9/17
Y1 - 2018/9/17
N2 - Vacant parking space detection is a challenging vision task due to outdoor lighting variation and perspective distortion. Previous methods found on camera geometry and projection matrix to select space image region for status classification. By utilizing suitable hand-crafted features, outdoor lighting variation and perspective distortion could be well handled. However, if also considering parking displacement, non-unified car size, and inter-object occlusion, we find the problem becomes more troublesome. To overcome these problems, we propose a deep learning framework to infer the parking status with two contributions. First, we integrate a convolutional spatial transformer network (STN) to crop the local image area adaptively according to car size and parking displacement. Second, in order to solve inter-object occlusion problems, we group 3 neighboring spaces as a unit. A multi-task loss function is designed to consider the status estimation of the target space and its two neighbors jointly. With the loss function, we could force our network to learn occlusion patterns while estimating space status. The results show our system can reduce the error detection rate and thereby increase system accuracy.
AB - Vacant parking space detection is a challenging vision task due to outdoor lighting variation and perspective distortion. Previous methods found on camera geometry and projection matrix to select space image region for status classification. By utilizing suitable hand-crafted features, outdoor lighting variation and perspective distortion could be well handled. However, if also considering parking displacement, non-unified car size, and inter-object occlusion, we find the problem becomes more troublesome. To overcome these problems, we propose a deep learning framework to infer the parking status with two contributions. First, we integrate a convolutional spatial transformer network (STN) to crop the local image area adaptively according to car size and parking displacement. Second, in order to solve inter-object occlusion problems, we group 3 neighboring spaces as a unit. A multi-task loss function is designed to consider the status estimation of the target space and its two neighbors jointly. With the loss function, we could force our network to learn occlusion patterns while estimating space status. The results show our system can reduce the error detection rate and thereby increase system accuracy.
KW - CNN
KW - Deep learning
KW - Parking space detection
KW - Spatial transformer network
UR - http://www.scopus.com/inward/record.url?scp=85045343554&partnerID=8YFLogxK
U2 - 10.1109/ICIP.2017.8296584
DO - 10.1109/ICIP.2017.8296584
M3 - Conference contribution
AN - SCOPUS:85045343554
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 1762
EP - 1766
BT - 2017 IEEE International Conference on Image Processing, ICIP 2017 - Proceedings
PB - IEEE Computer Society
T2 - 24th IEEE International Conference on Image Processing, ICIP 2017
Y2 - 17 September 2017 through 20 September 2017
ER -