TY - JOUR
T1 - Survey on performance of deep learning models for detecting road damages using multiple dashcam image resources
AU - Cao, Minh Tu
AU - Tran, Quoc Viet
AU - Nguyen, Ngoc Mai
AU - Chang, Kuan Tsung
N1 - Publisher Copyright:
© 2020
PY - 2020/10
Y1 - 2020/10
N2 - Detecting road damage quickly and accurately facilitates the ability of road-maintenance agencies to make timely repairs to road surfaces, maintain optimal road conditions, optimize transportation safety, and minimize transportation costs. An extensive evaluation of eight deep-learning-based road-damage detection models was conducted in this study. Each model was trained on 9493 images sourced from multiple databases. The 16165 instances of road damage in these images were categorized into five types of damage, including longitudinal crack, horizontal crack, alligator damage, pothole-related crack, and line blurring. Two experiments were conducted that identified two models, single shot multi-box detector (SSD) Inception V2 and faster region-based convolutional neural networks (R-CNN) Inception V2, as providing the best balance of road-damage-detection accuracy and image processing time. These experiments demonstrated that increasing the diversity of image sources improved road-damage-detection model performance. In addition to combining data images from different sources with consistently relabeled damage instances, this study released road-damage image data from the road maintenance agency in Zhubei, Hsinchu County, Taiwan for research and other uses, increasing the limited amount of published image data sources and positively impacting future scholarly research into road damage detection.
AB - Detecting road damage quickly and accurately facilitates the ability of road-maintenance agencies to make timely repairs to road surfaces, maintain optimal road conditions, optimize transportation safety, and minimize transportation costs. An extensive evaluation of eight deep-learning-based road-damage detection models was conducted in this study. Each model was trained on 9493 images sourced from multiple databases. The 16165 instances of road damage in these images were categorized into five types of damage, including longitudinal crack, horizontal crack, alligator damage, pothole-related crack, and line blurring. Two experiments were conducted that identified two models, single shot multi-box detector (SSD) Inception V2 and faster region-based convolutional neural networks (R-CNN) Inception V2, as providing the best balance of road-damage-detection accuracy and image processing time. These experiments demonstrated that increasing the diversity of image sources improved road-damage-detection model performance. In addition to combining data images from different sources with consistently relabeled damage instances, this study released road-damage image data from the road maintenance agency in Zhubei, Hsinchu County, Taiwan for research and other uses, increasing the limited amount of published image data sources and positively impacting future scholarly research into road damage detection.
KW - Convolutional neural network
KW - Deep learning
KW - Road crack
KW - Road damage detection
KW - Road maintenance
KW - Single shot detection
UR - http://www.scopus.com/inward/record.url?scp=85091645430&partnerID=8YFLogxK
U2 - 10.1016/j.aei.2020.101182
DO - 10.1016/j.aei.2020.101182
M3 - Article
AN - SCOPUS:85091645430
SN - 1474-0346
VL - 46
JO - Advanced Engineering Informatics
JF - Advanced Engineering Informatics
M1 - 101182
ER -