TY - GEN
T1 - AERIAL VIEW RIVER LANDFORM VIDEO SEGMENTATION
T2 - 31st IEEE International Conference on Image Processing, ICIP 2024
AU - Chen, Chi Han
AU - Chen, Chieh Ming
AU - Cheng, Wen Huang
AU - Huang, Ching Chun
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The study of terrain and landform classification through UAV remote sensing diverges significantly from ground vehicle patrol tasks. Besides grappling with the complexity of data annotation and ensuring temporal consistency, it also confronts the scarcity of relevant data and the limitations imposed by the effective range of many technologies. This research substantiates that, in aerial positioning tasks, both the mean Intersection over Union (mIoU) and temporal consistency (TC) metrics are of paramount importance. It is demonstrated that fully labeled data is not the optimal choice, as selecting only key data lacks the enhancement in TC, leading to failures. Hence, a teacher-student architecture, coupled with key frame selection and key frame updating algorithms, is proposed. This framework successfully performs weakly supervised learning and TC knowledge distillation, overcoming the deficiencies of traditional TC training in aerial tasks. The experimental results reveal that our method utilizing merely 30% of labeled data, concurrently elevates mIoU and temporal consistency ensuring stable localization of terrain objects. Result demo: https://gitlab.com/prophet.ai.inc/drone-basedriverbed-inspection.
AB - The study of terrain and landform classification through UAV remote sensing diverges significantly from ground vehicle patrol tasks. Besides grappling with the complexity of data annotation and ensuring temporal consistency, it also confronts the scarcity of relevant data and the limitations imposed by the effective range of many technologies. This research substantiates that, in aerial positioning tasks, both the mean Intersection over Union (mIoU) and temporal consistency (TC) metrics are of paramount importance. It is demonstrated that fully labeled data is not the optimal choice, as selecting only key data lacks the enhancement in TC, leading to failures. Hence, a teacher-student architecture, coupled with key frame selection and key frame updating algorithms, is proposed. This framework successfully performs weakly supervised learning and TC knowledge distillation, overcoming the deficiencies of traditional TC training in aerial tasks. The experimental results reveal that our method utilizing merely 30% of labeled data, concurrently elevates mIoU and temporal consistency ensuring stable localization of terrain objects. Result demo: https://gitlab.com/prophet.ai.inc/drone-basedriverbed-inspection.
KW - River landform segmentation
KW - distillation
KW - temporal consistency
KW - weakly supervised segmentation
UR - http://www.scopus.com/inward/record.url?scp=85216832787&partnerID=8YFLogxK
U2 - 10.1109/ICIP51287.2024.10647662
DO - 10.1109/ICIP51287.2024.10647662
M3 - Conference contribution
AN - SCOPUS:85216832787
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 984
EP - 990
BT - 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings
PB - IEEE Computer Society
Y2 - 27 October 2024 through 30 October 2024
ER -