Self-Supervise Reinforcement Learning Method for Vacant Parking Space Detection Based on Task Consistency and Corrupted Rewards

Manh Hung Nguyen, Tzu Yin Chao, Ching Chun Hsiao, Yung Hui Li, Ching Chun Huang*

*此作品的通信作者

研究成果: Article同行評審

摘要

This paper proposes a novel task-consistency learning method that enables us to train a vacant space detection network (target task) based on the logic consistency with the semantic outcomes from a flow-based motion behavior classifier (source task) in a parking lot. Note that the source task can introduce false detection during task-consistency learning, which implies noisy rewards or supervision. The target network can be trained in a reinforcement learning setting by appropriately designing the reward mechanism upon semantic consistency. We also introduce a novel symmetric constraint to detect corrupted samples and reduce the effect of noisy rewards. Unlike conventional corrupted learning methods that use only training losses to identify corrupted samples, our symmetric constraint also explores the relationship among training samples to improve performance. Compared with conventional supervised detection methods, the main contribution of our work is the ability to learn a vacant space detector via semantic consistency rather than supervised labels. The dynamic learning property allows the proposed detector to be easily deployed and updated in various lots without heavy human loads. Experiments demonstrate that our noisy task consistency mechanism can be successfully applied to train a vacant space detector from scratch.

原文English
頁(從 - 到)1346-1363
頁數18
期刊IEEE Transactions on Intelligent Transportation Systems
25
發行號2
DOIs
出版狀態Published - 1 2月 2024

指紋

深入研究「Self-Supervise Reinforcement Learning Method for Vacant Parking Space Detection Based on Task Consistency and Corrupted Rewards」主題。共同形成了獨特的指紋。

引用此