Taming NLU Noise: Student-Teacher Learning for Robust Dialogue Policy

Mahdin Rohmatillah*, Jen Tzung Chien

*此作品的通信作者

研究成果: Conference contribution同行評審

摘要

Dialogue policy is a crucial component of dialogue systems, responsible for determining system responses based on user inputs. While reinforcement learning (RL) can effectively optimize the dialogue policy, the system performance in real-world settings is heavily influenced by an earlier component for natural language understanding (NLU). Once the NLU produces a wrong information, the dialogue policy will be affected to degrade the performance. To enhance the robustness of dialogue policy, this paper proposes integrating RL optimization with a noisy student-teacher learning, taming the noise generated by NLU. To prevent overconfidence during knowledge transfer from the teacher, we introduce a dual-teacher mechanism where knowledge distillation is carried out by using dynamic changes in the samples stored in the replay buffer which leverages the exploration-exploitation paradigm from RL. Evaluations on multi-domain multi-turn dialogue tasks demonstrate the effectiveness of this approach which shows the increased robustness to noisy NLU outputs and accordingly the improved overall system performance.

原文English
主出版物標題Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024
發行者Institute of Electrical and Electronics Engineers Inc.
頁面849-856
頁數8
ISBN(電子)9798350392258
DOIs
出版狀態Published - 2024
事件2024 IEEE Spoken Language Technology Workshop, SLT 2024 - Macao, 中國
持續時間: 2 12月 20245 12月 2024

出版系列

名字Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024

Conference

Conference2024 IEEE Spoken Language Technology Workshop, SLT 2024
國家/地區中國
城市Macao
期間2/12/245/12/24

指紋

深入研究「Taming NLU Noise: Student-Teacher Learning for Robust Dialogue Policy」主題。共同形成了獨特的指紋。

引用此