Multitask Generative Adversarial Imitation Learning for Multi-Domain Dialogue System

Chuan En Hsu, Mahdin Rohmatillah, Jen Tzung Chien

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Scopus citations

Abstract

In the task-oriented dialogue system, dialog policy plays an important role since it determines the suitable actions based on the user's goals. However, in real situations, user's goals are varying so that the system needs to deal with the complex optimization problem for dialog policy. This paper presents a novel approach to build the multi-domain dialog system based on the multitask generative adversarial imitation learning (MGAIL). MGAIL combines hierarchical reinforcement learning and generative adversarial imitation learning where a mixture of generators are represented for multitask learning. Unlike the traditional imitation learning, this method decomposes each of complex tasks into several subtasks and builds the policy in a hierarchical way to relax the agent in handling multiple complex tasks. Experiments on a multi-domain dialogue system using MultiWOZ 2.1 under ConvLab-2 frame-work show that the proposed method outperforms the other reinforcement learning methods in system-wise evaluation in terms of complete rate, success rate and book rate.

Original languageEnglish
Title of host publication2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages954-961
Number of pages8
ISBN (Electronic)9781665437394
DOIs
StatePublished - 2021
Event2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021 - Cartagena, Colombia
Duration: 13 Dec 202117 Dec 2021

Publication series

Name2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021 - Proceedings

Conference

Conference2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021
Country/TerritoryColombia
CityCartagena
Period13/12/2117/12/21

Keywords

  • Dialogue policy optimization
  • generative adversarial imitation learning
  • multi-domain dialogues

Fingerprint

Dive into the research topics of 'Multitask Generative Adversarial Imitation Learning for Multi-Domain Dialogue System'. Together they form a unique fingerprint.

Cite this