跳至主導覽 跳至搜尋 跳過主要內容

Meta Learning for Domain Agnostic Soft Prompt

  • Ming Yen Chen*
  • , Mahdin Rohmatillah*
  • , Ching Hsien Lee
  • , Jen Tzung Chien*
  • *此作品的通信作者

研究成果: Conference contribution同行評審

12 引文 斯高帕斯(Scopus)

摘要

The prompt-based learning, as used in GPT-3, has become a popular approach to extract knowledge from a powerful pre-trained language model (PLM) for natural language understanding tasks. However, either applying the hard prompt for sentences by defining a collection of human-engineering prompt templates or directly optimizing the soft or continuous prompt with labeled data may not really generalize well for unseen domain data. To cope with this issue, this paper presents a new prompt-based unsupervised domain adaptation where the learned soft prompt is able to boost the frozen pre-trained language model to deal with the input tokens from unseen domains. Importantly, the meta learning and optimization is developed to carry out the domain agnostic soft prompt where the loss for masked language model is minimized. The experiments on multi-domain natural language understanding tasks show the merits of the proposed method.

原文English
主出版物標題ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9781728163277
DOIs
出版狀態Published - 2023
事件48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023 - Rhodes Island, 希臘
持續時間: 4 6月 202310 6月 2023

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
2023-June
ISSN(列印)1520-6149

Conference

Conference48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
國家/地區希臘
城市Rhodes Island
期間4/06/2310/06/23

指紋

深入研究「Meta Learning for Domain Agnostic Soft Prompt」主題。共同形成了獨特的指紋。

引用此