Few-Shot and Continual Learning with Attentive Independent Mechanisms

Eugene Lee, Cheng Han Huang, Chen Yi Lee

研究成果: Conference contribution同行評審

16 引文 斯高帕斯(Scopus)

摘要

Deep neural networks (DNNs) are known to perform well when deployed to test distributions that shares high similarity with the training distribution. Feeding DNNs with new data sequentially that were unseen in the training distribution has two major challenges - fast adaptation to new tasks and catastrophic forgetting of old tasks. Such difficulties paved way for the on-going research on few-shot learning and continual learning. To tackle these problems, we introduce Attentive Independent Mechanisms (AIM). We incorporate the idea of learning using fast and slow weights in conjunction with the decoupling of the feature extraction and higher-order conceptual learning of a DNN. AIM is designed for higher-order conceptual learning, modeled by a mixture of experts that compete to learn independent concepts to solve a new task. AIM is a modular component that can be inserted into existing deep learning frameworks. We demonstrate its capability for few-shot learning by adding it to SIB and trained on MiniImageNet and CIFAR-FS, showing significant improvement. AIM is also applied to ANML and OML trained on Omniglot, CIFAR-100 and MiniImageNet to demonstrate its capability in continual learning. Code made publicly available at https://github.com/huang50213/AIM-Fewshot-Continual.

原文English
主出版物標題Proceedings - 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021
發行者Institute of Electrical and Electronics Engineers Inc.
頁面9435-9444
頁數10
ISBN(電子)9781665428125
DOIs
出版狀態Published - 2021
事件18th IEEE/CVF International Conference on Computer Vision, ICCV 2021 - Virtual, Online, 加拿大
持續時間: 11 10月 202117 10月 2021

出版系列

名字Proceedings of the IEEE International Conference on Computer Vision
ISSN(列印)1550-5499

Conference

Conference18th IEEE/CVF International Conference on Computer Vision, ICCV 2021
國家/地區加拿大
城市Virtual, Online
期間11/10/2117/10/21

指紋

深入研究「Few-Shot and Continual Learning with Attentive Independent Mechanisms」主題。共同形成了獨特的指紋。

引用此