A Self-Supervised Solution for the Switch-Toggling Visual Task

Yuehong Huang*, Yu Chee Tseng

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

How a robot explores and interacts with the real world by itself is a major research challenge. On the other hand, causal reasoning and combinatorial generalization are indispensable parts of human intelligence for exploration and survival. This paper presents SelfSVT, a self-supervised solution for the switch-toggling visual task, of which the goal is to infer causalities of visual combinatorial effects on the environment. Specifically, when a robot takes over a new place with a set of light switches and knows nothing about the switches' functions, it has to figure out how to transfer the environment from the current visual state to a goal visual state by toggling these switches by itself. SelfSVT trains efficient learning models that are able to perform the goal-conditioned visual task by directly reasoning the causalities of different visual states or inferring the switch states from its observations. In particular, we use the switch state to directly represent the combinatorial effect to make self-supervised learning possible and our framework adopts a siamese network with a discrete contrastive loss. It can perform causal induction and combinatorial generalization in a new environment with a few interactions. Our solution outperforms previous methods in both simulated and real-world environments and both static and dynamic environments. SelfSVT could achieve 100% success reasoning rates in most cases when there are sufficient interactions with the environment.

Original languageEnglish
Title of host publication2022 26th International Conference on Pattern Recognition, ICPR 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3429-3435
Number of pages7
ISBN (Electronic)9781665490627
DOIs
StatePublished - 2022
Event26th International Conference on Pattern Recognition, ICPR 2022 - Montreal, Canada
Duration: 21 Aug 202225 Aug 2022

Publication series

NameProceedings - International Conference on Pattern Recognition
Volume2022-August
ISSN (Print)1051-4651

Conference

Conference26th International Conference on Pattern Recognition, ICPR 2022
Country/TerritoryCanada
CityMontreal
Period21/08/2225/08/22

Fingerprint

Dive into the research topics of 'A Self-Supervised Solution for the Switch-Toggling Visual Task'. Together they form a unique fingerprint.

Cite this