Human-Object Interaction Detection: An Overview

Jia Wang*, Hong Han Shuai, Yung Hui Li, Wen Huang Cheng

*Corresponding author for this work

Research output: Contribution to specialist publicationArticle

Abstract

This article systematically summarizes and discusses recent research on image-based human object interaction (HOI) detection, which aims to detect human object pairs and recognize the interactive behaviors between humans and objects in an image. It has plenty of applications and can serve as the basis to assist higher level tasks of visual understanding. We introduce existing methods by categorizing them into two main groups based on the model structure: one-stage and two-stage approaches. We further divide one-stage methods into point-based, region-based, and query-based methods. Similarly, the two-stage methods are divided into HOI detection with multistream modeling, HOI detection with human parts and pose, HOI detection with compositional learning, HOI detection with graph-based modeling, and HOI detection with query-based modeling. According to this taxonomy, we also summarize and analyze the core ideas behind each strategy. Then, we present the details of the experimental protocols, evaluation metrics, datasets, and the evaluation results of the most recent representative methods. Finally, we discuss the main open challenges and future trends in the HOI detection task.

Original languageEnglish
Pages56-72
Number of pages17
Volume13
No6
Specialist publicationIEEE Consumer Electronics Magazine
DOIs
StatePublished - 2024

Fingerprint

Dive into the research topics of 'Human-Object Interaction Detection: An Overview'. Together they form a unique fingerprint.

Cite this