Social-SSL: Self-supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction

Li Wu Tsao*, Yan Kai Wang, Hao Siang Lin, Hong Han Shuai, Lai Kuan Wong, Wen Huang Cheng

*Corresponding author for this work

Research output: Conference contribution, peer-reviewed

10 citations (Scopus)

Abstract

Earlier trajectory prediction approaches focus on ways of capturing sequential structures among pedestrians by using recurrent networks, which are known to have limitations in capturing long sequence structures. To address this limitation, some recent works proposed Transformer-based architectures, which are built with attention mechanisms. However, these Transformer-based networks are trained end-to-end without capitalizing on the value of pre-training. In this work, we propose Social-SSL, which captures cross-sequence trajectory structures via self-supervised pre-training; this pre-training plays a crucial role in improving both the data efficiency and the generalizability of Transformer networks for trajectory prediction. Specifically, Social-SSL models the interaction and motion patterns with three pretext tasks: interaction type prediction, closeness prediction, and masked cross-sequence to sequence pre-training. Comprehensive experiments show that Social-SSL outperforms the state-of-the-art methods by at least 12% and 20% on ETH/UCY and SDD datasets in terms of Average Displacement Error and Final Displacement Error (code available at https://github.com/Sigta678/Social-SSL).
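For readers unfamiliar with the two metrics named in the abstract, the following is a minimal sketch of their standard definitions for a single predicted trajectory. It is not taken from the Social-SSL repository: the function name `displacement_errors` and the toy trajectories are hypothetical, and benchmark protocols typically aggregate these values over many agents and scenes, which this sketch does not attempt.

```python
import math


def displacement_errors(pred, truth):
    """Return (ADE, FDE) for one predicted trajectory.

    pred, truth: equal-length sequences of (x, y) positions,
    one entry per predicted future time step.
    """
    if len(pred) != len(truth) or not pred:
        raise ValueError("trajectories must be non-empty and equally long")

    # Euclidean distance between prediction and ground truth at each step.
    dists = [
        math.hypot(px - tx, py - ty)
        for (px, py), (tx, ty) in zip(pred, truth)
    ]

    ade = sum(dists) / len(dists)  # Average Displacement Error: mean over all steps
    fde = dists[-1]                # Final Displacement Error: last step only
    return ade, fde


# Toy example (illustrative values only, not data from the paper).
pred = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
truth = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
ade, fde = displacement_errors(pred, truth)
```

In this toy case the per-step errors are 0, 1, and 2, so ADE averages them while FDE reports only the last one. This is why FDE is usually the larger of the two when errors accumulate over the prediction horizon.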

Original language: English
Title of host publication: Computer Vision – ECCV 2022 - 17th European Conference, Proceedings
Editors: Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 234-250
Number of pages: 17
ISBN (Print): 9783031200465
DOIs
Publication status: Published - 2022
Event: 17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, Israel
Duration: 23 Oct 2022 to 27 Oct 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13682 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 17th European Conference on Computer Vision, ECCV 2022
Country/Territory: Israel
City: Tel Aviv
Period: 23/10/22 to 27/10/22
