Graph-Based Embedding Improvement Feature Distribution in Videos

Junbin Zhang*, Pei Hsuan Tsai, Meng Hsun Tsai

*此作品的通信作者

研究成果: Conference contribution同行評審

1 引文 斯高帕斯(Scopus)

摘要

Video understanding is a significant computer vision research subject since online video content is growing exponentially. Feature extraction and representation play a crucial role in video understanding tasks such as classification, segmentation, and recognition. However, the model's learning is ambiguous since adjacent video frames typically have similar RGB features. To address this issue, we present graph-based embedding to enhance video feature distribution. We construct a graph-structured of videos by connecting similar features. Node embedding is generated by utilizing a graph model. Experiments demonstrate that our approach effectively improves feature distribution. The graph attention network (GAT) improves accuracy and editing score by 4% over the visual model.

原文English
主出版物標題2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面435-436
頁數2
ISBN(電子)9798350324174
DOIs
出版狀態Published - 2023
事件2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Pingtung, 台灣
持續時間: 17 7月 202319 7月 2023

出版系列

名字2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Proceedings

Conference

Conference2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023
國家/地區台灣
城市Pingtung
期間17/07/2319/07/23

指紋

深入研究「Graph-Based Embedding Improvement Feature Distribution in Videos」主題。共同形成了獨特的指紋。

引用此