TY - JOUR
T1 - A comparative study of data fusion for RGB-D based visual recognition
AU - Sanchez-Riera, Jordi
AU - Hua, Kai Lung
AU - Hsiao, Yuan Sheng
AU - Lim, Tekoing
AU - Hidayati, Shintami C.
AU - Cheng, Wen-Huang
N1 - Publisher Copyright:
© 2015 Elsevier B.V. All rights reserved.
PY - 2016/4/1
Y1 - 2016/4/1
N2 - Data fusion from different modalities has been extensively studied for a better understanding of multimedia contents. On one hand, the emergence of new devices and decreasing storage costs cause growing amounts of data being collected. Though bigger data makes it easier to mine information, methods for big data analytics are not well investigated. On the other hand, new machine learning techniques, such as deep learning, have been shown to be one of the key elements in achieving state-of-the-art inference performances in a variety of applications. Therefore, some of the old questions in data fusion are in need to be addressed again for these new changes. These questions are: What is the most effective way to combine data for various modalities? Does the fusion method affect the performance with different classifiers? To answer these questions, in this paper, we present a comparative study for evaluating early and late fusion schemes with several types of SVM and deep learning classifiers on two challenging RGB-D based visual recognition tasks: hand gesture recognition and generic object recognition. The findings from this study provide useful policy and practical guidance for the development of visual recognition systems.
AB - Data fusion from different modalities has been extensively studied for a better understanding of multimedia contents. On one hand, the emergence of new devices and decreasing storage costs cause growing amounts of data being collected. Though bigger data makes it easier to mine information, methods for big data analytics are not well investigated. On the other hand, new machine learning techniques, such as deep learning, have been shown to be one of the key elements in achieving state-of-the-art inference performances in a variety of applications. Therefore, some of the old questions in data fusion are in need to be addressed again for these new changes. These questions are: What is the most effective way to combine data for various modalities? Does the fusion method affect the performance with different classifiers? To answer these questions, in this paper, we present a comparative study for evaluating early and late fusion schemes with several types of SVM and deep learning classifiers on two challenging RGB-D based visual recognition tasks: hand gesture recognition and generic object recognition. The findings from this study provide useful policy and practical guidance for the development of visual recognition systems.
KW - CNN
KW - DBN
KW - Fusion
KW - RGB-D
KW - SAE
KW - SVM
UR - http://www.scopus.com/inward/record.url?scp=84955291441&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2015.12.006
DO - 10.1016/j.patrec.2015.12.006
M3 - Article
AN - SCOPUS:84955291441
SN - 0167-8655
VL - 73
SP - 1
EP - 6
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
ER -