DNN Audio Classification Based on Extracted Spectral Attributes

Pei Chen Lo*, Chuan Yi Liu, Tsung Hsien Chou

*Corresponding author for this work

Research output: Conference contribution, peer-reviewed

3 Citations (Scopus)

Abstract

Recent advances in multimedia systems provide remarkable audio-visual experiences in fields such as entertainment, education, communication, and industrial design. Audio quality enhancement is therefore important to that experience. However, methods for improving audio quality depend heavily on the type of audio content, such as human voice, music of different genres, or audio from various programs. This study develops an effective method for real-time audio classification based on a deep learning scheme. The three classes of interest are classical music, non-classical music, and news. Subband-power distribution (SPD) is a one-dimensional feature based on audio power in the frequency domain; it effectively reflects the spectral attributes of various audio content and allows a DNN (deep neural network) audio classifier to be implemented in real time. This study develops different DNN models for various input designs: the original SPD at different frequency resolutions, and SPD pre-processed by principal component analysis (PCA). Overall accuracy (Acc) and the prediction accuracy of each class, obtained from the confusion matrix (CFM), are evaluated to compare performance. According to our results, the DNN audio classifier that takes PCA-pre-processed SPD as input not only achieves better performance but also remarkably reduces memory requirements and computation time.
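The abstract does not give the exact SPD definition or the PCA settings, so the following is only a minimal sketch of the general idea under stated assumptions: a Hann-windowed frame, equal-width subbands, normalization to unit total power, and PCA computed through an SVD. The function names, the subband count of 32, the frame length, and the synthetic test tones are all hypothetical choices for illustration, not the authors' configuration.

```python
import numpy as np

def subband_power_distribution(frame, n_subbands=32):
    """Return a normalized 1-D subband-power vector for one audio frame.

    Assumption: equal-width subbands and unit-sum normalization; the
    paper's actual SPD definition may differ.
    """
    # One-sided power spectrum of the Hann-windowed frame.
    windowed = frame * np.hanning(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    # Split the spectrum into contiguous subbands and sum the power in each.
    bands = np.array_split(power, n_subbands)
    spd = np.array([band.sum() for band in bands])
    # Normalize so the vector describes how power is distributed,
    # independent of overall loudness.
    total = spd.sum()
    return spd / total if total > 0 else spd

def pca_reduce(features, n_components):
    """Project row-wise feature vectors onto their top principal components."""
    centered = features - features.mean(axis=0)
    # Rows of vt are the principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T

# Usage on synthetic tones (hypothetical data, not from the study).
rng = np.random.default_rng(0)
sample_rate = 16000
t = np.arange(1024) / sample_rate
frames = np.stack([
    np.sin(2 * np.pi * freq * t) + 0.01 * rng.standard_normal(t.size)
    for freq in (200, 440, 1000, 3000, 6000)
])
spd_matrix = np.stack([subband_power_distribution(f, 32) for f in frames])
reduced = pca_reduce(spd_matrix, 3)
print(spd_matrix.shape, reduced.shape)
```

In a setup like this, the reduced vectors would be what a small DNN receives as input; a shorter input vector means fewer first-layer weights, which is consistent with the memory and computation savings the abstract reports for the PCA variant.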

Original language: English
Title of host publication: Proceedings - 2022 14th International Conference on Signal Processing Systems, ICSPS 2022
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 259-262
Number of pages: 4
ISBN (Electronic): 9798350336313
Publication status: Published - 2022
Event: 14th International Conference on Signal Processing Systems, ICSPS 2022 - Virtual, Online, China
Duration: 18 Nov 2022 to 20 Nov 2022

Publication series

Name: Proceedings - 2022 14th International Conference on Signal Processing Systems, ICSPS 2022

Conference

Conference: 14th International Conference on Signal Processing Systems, ICSPS 2022
Country/Territory: China
City: Virtual, Online
Period: 18/11/22 to 20/11/22
