TY - JOUR
T1 - Neuromimetic sound representation for percept detection and manipulation
AU - Zotkin, Dmitry N.
AU - Chi, Tai-Shih
AU - Shamma, Shihab A.
AU - Duraiswami, Ramani
PY - 2005/6/1
Y1 - 2005/6/1
N2 - The acoustic wave received at the ears is processed by the human auditory system to separate different sounds along the intensity, pitch, and timbre dimensions. Conventional Fourier-based signal processing, while endowed with fast algorithms, is unable to easily represent a signal along these attributes. In this paper, we discuss the creation of maximally separable sounds in auditory user interfaces and use a recently proposed cortical sound representation, which performs a biomimetic decomposition of an acoustic signal, to represent and manipulate sound for this purpose. We briefly overview algorithms for obtaining, manipulating, and inverting a cortical representation of a sound and describe algorithms for manipulating signal pitch and timbre separately. The algorithms are also used to create sound of an instrument between a "guitar" and a "trumpet." Excellent sound quality can be achieved if processing time is not a concern, and intelligible signals can be reconstructed in reasonable processing time (about ten seconds of computational time for a one-second signal sampled at 8kHz). Work on bringing the algorithms into the real-time processing domain is ongoing.
AB - The acoustic wave received at the ears is processed by the human auditory system to separate different sounds along the intensity, pitch, and timbre dimensions. Conventional Fourier-based signal processing, while endowed with fast algorithms, is unable to easily represent a signal along these attributes. In this paper, we discuss the creation of maximally separable sounds in auditory user interfaces and use a recently proposed cortical sound representation, which performs a biomimetic decomposition of an acoustic signal, to represent and manipulate sound for this purpose. We briefly overview algorithms for obtaining, manipulating, and inverting a cortical representation of a sound and describe algorithms for manipulating signal pitch and timbre separately. The algorithms are also used to create sound of an instrument between a "guitar" and a "trumpet." Excellent sound quality can be achieved if processing time is not a concern, and intelligible signals can be reconstructed in reasonable processing time (about ten seconds of computational time for a one-second signal sampled at 8kHz). Work on bringing the algorithms into the real-time processing domain is ongoing.
KW - Anthropomorphic algorithms
KW - Human sound perception
KW - Pitch detection
UR - http://www.scopus.com/inward/record.url?scp=27844598720&partnerID=8YFLogxK
U2 - 10.1155/ASP.2005.1350
DO - 10.1155/ASP.2005.1350
M3 - Article
AN - SCOPUS:27844598720
SN - 1110-8657
VL - 2005
SP - 1350
EP - 1364
JO - Eurasip Journal on Applied Signal Processing
JF - Eurasip Journal on Applied Signal Processing
IS - 9
ER -