TY - GEN
T1 - Intelligent voice smoother for silence-suppressed voice over Internet
AU - Tien, Po-Lung
AU - Yuang, Maria C.
PY - 1998/12/1
Y1 - 1998/12/1
N2 - For transporting voice data with silence suppression over the Internet, the problem of jitter introduced from the network often renders the speech unintelligible. It is thus indispensable to offer intramedia synchronization to remove jitter while retaining minimal playout delay. We propose a neural-network-based intra-voice synchronization mechanism, called the intelligent voice smoother (IVoS). The IVoS is composed of three components: smoother buffer, neural network (NN) traffic predictor, and constant bit rate (CBR) enforcer. Newly arriving frames, being assumed to follow a generic Markov modulated Bernoulli process (MMBP), are queued in the smoother buffer. The NN traffic predictor employs an on-line-trained backpropagation neural network (BPNN) to predict three traffic characteristics of every newly encountered talkspurt period. Based on the predicted characteristics, the CBR enforcer derives an adaptive buffering delay. It then imposes such delay on the playout of the first frame in the talkspurt period. The CBR enforcer in turn regulates CBR-based departures for the remaining frames of the talkspurt, aiming at assuring minimal mean and variance of distortion of talkspurts (DOT) and mean playout delay (PD). Simulation results reveal that, compared to three other playout approaches, IVoS achieves superior playout yielding negligible DOT and PD irrespective of traffic variation.
AB - For transporting voice data with silence suppression over the Internet, the problem of jitter introduced from the network often renders the speech unintelligible. It is thus indispensable to offer intramedia synchronization to remove jitter while retaining minimal playout delay. We propose a neural-network-based intra-voice synchronization mechanism, called the intelligent voice smoother (IVoS). The IVoS is composed of three components: smoother buffer, neural network (NN) traffic predictor, and constant bit rate (CBR) enforcer. Newly arriving frames, being assumed to follow a generic Markov modulated Bernoulli process (MMBP), are queued in the smoother buffer. The NN traffic predictor employs an on-line-trained backpropagation neural network (BPNN) to predict three traffic characteristics of every newly encountered talkspurt period. Based on the predicted characteristics, the CBR enforcer derives an adaptive buffering delay. It then imposes such delay on the playout of the first frame in the talkspurt period. The CBR enforcer in turn regulates CBR-based departures for the remaining frames of the talkspurt, aiming at assuring minimal mean and variance of distortion of talkspurts (DOT) and mean playout delay (PD). Simulation results reveal that, compared to three other playout approaches, IVoS achieves superior playout yielding negligible DOT and PD irrespective of traffic variation.
UR - http://www.scopus.com/inward/record.url?scp=0031644953&partnerID=8YFLogxK
U2 - 10.1109/ICC.1998.683132
DO - 10.1109/ICC.1998.683132
M3 - Conference contribution
AN - SCOPUS:0031644953
SN - 0780347889
SN - 9780780347885
T3 - International Conference on Communications - Proceedings
SP - 1764
EP - 1768
BT - ICC 1998 - 1998 IEEE International Conference on Communications
T2 - 1998 IEEE International Conference on Communications: New Century Communications, ICC 1998 - Affiliated with SUPERCOMM 1998
Y2 - 7 June 1998 through 11 June 1998
ER -