An Additive and Convolutive Bias Compensation Algorithm for Telephoe Speech Recognition
-
摘要: 为了统一地补偿电话语音受加性噪声和卷积通道响应的影响,本文提出了矢量分段多 项式近似(VPP)算法.并把此算法成功地应用到稳态噪声和非稳态噪声环境.对于稳态噪声环 境,在log谱域采用Batch EM(B EM)方法;对于非稳态噪声环境,在倒谱域采用递归EM(R EM)方法.这两种方法都是基于最小均方误差估计(MMSE)准则的特征补偿.实验结果表明,受 背景噪声和电话通道(包括固定电话和GSM)影响的大词汇量连续语音识别应用此算法误识率 可以降低约18%.Abstract: A Vector piecewise polynomial (VPP) approximation algorithm is proposed for environment compensation of speech signals degraded by both additive and convolutive noises. By investigating the model of the telephone environment, we propose a piecewise polynomial, namely two linear polynomials and a cuadratic polynomial, to approximate the environment function precisely. The VPP is applied either to the stationary noise, or to the non-stationary noise. In the first case, the batch EM is used m log spectral domain; in the second case the recursive EM with iterative stochastic approximation is developed in cepstral domain. Both approaches are based on the minimum mean squared error (MMSE) sense. Experimental results are presented on the application of this approach in improving the performance of Mandarin large vocabulary continuous speech recognition (LVCSR) due to the background noises and different transmission channels (such as fixed telephone line and GSM). The method can reduce the average character error rate (CER) by about 18%.
计量
- 文章访问数: 3228
- HTML全文浏览量: 133
- PDF下载量: 1077
- 被引次数: 0