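A System Combination Based Keyword-spotting Method Using Complementary Acoustic Models

MENG Meng (孟猛); WANG Xiaorui (王晓瑞); LIANG Jiaen (梁家恩); XU Bo (徐波)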

doi:10.3724/SP.J.1004.2009.00039

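1. Digital Content Technology Research Center, Institute of Automation, Chinese Academy of Sciences, Beijing 100190
2. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190

Corresponding author: MENG Meng

CLC number: TP391.4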
Publication history
- Received: 2007-12-21
- Revised: 2008-06-30
- Published: 2009-01-20

Abstract

This work explores a system-combination approach to keyword spotting (KWS) that builds on complementary acoustic models. The method has two steps: 1) starting from the baseline system, two additional acoustic systems with different modeling characteristics are built, one by using a different phone set as the modeling units and one by using neural-network-based acoustic modeling; 2) the outputs of the three KWS systems are combined under a suitable fusion criterion to produce a better detection result. The approach exploits the complementarity between the differently modeled systems and improves performance without any additional training data. On the Mandarin conversational telephone speech (CTS) KWS evaluation set of the 2005 China National "863" Evaluation, it achieves a 21.6% relative reduction in equal error rate (EER) over the baseline system.

Keywords: keyword spotting (KWS); Gaussian mixture model (GMM); neural networks (NN)
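
The abstract does not state which fusion criterion was selected or how EER was normalised in the evaluation, so the following Python sketch is only an illustration under stated assumptions, not the authors' method. It assumes each system emits keyword hits with a time span and a confidence in [0, 1], merges overlapping hits of the same keyword across systems, and averages the confidences with a missing hit counted as 0. The `Detection` type, `fuse_detections`, and `equal_error_rate` are hypothetical names introduced here, and the EER is computed per scored trial, which may differ from the official evaluation protocol.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence


@dataclass
class Detection:
    """One putative keyword hit (hypothetical structure for this sketch)."""
    keyword: str
    start: float  # seconds
    end: float    # seconds
    score: float  # confidence in [0, 1]


def fuse_detections(systems: Sequence[List[Detection]]) -> List[Detection]:
    """Merge overlapping same-keyword hits and average scores over all systems.

    Assumed rule: a system that did not report a hit contributes a score of 0.
    """
    n_systems = len(systems)
    clusters: List[Dict] = []
    for sys_idx, hits in enumerate(systems):
        for hit in hits:
            for c in clusters:
                same_word = c["keyword"] == hit.keyword
                overlap = hit.start < c["end"] and c["start"] < hit.end
                if same_word and overlap:
                    # Grow the cluster span and keep this system's best score.
                    c["start"] = min(c["start"], hit.start)
                    c["end"] = max(c["end"], hit.end)
                    prev = c["scores"].get(sys_idx, 0.0)
                    c["scores"][sys_idx] = max(prev, hit.score)
                    break
            else:
                # No matching cluster was found, so start a new one.
                clusters.append({
                    "keyword": hit.keyword,
                    "start": hit.start,
                    "end": hit.end,
                    "scores": {sys_idx: hit.score},
                })
    return [
        Detection(c["keyword"], c["start"], c["end"],
                  sum(c["scores"].values()) / n_systems)
        for c in clusters
    ]


def equal_error_rate(scores: Sequence[float], labels: Sequence[bool]) -> float:
    """Return the error rate at the threshold where miss and false-alarm rates are closest.

    labels[i] is True for a correct hit and False for a false alarm.
    Both classes must be present.
    """
    n_pos = sum(1 for y in labels if y)
    n_neg = len(labels) - n_pos
    best_gap, eer = None, None
    for threshold in sorted(set(scores)) + [float("inf")]:
        miss = sum(1 for s, y in zip(scores, labels) if y and s < threshold) / n_pos
        fa = sum(1 for s, y in zip(scores, labels) if not y and s >= threshold) / n_neg
        gap = abs(miss - fa)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (miss + fa) / 2
    return eer


def relative_reduction(baseline_eer: float, fused_eer: float) -> float:
    """Relative EER reduction, the form in which the abstract reports its gain."""
    return (baseline_eer - fused_eer) / baseline_eer
```

Calling `fuse_detections([baseline_hits, phone_set_hits, nn_hits])` with three hit lists returns one fused list that can then be scored with `equal_error_rate`. Averaging with zeros for missing hits rewards agreement between systems, which is one simple way to use their complementarity; weighted or learned combinations are equally plausible readings of "suitable fusion criterion".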