中文语音合成系统中的一种两层韵律结构生成体系

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

中文语音合成系统中的一种两层韵律结构生成体系

董远, 周涛, 董乘宇, 王海拉

文章导航 > 自动化学报 > 2010 > 36(11): 1569-1574

董远, 周涛, 董乘宇, 王海拉. 中文语音合成系统中的一种两层韵律结构生成体系. 自动化学报, 2010, 36(11): 1569-1574. doi: 10.3724/SP.J.1004.2010.01569

引用本文:

董远, 周涛, 董乘宇, 王海拉. 中文语音合成系统中的一种两层韵律结构生成体系. 自动化学报, 2010, 36(11): 1569-1574. doi: 10.3724/SP.J.1004.2010.01569

DONG Yuan, ZHOU Tao, DONG Cheng-Yu, WANG Hai-La. A Two-stage Prosodic Structure Generation Strategy for Mandarin Text-to-speech Systems. ACTA AUTOMATICA SINICA, 2010, 36(11): 1569-1574. doi: 10.3724/SP.J.1004.2010.01569

Citation:

DONG Yuan, ZHOU Tao, DONG Cheng-Yu, WANG Hai-La. A Two-stage Prosodic Structure Generation Strategy for Mandarin Text-to-speech Systems. ACTA AUTOMATICA SINICA, 2010, 36(11): 1569-1574. doi: 10.3724/SP.J.1004.2010.01569

董远, 周涛, 董乘宇, 王海拉. 中文语音合成系统中的一种两层韵律结构生成体系. 自动化学报, 2010, 36(11): 1569-1574. doi: 10.3724/SP.J.1004.2010.01569

引用本文:

董远, 周涛, 董乘宇, 王海拉. 中文语音合成系统中的一种两层韵律结构生成体系. 自动化学报, 2010, 36(11): 1569-1574. doi: 10.3724/SP.J.1004.2010.01569

DONG Yuan, ZHOU Tao, DONG Cheng-Yu, WANG Hai-La. A Two-stage Prosodic Structure Generation Strategy for Mandarin Text-to-speech Systems. ACTA AUTOMATICA SINICA, 2010, 36(11): 1569-1574. doi: 10.3724/SP.J.1004.2010.01569

Citation:

DONG Yuan, ZHOU Tao, DONG Cheng-Yu, WANG Hai-La. A Two-stage Prosodic Structure Generation Strategy for Mandarin Text-to-speech Systems. ACTA AUTOMATICA SINICA, 2010, 36(11): 1569-1574. doi: 10.3724/SP.J.1004.2010.01569

中文语音合成系统中的一种两层韵律结构生成体系

doi: 10.3724/SP.J.1004.2010.01569

1.
北京邮电大学北京 100876
2.
法国电信北京研究中心北京 100192

通讯作者:
周涛

计量
- 文章访问数: 2200
- HTML全文浏览量: 46
- PDF下载量: 901
- 被引次数: 0
出版历程
- 收稿日期: 2009-03-25
- 修回日期: 2010-07-01
- 刊出日期: 2010-11-20

A Two-stage Prosodic Structure Generation Strategy for Mandarin Text-to-speech Systems

1.
Beijing University of Posts and Telecommunications, Beijing 100876, P.R. China;
2.
France Telecom R&D (Beijing), Beijing 100190, P.R. China

More Information

Corresponding author: ZHOU Tao

摘要: 韵律结构生成是改进一个语音合成系统中的合成语音的完整度和自然度的重要组成部分. 韵律词和韵律短语的自动切分是中文层级韵律结构的两个重要的基本层面, 本文调研了这个基本问题, 并提出了一种两层韵律结构生成体系. 为此, 我们建立了条件随机场模型为韵律词和韵律短语的预测选取不同的前端特征. 除此之外, 我们还引入了基于转换的错误驱动学习模块来修正后端的初始预测. 实验结果显示, 这种结合条件随机场和错误驱动学习的方法使得韵律词和韵律短语的自动分割的F-score值达到了94.66%.
- 语音合成 /
- 字音转换 /
- 韵律结构生成 /
- 条件随机场 /
- 错误驱动学习
Abstract: Prosodic structure generation is the key component in improving the intelligibility and naturalness of synthetic speech for a text-to-speech (TTS) system. This paper investigates the problem of automatic segmentation of prosodic word and prosodic phrase, which are two fundamental layers in the hierarchical prosodic structure of Mandarin, and presents a two-stage prosodic structure generation strategy. Conditional random fields (CRF) models are built for both prosodic word and prosodic phrase prediction at the front end with different feature selections. Besides, a transformation-based error-driven learning (TBL) modification module is introduced in the back end to amend the initial prediction. Experiment results show that the approach combining CRF and TBL achieves an F-score of 94.66%.
- Text-to-speech (TTS) /
- prosodic structure generation /
- conditional random fields (CRF) /
- transformation-based error-driven learning (TBL) /

参考文献(0)

资源附件(0)

计量

文章访问数: 2200
HTML全文浏览量: 46
PDF下载量: 901
被引次数: 0

/

下载: 全尺寸图片幻灯片

分享

用微信扫码二维码

分享至好友和朋友圈

返回

版权所有 © 《自动化学报》编辑部京ICP备14019135号-6

地址：北京中关村东路95号邮政编码：100190E-mail：aas_editor@ia.ac.cn

电话：010-82544677 (日常咨询和稿件处理)，010-82544653(费用管理、寄刊)

本系统由北京仁和汇智信息技术有限公司开发技术支持： info@rhhz.net