-
摘要: 定义句模板和关联词向量作为候选定义句选择的常用特征, 前者确定定义的表达形式, 后者确定定义的叙述内容. 目前, 大多数定义类问题回答系统中都是根据问题Target及其周边词之间的相对位置关系来提取定义句模板和关联词. 在这种基于位置关系的基础上提出了基于依赖关系的定义句模板和关联词抽取方法, 然后将这些特征应用到改进的在线算法MIRA (Margin infused relaxed algorithm), 从而实现对候选定义句子的排序. 这种改进的MIRA算法能够根据学习进程自动调整约束条件, 从而提高算法的收敛速度与性能.Abstract: Definitional pattern and centroid word vector are the popular features for definitional question answering. The former determines how to describe the information, and the latter determines what topics should be concerned. Most current systems use sequence pattern and sequence centroid words extracted only by relative position to question target to identify definition sentences. In contrast to sequence knowledge, we propose dependency-based knowledge, including dependency pattern and dependency centroid word extracted by dependency relation to question target. Specifically, we use the improved ultraconservative online algorithm and margin infused relaxed algorithm (MIRA) for the task of candidate sentences ranking. Experiments have demonstrated that the improved MIRA can greatly increase the system performance in terms of progressive reduction of optimization constraints as training moves forward.
点击查看大图
计量
- 文章访问数: 2313
- HTML全文浏览量: 40
- PDF下载量: 1213
- 被引次数: 0