2.845

2023影响因子

(CJCR)

  • 中文核心
  • EI
  • 中国科技核心
  • Scopus
  • CSCD
  • 英国科学文摘

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于全局覆盖机制与表示学习的生成式知识问答技术

刘琼昕 王亚男 龙航 王佳升 卢士帅

刘琼昕, 王亚男, 龙航, 王佳升, 卢士帅. 基于全局覆盖机制与表示学习的生成式知识问答技术. 自动化学报, 2022, 48(10): 2392−2405 doi: 10.16383/j.aas.c190785
引用本文: 刘琼昕, 王亚男, 龙航, 王佳升, 卢士帅. 基于全局覆盖机制与表示学习的生成式知识问答技术. 自动化学报, 2022, 48(10): 2392−2405 doi: 10.16383/j.aas.c190785
Liu Qiong-Xin, Wang Ya-Nan, Long Hang, Wang Jia-Sheng, Lu Shi-Shuai. Generative knowledge question answering technology based on global coverage mechanism and representation learning. Acta Automatica Sinica, 2022, 48(10): 2392−2405 doi: 10.16383/j.aas.c190785
Citation: Liu Qiong-Xin, Wang Ya-Nan, Long Hang, Wang Jia-Sheng, Lu Shi-Shuai. Generative knowledge question answering technology based on global coverage mechanism and representation learning. Acta Automatica Sinica, 2022, 48(10): 2392−2405 doi: 10.16383/j.aas.c190785

基于全局覆盖机制与表示学习的生成式知识问答技术

doi: 10.16383/j.aas.c190785
基金项目: 国家自然科学基金 (62072039) 资助
详细信息
    作者简介:

    刘琼昕:北京理工大学计算机学院副教授. 主要研究方向为人工智能, 自然语言处理, 具体研究知识推理, 关系抽取, 任务规划, 决策支持. 本文通信作者. E-mail: summer@bit.edu.cn

    王亚男:北京理工大学计算机学院硕士研究生. 主要研究方向为自然语言处理, 问答系统.E-mail: wyn1895@163.com

    龙航:北京理工大学计算机学院硕士研究生. 主要研究方向为自然语言处理, 表示学习, 问答系统.E-mail: longhang@ict.ac.cn

    王佳升:北京理工大学硕士研究生. 主要研究方向为深度学习, 自然语言处理和知识图谱.E-mail: 3120191049@bit.edu.cn

    卢士帅:北京理工大学计算机学院硕士研究生. 主要研究方向为自然语言处理领域的关系提取, 特别是关系提取中的小样本学习.E-mail: 3120191028@bit.edu.cn

Generative Knowledge Question Answering Technology Based on Global Coverage Mechanism and Representation Learning

Funds: Supported by National Natural Science Foundation of  China (62072039) 
More Information
    Author Bio:

    LIU Qiong-Xin Associate professor at the School of Computer Science and Technology, Beijing Institute of Technology. Her research interest covers artificial intelligence and natural language processing, specifically knowledge reasoning, relationship extraction, task planning, and decision support. Corresponding author of this paper

    WANG Ya-Nan Master student at the School of Computer Science and Technology, Beijing Institute of Technology. Her research interest covers natural language processing and question answering system

    LONG Hang Master student at the School of Computer Science and Technology, Beijing Institute of Technology. His research interest covers natural language processing, representation learning, and question answering system

    WANG Jia-Sheng Master student at the School of Computer Science and Technology, Beijing Institute of Technology. His research interest covers deep learning, natural language processing, and knowledge graphs

    LU Shi-Shuai Master student at the School of Computer Science and Technology, Beijing Institute of Technology. His research interest covers relation extraction in the field of natural language processing, especially few-shot-learning in relation extraction

  • 摘要: 针对现有生成式问答模型中陌生词汇导致答案准确率低下的问题和模式混乱导致的词汇重复问题, 本文提出引入知识表示学习结果的方法提高模型识别陌生词汇的能力, 提高模型准确率. 同时本文提出使用全局覆盖机制以平衡不同模式答案生成的概率, 减少由预测模式混乱导致的重复输出问题, 提高答案的质量. 本文在知识问答模型基础上结合知识表示学习的推理结果, 使模型具备模糊回答的能力. 在合成数据集和现实世界数据集上的实验证明了本模型能够有效地提高生成答案的质量, 能对推理知识进行模糊回答.
  • 图  1  MCQA 模型图

    Fig.  1  The overall diagram of MCQA

    图  2  模型词典示意图

    Fig.  2  The diagram of vocabulary

    图  3  解码器工作机制示意图

    Fig.  3  The diagram of working mechanism of decoder

    图  4  MCQA (TE, CE)与CoreQA 答案对比样例

    Fig.  4  The comparison of MCQA (TE, CE) and CoreQA sample outputs

    图  5  社区问答样例

    Fig.  5  The sample outputsof community QA

    图  6  知识补全示意图

    Fig.  6  The diagram of knowledge base completion

    图  7  模糊问答样例

    Fig.  7  The sample outputs of ambiguously QA

    表  1  问答数据集规模

    Table  1  The size of QA datasets

    数据集 问答对数量 关系数量
    SimpleQuestions 101 754 1 631
    生日问答数据集 239 922 5
    社区问答数据集 505 021 4 011
    下载: 导出CSV

    表  2  SimpleQuestion 数据集实验结果

    Table  2  The experimental results of SimpleQuestion datasets

    方法 准确率 (%)
    BiCNN[13] 90.0
    AMPCNN[29] 91.3
    HR-BiLSTM[30] 93.3
    CoreQA 92.8
    MCQA (WE, CE) 93.8
    MCQA (TE, CE) 94.3
    下载: 导出CSV

    表  3  生日数据集实验结果 (%)

    Table  3  The experimental results of birthday datasets (%)

    方法 $ {{P}_{g}} $ $ {{P}_{y}} $ $ {{P}_{m}} $ $ {{P}_{d}} $ $ {{P}_{r}} $
    Seq2Seq 67.3 23.4 37.2
    NMT 71.6 27.1 54.7
    CopyNet 75.2 71.9
    GenQA (本文) 73.4 63.2 65.8 77.1 62.6
    CoreQA 75.6 84.8 93.4 81 80.3
    MCQA (WE, CE) 89.8 89.1 98.4 93.2 84.1
    MCQA (TE, CE) 88.6 89.4 98.7 93.6 84.6
    下载: 导出CSV

    表  4  社区问答实验结果 (%)

    Table  4  The experimental results of community QA datasets (%)

    方法 正确性 流畅性 一致性
    CopyNet 19.4 21.3
    GenQA (本文) 24.3 38.3 24.1
    CoreQA 49.3 51.8 62.5
    MCQA (WE, CE) 52.3 55.8 65.2
    MCQA (TE, CE) 54.1 56.3 65.0
    下载: 导出CSV

    表  5  模糊问答推理结果 (%)

    Table  5  The prediction results of ambiguously QA (%)

    方法 $ {{P}_{y}} $ $ {{P}_{m}} $ $ {{P}_{d}} $
    PTransE 93.2 97.4 95.0
    下载: 导出CSV

    表  6  模糊问答结果 (%)

    Table  6  The results of ambiguously QA (%)

    方法 $ {{F1}_{t}} $ $ {{P}_{y}} $ $ {{P}_{m}} $ $ {{P}_{d}} $ $ {{P}_{r}} $
    MCQA (WE, CE) 87.7 78.1 88.2 90.8 80.9
    下载: 导出CSV
  • [1] Vanessa L, Victoria U, Marta S, Enrico M. Is question answering fit for the semantic web?: A survey. Semantic Web, 2011, 2(2): 125-155 doi: 10.3233/SW-2011-0041
    [2] Sydorova A, Poerner N, Roth B. Interpretable question answering on knowledge bases and text. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy: ACL, 2019. 4943−4951
    [3] Cho K, Merriënboer B V, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. Learning phrase representations using rnn encoder-decoder for statistical machine translation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Doha, Qatar: Association for Computational Linguistics, 2014. 1724−1734
    [4] Gu J T, Lu Z D, Li H, Li V O K. Incorporating copying mechanism in sequence-to-sequence learning. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Berlin, Germany: the Association for Computational Linguistics, 2016. 1631−1640
    [5] Gulcehre C, Ahn S, Nallapati R, Zhou B W, Bengio Y. Pointing the unknown words. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Berlin, Germany: the Association for Computational Linguistics, 2016. 140−149
    [6] He S Z, Liu C, Liu K, Zhao J. Generating natural answers by incorporating copying and retrieving mechanisms in sequence-to-sequence learning. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Vancouver, Canada: the Association for Computational Linguistics, 2017. 199−208
    [7] 刘康, 张元哲, 纪国良, 来斯惟, 赵军. 基于表示学习的知识库问答研究进展与展望. 自动化学报, 2016, 42(6): 807-818

    Liu Kang, Zhang Yuan-Zhe, Ji Guo-Liang, Lai Si-Wei, Zhao Jun. Representation learning for question answering over knowledge base: an overview. Acta Automatica Sinica, 2016, 42(6): 807-818
    [8] Tang X, Chen L, Cui J, Wei B. Knowledge representation learning with entity descriptions, hierarchical types, and textual relations. Information Processing & Management, 2019, 56(3): 809-822
    [9] Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. arXiv: 1301.3781, 2013.
    [10] Unger C, Freitas A, Cimiano P. An introduction to question answering over linked data. In: Proceedings of Reasoning on the Web in the Big Data Era — the 10th International Summer School. Athens, Greece: IEEE, 2014. 100−140
    [11] Bast H, Haussmann E. More accurate question answering on freebase. In: Proceedings of the 24th International Conference on Information and Knowledge Management. Melbourne, VIC, Australia: ACM, 2015. 1431−1440
    [12] Bordes A, Chopra S, Weston J. Question answering with subgraph embeddings. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Doha, Qatar: ACL, 2014. 615−620
    [13] Yih W, Chang M W, He X D, Gao J F. Semantic parsing via staged query graph generation: Question answering with knowledge base. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing. Beijing, China: ACL, 2015. 1321−1331
    [14] Sun Y W, Zhang L L, Cheng G, Qu Y Z. Sparqa: Skeleton-based semantic parsing for complex questions over knowledge bases. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence. New York, USA: arXiv: 2003.13956, 2020.
    [15] Xu K, Wu L F, Wang Z G, Yu M, Chen L W, Sheinin V. Exploiting rich syntactic information for semantic parsing with graph-to-sequence model. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Brussels, Belgium: ACL, 2018. 918−924
    [16] Miller A, Fisch A, Dodge J, Karimi A H, Weston J. Key-value memory networks for directly reading documents. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Austin, Texas, USA: ACL, 2016. 1400−1409
    [17] Bordes A, Usunier N, Chopra S, Weston J. Large-scale simple question answering with memory networks. arXiv: 1506.02075. 2015.
    [18] Yin J, Jiang X, Lu Z D, Shang L F, Li H, Li X M. Neural generative question answering. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence. New York, USA: IJCAI/AAAI, 2016. 2972−2978
    [19] Liu C, He S Z, Liu K, Zhao J. Curriculum learning for natural answer generation. In: Proceedings of the 27th International Joint Conference on Artificial Intelligence. Stockholm, Sweden: IJCAI/AAAI, 2018. 4223−4229
    [20] Wang T Z, Cai M, Li J X. A neural conversational model using MMI-WMD decoder based on the Seq2Seq with attention mechanism. In: Proceedings of the 2019 Chinese Control and Decision Conference (CCDC). Nanchang, China: IEEE, 2019. 2696−2700
    [21] Sharma A, Contractor D, Kumar H, Joshi S. Neural conversational QA: Learning to reason vs exploiting patterns. arXiv: 1909.03759, 2019.
    [22] Lei W Q, Jin X, Kan M Y, Ren Z C, He X N, Yin D W. Sequicity: Simplifying task-oriented dialogue systems with single sequence-to-sequence architectures. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Melbourne, Australia: ACL, 2018. 1437−1447
    [23] Rashkin H, Smith E M, Li M, Boureau Y L. Towards empathetic open-domain conversation models: A new benchmark and dataset. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy: ACL, 2019. 5370−5381
    [24] Lin Y K, Liu Z Y, Sun M S. Modeling relation paths for representation learning of knowledge bases. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Lisbon, Portugal: ACL, 2015. 705−714
    [25] Quan W, Mao Z D, Wang B, Li G. Knowledge graph embedding: A survey of approaches and applications. IEEE Transactions on Knowledge & Data Engineering, 2017, 29(12): 2724-2743
    [26] Bordes A, Weston J, Usunier N. Open question answering with weakly supervised embedding models. In: Proceedings of the 2014 Machine Learning and Knowledge Discovery in Databases European Conference. Nancy, France: Springer, 2014. 165−180
    [27] Bahdanau D, Cho K Y, Bengio Y. Neural machine translation by jointly learning to align and translate. Arxiv: 1409.0473, 2014
    [28] 冯冲, 石戈, 郭宇航, 龚静, 黄河燕. 基于词向量语义分类的微博实体链接方法. 自动化学报, 2016, 42(6): 915-922

    Feng C, Shi G, Guo YH, Gong J, Huang HY. An entity linking method for microblog based on semantic categorization by word embeddings. Acta Autom. Sinica, 2016, 42(6): 915-922
    [29] Yin W P, Yu M, Xiang B, Zhou B, Schutze H. Simple question answering by attentive convolutional neural network. ArXiv: 1606.03391, 2016.
    [30] Yu M, Yin W P, Hasan K S, Santos C D, Xiang B, Zhou B W. Improved neural relation detection for knowledge base question answering. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Vancouver, Canada: ACL, 2017. 571−581
    [31] Deng Y, Xie Y X, Li Y L, Yang M, Shen Y. Multi-task learning with multi-view attention for answer selection and knowledge base question answering. In: Proceedings of the 33rd Conference on Artificial Intelligence. Honolulu, Hawaii, USA: AAAI, 2019. 6318−6325
  • 加载中
图(7) / 表(6)
计量
  • 文章访问数:  695
  • HTML全文浏览量:  201
  • PDF下载量:  268
  • 被引次数: 0
出版历程
  • 收稿日期:  2019-11-15
  • 录用日期:  2020-04-10
  • 网络出版日期:  2022-09-16
  • 刊出日期:  2022-10-14

目录

    /

    返回文章
    返回