Representation Learning for Question Answering over Knowledge Base: An Overview

LIU Kang, ZHANG Yuan-Zhe, JI Guo-Liang, LAI Si-Wei, ZHAO Jun

Citation: LIU Kang, ZHANG Yuan-Zhe, JI Guo-Liang, LAI Si-Wei, ZHAO Jun. Representation Learning for Question Answering over Knowledge Base: An Overview. ACTA AUTOMATICA SINICA, 2016, 42(6): 807-818. doi: 10.16383/j.aas.2016.c150674

doi: 10.16383/j.aas.2016.c150674

Funds: 

National Natural Science Foundation of China 61533018

National Basic Research Program of China (973 Program) 2014CB340503

More Information
    Author Bio:

    ZHANG Yuan-Zhe Ph. D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interest covers question answering and natural language processing. E-mail: yzzhang@nlpr.ia.ac.cn

    JI Guo-Liang Ph. D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interest covers knowledge engineering and natural language processing. E-mail: guoliang.ji@nlpr.ia.ac.cn

    LAI Si-Wei Ph. D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interest covers representation learning and natural language processing. E-mail: swlai@nlpr.ia.ac.cn

    ZHAO Jun Professor at the Institute of Automation, Chinese Academy of Sciences. His research interest covers information retrieval, information extraction, web mining, and question answering. E-mail: jzhao@nlpr.ia.ac.cn

    Corresponding author: LIU Kang Associate professor at the Institute of Automation, Chinese Academy of Sciences. His research interest covers question answering, opinion mining, and natural language processing. Corresponding author of this paper. E-mail: kliu@nlpr.ia.ac.cn
  • Abstract: Question answering over knowledge base (KBQA) is an important component of question answering systems. In recent years, with the successful application of representation learning techniques, exemplified by deep learning, across many fields, many researchers have turned to representation learning based KBQA. Its basic assumption is to treat KBQA as a semantic matching process: by learning semantic representations of both the knowledge base and the user question, the entities and relations in the knowledge base as well as the question text are mapped to numerical vectors in a low-dimensional semantic space, and on this basis the answer that is semantically most similar to the user question is matched directly by numerical computation. Current results show that representation learning based KBQA systems already outperform traditional KBQA methods. This paper surveys existing research on representation learning based KBQA, covering representative work on knowledge base representation learning and on question (text) representation learning, and analyzes and discusses the difficulties involved as well as the open research problems that remain.
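  • A minimal illustrative sketch of the semantic-matching formulation described in the abstract: the Python snippet below embeds a toy question and a handful of toy knowledge-base triples in the same low-dimensional space and returns the candidate answer with the highest matching score. The toy vocabulary, the random vectors, and names such as embed_question and embed_candidate are assumptions made for illustration, not the specific models surveyed in the paper.

    # Sketch only: averaged word vectors stand in for the question encoders
    # (CNN/RNN/recursive networks) discussed in the survey, and a summed
    # triple embedding stands in for the knowledge-base side.
    import numpy as np

    DIM = 50
    rng = np.random.default_rng(0)

    # Toy vocabulary and knowledge base: every word, entity and relation gets a
    # low-dimensional vector (random here; learned jointly in a real system).
    word_vec   = {w: rng.standard_normal(DIM) for w in
                  ["who", "directed", "avatar", "wrote", "hamlet"]}
    entity_vec = {e: rng.standard_normal(DIM) for e in
                  ["Avatar", "James_Cameron", "Hamlet", "William_Shakespeare"]}
    rel_vec    = {r: rng.standard_normal(DIM) for r in ["directed_by", "written_by"]}

    # (subject, relation, object) triples; each object is a candidate answer.
    kb = [("Avatar", "directed_by", "James_Cameron"),
          ("Hamlet", "written_by", "William_Shakespeare")]

    def embed_question(question):
        # Encode the question as the average of the word vectors it contains.
        words = [w for w in question.lower().rstrip("?").split() if w in word_vec]
        return np.mean([word_vec[w] for w in words], axis=0)

    def embed_candidate(subject, relation, obj):
        # Represent a candidate answer by summing the vectors of its supporting triple.
        return entity_vec[subject] + rel_vec[relation] + entity_vec[obj]

    def answer(question):
        # Match the question against every candidate with a dot-product score
        # and return the best-scoring knowledge-base object.
        q = embed_question(question)
        scores = {o: float(np.dot(q, embed_candidate(s, r, o))) for s, r, o in kb}
        return max(scores, key=scores.get)

    print(answer("Who directed Avatar?"))  # with random vectors the winner is arbitrary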
  • Fig.  1  The process of KBQA

    Fig.  2  Performance comparison between representation learning based KBQA and traditional KBQA methods

    Fig.  3  Representation learning based KBQA

    Fig.  4  RESCAL system architecture[15]

    Fig.  5  TransE, TransH and TransR[24-26]

    Fig.  6  Recursive neural network architecture

    Fig.  7  Recurrent neural network architecture

    Fig.  8  Convolutional neural network architecture[46]

    Fig.  9  Subgraph embedding model[11]

    Fig.  10  CNN model used to process the question[51]

    Fig.  11  Multi-column CNN model[12]

  • [1] Etzioni O. Search needs a shake-up. Nature, 2011, 476(7358) : 25-26
    [2] Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, Mendes P N, Hellmann S, Morsey M, van Kleef P, Auer S, Bizer C. DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web, 2015, 6(2) : 167-195
    [3] Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J. Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. Vancouver, Canada: ACM, 2008. 1247-1250
    [4] Suchanek F M, Kasneci G, Weikum G. Yago: a core of semantic knowledge. In: Proceedings of the 16th International Conference on World Wide Web. Alberta, Canada: ACM, 2007. 697-706
    [5] Kwiatkowski T, Zettlemoyer L, Goldwater S, Steedman M. Lexical generalization in CCG grammar induction for semantic parsing. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. Scotland, UK: Association for Computational Linguistics, 2011. 1512-1523
    [6] Liang P, Jordan M I, Klein D. Learning dependency-based compositional semantics. Computational Linguistics, 2013, 39(2) : 389-446
    [7] Socher R, Perelygin A, Wu J Y, Chuang J, Manning C D, Ng A Y, Potts C. Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Seattle, USA: Association for Computational Linguistics, 2013. 1631-1642
    [8] Cho K, Van Merriënboer B, Gülçehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Doha, Qatar: Association for Computational Linguistics, 2014. 1724-1734
    [9] Socher R, Manning C D, Ng A Y. Learning continuous phrase representations and syntactic parsing with recursive neural networks. In: Proceedings of the NIPS-2010 Deep Learning and Unsupervised Feature Learning Workshop. Vancouver, Canada, 2010. 1-9
    [10] Bordes A, Weston J, Usunier N. Open question answering with weakly supervised embedding models. Machine Learning and Knowledge Discovery in Databases. Berlin Heidelberg: Springer, 2014. 165-180
    [11] Bordes A, Chopra S, Weston J. Question answering with subgraph embeddings. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Doha, Qatar: Association for Computational Linguistics, 2014. 615-620
    [12] Dong L, Wei F R, Zhou M, Xu K. Question answering over Freebase with multi-column convolutional neural networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Beijing, China: ACL, 2015. 260-269
    [13] Nickel M, Tresp V, Kriegel H P. A three-way model for collective learning on multi-relational data. In: Proceedings of the 28th International Conference on Machine Learning. Washington, USA, 2011. 809-816
    [14] Nickel M, Tresp V, Kriegel H P. Factorizing YAGO: scalable machine learning for linked data. In: Proceedings of the 21st International Conference on World Wide Web. Lyon, France: ACM, 2012. 271-280
    [15] Nickel M, Tresp V. Logistic tensor factorization for multi-relational data. In: Proceedings of the 30th International Conference on Machine Learning. Atlanta, USA, 2013.
    [16] Nickel M, Tresp V. Tensor factorization for multi-relational learning. Machine Learning and Knowledge Discovery in Databases. Berlin Heidelberg: Springer, 2013. 617-621
    [17] Nickel M, Murphy K, Tresp V, Gabrilovich E. A review of relational machine learning for knowledge graphs. Proceedings of the IEEE, 2016, 104(1) : 11-33
    [18] Bordes A, Weston J, Collobert R, Bengio Y. Learning structured embeddings of knowledge bases. In: Proceedings of the 25th AAAI Conference on Artificial Intelligence. San Francisco, USA: AAAI, 2011. 301-306
    [19] Bordes A, Glorot X, Weston J, Bengio Y. Joint learning of words and meaning representations for open-text semantic parsing. In: Proceedings of the 15th International Conference on Artificial Intelligence and Statistics. Cadiz, Spain, 2012. 127-135
    [20] Bordes A, Glorot X, Weston J, Bengio Y. A semantic matching energy function for learning with multi-relational data. Machine Learning, 2014, 94(2) : 233-259
    [21] Jenatton R, Le Roux N, Bordes A, Obozinski G. A latent factor model for highly multi-relational data. In: Proceedings of Advances in Neural Information Processing Systems 25. Lake Tahoe, Nevada, United States: Curran Associates, Inc., 2012. 3167-3175
    [22] Sutskever I, Salakhutdinov R, Tenenbaum J B. Modelling relational data using Bayesian clustered tensor factorization. In: Proceedings of Advances in Neural Information Processing Systems 22. Vancouver, Canada, 2009. 1821-1828
    [23] Socher R, Chen D Q, Manning C D, Ng A Y. Reasoning with neural tensor networks for knowledge base completion. In: Proceedings of Advances in Neural Information Processing Systems 26. Stateline, USA: MIT Press, 2013. 926-934
    [24] Bordes A, Usunier N, Garcia-Durán A, Weston J, Yakhnenko O. Translating embeddings for modeling multi-relational data. In: Proceedings of Advances in Neural Information Processing Systems 26. Stateline, USA: MIT Press, 2013. 2787-2795
    [25] Wang Z, Zhang J W, Feng J L, Chen Z. Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the 28th AAAI Conference on Artificial Intelligence. Québec, Canada: AAAI, 2014. 1112-1119
    [26] Lin Y, Zhang J, Liu Z, Sun M, Liu Y, Zhu X. Learning entity and relation embeddings for knowledge graph completion. In: Proceedings of the 29th AAAI Conference on Artificial Intelligence. Austin, USA: AAAI, 2015. 2181-2187
    [27] Ji G L, He S Z, Xu L H, Liu K, Zhao J. Knowledge graph embedding via dynamic mapping matrix. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics. Beijing, China: Association for Computational Linguistics, 2015. 687-696
    [28] Lin Y K, Liu Z Y, Luan H B, Sun M S, Rao S W, Liu S. Modeling relation paths for representation learning of knowledge bases. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Lisbon, Portugal: Association for Computational Linguistics, 2015. 705-714
    [29] Guo S, Wang Q, Wang B, Wang L H, Guo L. Semantically smooth knowledge graph embedding. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Beijing, China: Association for Computational Linguistics, 2015. 84-94
    [30] He S Z, Liu K, Ji G L, Zhao J. Learning to represent knowledge graphs with Gaussian embedding. In: Proceedings of the 24th ACM International Conference on Information and Knowledge Management. Melbourne, Australia: ACM, 2015. 623-632
    [31] Frege G. Über Sinn und Bedeutung. Zeitschrift für Philosophie und philosophische Kritik, 1892, 100: 25-50
    [32] Hermann K M. Distributed Representations for Compositional Semantics [Ph. D. dissertation], University of Oxford, 2014
    [33] Socher R, Karpathy A, Le Q V, Manning C D, Ng A Y. Grounded compositional semantics for finding and describing images with sentences. Transactions of the Association for Computational Linguistics, 2014, 2: 207-218
    [34] Socher R, Huang E H, Pennin J, Manning C D, Ng A Y. Dynamic pooling and unfolding recursive autoencoders for paraphrase detection. In: Proceedings of Advances in Neural Information Processing Systems 24. Granada, Spain, 2011. 801-809
    [35] Socher R, Pennington J, Huang E H, Ng A Y, Manning C D. Semi-supervised recursive autoencoders for predicting sentiment distributions. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. Scotland, UK: Association for Computational Linguistics, 2011. 151-161
    [36] Socher R, Lin C C Y, Ng A Y, Manning C D. Parsing natural scenes and natural language with recursive neural networks. In: Proceedings of the 28th International Conference on Machine Learning. Bellevue, WA, USA, 2011. 129-136
    [37] Socher R, Bauer J, Manning C D, Ng A Y. Parsing with compositional vector grammars. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Sofia, Bulgaria: Association for Computational Linguistics, 2013. 455-465
    [38] Mitchell J, Lapata M. Composition in distributional models of semantics. Cognitive Science, 2010, 34(8) : 1388-1429
    [39] Socher R, Huval B, Manning C D, Ng A Y. Semantic compositionality through recursive matrix-vector spaces. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Jeju Island, Korea: Association for Computational Linguistics, 2012. 1201-1211
    [40] Elman J L. Finding structure in time. Cognitive Science, 1990, 14(2) : 179-211
    [41] Bengio Y, Simard P, Frasconi P. Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 1994, 5(2) : 157-166
    [42] Hochreiter S. The vanishing gradient problem during learning recurrent neural nets and problem solutions. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 1998, 6(2) : 107-116
    [43] Hochreiter S, Schmidhuber J. Long short-term memory. Neural Computation, 1997, 9(8) : 1735-1780
    [44] Fukushima K. Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics, 1980, 36(4) : 193-202
    [45] LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998, 86(11) : 2278-2324
    [46] Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P. Natural language processing (almost) from scratch. The Journal of Machine Learning Research, 2011, 12: 2493-2537
    [47] Kalchbrenner N, Grefenstette E, Blunsom P. A convolutional neural network for modelling sentences. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Baltimore, USA: Association for Computational Linguistics, 2014. 655-665
    [48] Kim Y. Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Doha, Qatar: Association for Computational Linguistics, 2014. 1746-1751
    [49] Zeng D H, Liu K, Lai S W, Zhou G Y, Zhao J. Relation classification via convolutional deep neural network. In: Proceedings of the 25th International Conference on Computational Linguistics. Dublin, Ireland: Association for Computational Linguistics, 2014. 2335-2344
    [50] Fader A, Soderland S, Etzioni O. Identifying relations for open information extraction. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. Scotland, UK: Association for Computational Linguistics, 2011. 1535-1545
    [51] Yih W T, He X D, Meek C. Semantic parsing for single-relation question answering. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Baltimore, USA: Association for Computational Linguistics, 2014. 643-648
    [52] Fader A, Zettlemoyer L S, Etzioni O. Paraphrase-driven learning for open question answering. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Sofia, Bulgaria: Association for Computational Linguistics, 2013. 1608-1618
    [53] Berant J, Liang P. Semantic parsing via paraphrasing. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Baltimore, USA: Association for Computational Linguistics, 2014. 1415-1425
    [54] Yao X C, Van Durme B. Information extraction over structured data: question answering with Freebase. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Baltimore, USA: Association for Computational Linguistics, 2014. 956-966
    [55] Zhao H, Lu Z D, Poupart P. Self-adaptive hierarchical sentence model. In: Proceedings of the 24th International Conference on Artificial Intelligence. Buenos Aires, Argentina: AAAI, 2015. 4069-4076
    [56] Luong M T, Pham H, Manning C D. Effective approaches to attention-based neural machine translation. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Lisbon, Portugal: ACL, 2015. 1412-1421
    [57] Fader A, Zettlemoyer L, Etzioni O. Open question answering over curated and extracted knowledge bases. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA: ACM, 2014. 1156-1165
Publication History
  • Received:  2015-11-02
  • Accepted:  2016-05-03
  • Published:  2016-06-20
