Identifying Word Sentiment Orientation for Free Comments via Complex Network
-
摘要: 词汇情感倾向性(Word sentiment orientation, WSO)的鉴定通常是对文本进行粗粒度意见挖掘的基础.自由评论中存在许多语法噪声, 这使得以往基于规范文本提出的WSO鉴定方法不再适合自由评论. 自由评论中的情感词汇往往是上下文敏感的, 这使得非当前鉴定的情感词汇难以适用于当前自由评论的粗粒度意见挖掘. 针对上述问题,提出一种新的利用复杂网络为自由评论鉴定WSO的方法. 该方法主要有两个部分: 1)为了利用自由评论中词汇之间的上下文信息建模一个能够有效解决上下文敏感问题且具有良好抗噪声能力的情感倾向性关系网络(Sentiment orientation relationship network, SORN),提出了两个算法:金字塔抗噪声信息模型算法和利用抗噪声信息优化调整SORN的算法; 2)为了有效利用SORN为自由评论鉴定WSO,提出了基于SORN的WSO鉴定算法. 实验表明:对于在线为自由评论鉴定WSO,本文方法不仅在精确度方面远高于Hatzivassiloglou提出的方法,且具有良好的时间效率.Abstract: Identifying word sentiment orientation (WSO) is usually the foundation of mining coarse-grained emotion information. In free comments, there exist many grammatical errors which disable previous grammatical text-based methods in identifying WSO for free comments, and there exist some context-sensitive words which disable offline opinion words in mining coarse-grained emotion information. In view of the above questions, a new method which identifies WSO for free comments via complex network is proposed. This method consists of two parts. The first part makes use of context information in free comments to build a sentiment orientation relationship network (SORN) for effectively solving the context sensitive and noise problems. For this purpose, two algorithms are brought forward. One is the algorithm for building the pyramid anti-noise information model and the other is the algorithm for optimizing the sentiment orientation relationship network by anti-noise information. The second part identifies WSO for free comments via SORN. For this purpose, the SORN-based WSO algorithm is put forward. Experimental results show that our method far exceeds HM in identifying WSO for free comments and has good timeliness.
-
Key words:
- Opinion mining /
- free comments /
- word sentiment orientation (WSO) /
- complex network
-
[1] Pang B, Lee L. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2008, 2(1-2): 1-135[2] Xia Yun-Qing, Wong Kam-Fai. Methods and practice in Chinese network informal language processing. In: Proceedings of the 8th National Joint Conference on Computational Linguistics. Nanjing, China: Tsinghua University Press, 2005. 566-572(夏云庆, 黄锦辉. 中文网络非正规语言处理的方法与实践. 全国第八届计算语言学联合学术会议. 南京, 中国: 清华大学出版社, 2005. 566-572)[3] Zhao Yan-Yan, Qin Bing, Liu Ting. Sentiment analysis. Journal of Software, 2010, 21(8): 1834-1848(赵妍妍, 秦兵, 刘挺. 文本情感分析. 软件学报, 2010, 21(8): 1834-1848)[4] Zhao Yan-Yan, Qin Bing, Liu Ting. Integrating intra- and inter-document evidences for improving sentence sentiment classification. Acta Automatica Sinica, 2010, 36(10): 1417-1425(赵妍妍, 秦兵, 刘挺. 文基于图的篇章内外特征相融合的评价句极性识别. 自动化学报, 2010, 36(10): 1417-1425)[5] Khan A, Baharudin B, Khan K. Sentiment classification using sentence-level lexical based semantic orientation of online reviews. Trends in Applied Sciences Research, 2011, 6(10): 1141-1157[6] Yang Feng, Peng Qin-Ke, Xu Tao. Sentiment classification for online comments based on random network theory. Acta Automatica Sinica, 2010, 36(6): 837-844(杨锋, 彭勤科, 徐涛. 基于随机网络的在线评论情绪倾向性分类. 自动化学报, 2010, 36(6): 837-844)[7] Gao Yang, Zhou Li, Zhang Yong, Xing Chun-Xiao, Sun Yi-Gang, Zhu Xian-Zhong. Sentiment classification for stock news. Journal of Software, 2010, 21(Supplement): 349-362(高旸, 周莉, 张勇, 邢春晓, 孙一钢, 朱先忠. 面向股票新闻的情感分类方法. 软件学报, 2010, 21(zk): 349-362)[8] Wang Su-Ge, Li De-Yu, Wei Ying-Jie. A method of text sentiment classification based on weighted rough membership. Journal of Computer Research and Development, 2011, 48(5): 855-861(王素格, 李德玉, 魏英杰. 基于赋权粗糙隶属度的文本情感分类方法. 计算机研究与发展, 2011, 48(5): 855-861)[9] Zhao Yan-Yan, Qin Bing, Che Wan-Xiang, Liu Ting. Appraisal expression recognition based on syntactic path. Journal of Software, 2011, 22(5): 887-898(赵妍妍, 秦兵, 车万翔, 刘挺. 基于句法路径的情感评价单元识别 软件学报, 2011, 22(5): 887-898)[10] Hatzivassiloglou V, McKeown K R. Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and the 8th Conference of the European Chapter of the Association for Computational Linguistics. Madrid, Spain: ACL, 1997. 174-181[11] Turney P, Littman M. Measuring praise and criticism: inference of semantic orientation from association. ACM Transactions on Information Systems, 2003, 21(4): 315-346[12] Du Wei-Fu, Tan Song-Bo, Yun Xiao-Chun, Cheng Xue-Qi. A new method to compute semantic orientation. Journal of Computer Research and Development, 2009, 46(10): 1713-1720(杜伟夫, 谭松波, 云晓春, 程学旗. 一种新的情感词汇语义倾向计算方法. 计算机研究与发展, 2009, 46(10): 1713-1720)[13] Kamps J, Marx M, Mokken R J, Rijke M. Using WordNet to measure semantic orientation of adjectives. In: Proceedings of the 4th International Conference on Language Resources and Evaluation. Lisbon, Portugal: European Language Resources Association, 2004. 1115-1118[14] Esuli A, Sebastiani F. Determining the semantic orientation of terms through gloss classification. In: Proceedings of the 14th ACM International Conference on Information and Knowledge Management. Bremen, Germany: ACM, 2005. 617-624[15] Esuli A, Sebastiani F. Determining term subjectivity and term orientation for opinion mining. In: Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics. Trento, Italy: ACL, 2006. 193-200[16] Esuli A, Sebastiani F. Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation. Genoa, Italy: European Language Resources Association, 2006. 417-422[17] Kim S M, Hovy E. Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics. Stroudsburg, USA: ACL, 2004. 1367-1373[18] Rao D, Ravichandran D. Semi-supervised polarity lexicon induction. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics. Athens, Greece: ACL, 2009. 675-682[19] Zhu Yan-Lan, Min Jin, Zhou Ya-Qian, Huang Xuan-Jing, Wu Li-De. Semantic orientation computing based on HowNet. Journal of Chinese Information Processing, 2006, 20(1): 14-20(朱嫣岚, 闵锦, 周雅倩, 黄萱菁, 吴立德. 基于HowNet的词汇语义倾向计算. 中文信息学报, 2006, 20(1): 14-20)[20] Cancho R F, Sole R V. The small world of human language. Proceedings of the Royal Society of London, Series B: Biological Sciences, 2001, 268(1482): 2261-2265[21] Sole R V, Murtra B C, Valverde S, Steels L. Language networks: their structure, function and evolution. Complexity, 2010, 15(6): 20-26[22] Albert R, Barabasi A L. Statistical mechanics of complex networks. Reviews of Modern Physics, 2002, 74(1): 47-97[23] Cormen T H, Leiserson C E, Rivest R L, Stein C. Introduction to Algorithms (Second Edition). Cambridge: The MIT Press, 2001. 595-601[24] Hatzivassiloglou V, Wiebe J M. Effects of adjective orientation and gradability on sentence subjectivity. In: Proceedings of the 18th International Conference on Computational Linguistics. Saarbrucken, Germany: ACL, 2000. 299-305
点击查看大图
计量
- 文章访问数: 2199
- HTML全文浏览量: 61
- PDF下载量: 987
- 被引次数: 0