电子学报 ›› 2016, Vol. 44 ›› Issue (10): 2471-2476.DOI: 10.3969/j.issn.0372-2112.2016.10.027

• 学术论文 • 上一篇    下一篇

基于词汇语义和句法依存的情感关键句识别

冯冲, 廖纯, 刘至润, 黄河燕   

  1. 北京理工大学计算机学院, 北京 100081
  • 收稿日期:2015-02-03 修回日期:2015-07-20 出版日期:2016-10-25
    • 通讯作者:
    • 冯冲
    • 作者简介:
    • 廖纯,女,1990年生于河南驻马店,北京理工大学计算机学院硕士研究生,主要研究方向为社交网络,评价对象评价词抽取,情感倾向性分析.E-mail:cliao@bit.edu.cn
    • 基金资助:
    • 国家重点基础研究发展计划 (No.2013CB329605,No.2013CB329303); 国家自然科学基金重点项目 (No.61132009,No.61201351); 国家高技术研究发展计划863项目 (No.2015AA015404)

Sentiment Key Sentence Identification Based on Lexical Semantics and Syntactic Dependency

FENG Chong, LIAO Chun, LIU Zhi-run, HUANG He-yan   

  1. Beijing Institute of Technology, Beijing 100081, China
  • Received:2015-02-03 Revised:2015-07-20 Online:2016-10-25 Published:2016-10-25

摘要:

门户网站、博客和论坛中的新闻性文章往往都带有自己的情感倾向性,而情感关键句的识别对判断文章的情感倾向、了解社会动态和舆情状况有着非常重要的作用.传统方法主要基于词汇特征,未能充分利用潜在的句法和语义信息.本文提出了一种基于词汇语义和句法依存的情感关键句识别方法.该方法首先通过构建情感词典和关键词词典获取词汇语义信息,然后利用一种新颖的面向情感关键句提取算法获取句法依存信息,最后把情感关键句的识别问题看成一个是否为情感关键句的二分类问题加以解决.在COAE2014公开评测数据集上进行的实验表明本文方法的准确率和召回率均显著优于其他方法.

关键词: 情感关键句, 词汇语义, 句法依存, 支持向量机

Abstract:

A lot of news articles in the portal,blog and forums always have their own emotional orientations and sentiment key sentence identification plays an important role in distinguishing emotional orientation of one article,supervising social trends and public sentiment state.The traditional lexicon-based methods totally depended on lexical semantics and did not excavate the implied syntactic structure.So a hybrid method of sentiment key sentence identification based on lexical semantics and syntactic dependency is proposed in this paper.This approach first gets lexical semantics knowledge from emotion lexicon expansion and keywords lexicon construction,and then this paper proposes a novel dependency templates extraction algorithm for syntactic dependency information to build a dependency knowledge base,finally we regard sentiment key sentence identification as a classification task and perform identification through different groups of features.Experimental results on COAE2014 dataset show that this approach notably outperforms other baselines of sentiment key sentence identification on precision and recall.

Key words: sentiment key sentence identification, lexical semantics, syntactic dependency, support vector machine

中图分类号: