【SO-PMI】Sentiment classification of Movie and Product Reviews Using Contextual Valence Shifters

本文提出一种通过考虑情境情感偏移(如否定和加强词)来确定评论情感方向的方法。使用General Inquirer识别正面、负面词汇及否定、夸张和低调表达,并结合SO-PMI验证了包含情感偏移能显著提升电影评论分类准确性,但对产品评论影响不大。实验证明,否定词比加强词更有效提升准确率。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

Sentiment classification of Movie and Product Reviews Using Contextual Valence Shifters [SO-PMI–2005.1]

In this paper, author proposed a method to determine the sentiment orientation by taking into account contextual valence shifters, which are negations and intensifiers. They used General Inquirer to identify positive and negative terms, as well as negations, overstatements and understatements. By using SO-PMI, they finally figure it out that including contextual valence shifters improves the accuracy of the classification.

In order to prove the accuracy improvement is because of valence shifter instead of other factors, they also compare the basic system and improved system. As for basic system, it only contains methods from GI, while improved system also included CTRW and web corpus. From the result, it is clear that the improvement is statistically significant for movie reviews, while as for products reviews is not.

Moreover, they also want to know which one is more effective, negation or intensifiers? By adding different terms from GI&CTRW, it shows that negation plays an important part in improve accuracy and adding extra overstatements and understatements did not make much difference.

They also noticed that in most cases of their method, classifying product reviews performs better than movie reviews. The resons are just like the paper ”semantic orientation applied to unsupervised classification of reviews” : movies has the worse performance, and the explaination is sometimes positive reviews mention unpleasant things and negative reviews often mention pleasant things because of different plots.

文中提到的术语/方法:

  • 贝叶斯信念网络(Bayesian belief network)与朴素贝叶斯的区别
    图片摘自:https://blog.youkuaiyun.com/weixin_38438451/article/details/82849802
    马尔可夫毯:一个结点的马尔可夫毯为该结点的子结点、父结点以及子结点的父结点;在特征选择时常用,比如有时需要获取目标变量的所有信息,用这个方法只要看其相应的马尔可夫毯中的内容即可
  • Tabu Search(禁忌搜索):一个用来跳脱局部最优解的搜索方法。其先创立一个初始化的方案;基于此,算法“移动”到一相邻的方案。经过许多连续的移动过程,提高解的质量。
  • Bootstrapping: 利用有限的样本进行多次重复抽样,常用于样本数量有限的情况

图摘自https://blog.youkuaiyun.com/weixin_38438451/article/details/82849802

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值