Yu Hongkun, Shang Jingbo, Hsu Meichun, Castellanos Malú, Han Jiawei
Computer Science Department, University of Illinois at Urbana-Champaign, IL, USA.
HPE Vertica, CA, USA.
Proc ACM Int Conf Inf Knowl Manag. 2016 Oct;2016:939-948. doi: 10.1145/2983323.2983793.
Users often write reviews on different themes involving linguistic structures with complex sentiments. The sentiment polarity of a word can be different across themes. Moreover, contextual valence shifters may change sentiment polarity depending on the contexts that they appear in. Both challenges cannot be modeled effectively and explicitly in traditional sentiment analysis. Studying both phenomena requires multi-theme sentiment analysis at the word level, which is very interesting but significantly more challenging than overall polarity classification. To simultaneously resolve the and problems, we propose a data-driven framework to enable both capabilities: (1) polarity predictions of the same word in reviews of different themes, and (2) discovery and quantification of contextual valence shifters. The framework formulates multi-theme sentiment by factorizing the review sentiments with theme/word embeddings and then derives the shifter effect learning problem as a logistic regression. The improvement of sentiment polarity classification accuracy demonstrates not only the importance of and , but also effectiveness of our framework. Human evaluations and case studies further show the success of multi-theme word sentiment predictions and automatic effect quantification of contextual valence shifters.
用户经常围绕涉及复杂情感的语言结构的不同主题撰写评论。一个词的情感极性可能因主题而异。此外,上下文价转移词可能会根据它们出现的上下文改变情感极性。在传统情感分析中,这两个挑战都无法得到有效且明确的建模。研究这两种现象需要在词级进行多主题情感分析,这非常有趣,但比整体极性分类更具挑战性。为了同时解决这两个问题,我们提出了一个数据驱动的框架来实现这两种能力:(1)对不同主题评论中同一个词的极性预测,以及(2)上下文价转移词的发现和量化。该框架通过用主题/词嵌入对评论情感进行分解来制定多主题情感,然后将转移效应学习问题推导为逻辑回归。情感极性分类准确率的提高不仅证明了这两个问题的重要性,也证明了我们框架的有效性。人工评估和案例研究进一步表明了多主题词情感预测和上下文价转移词自动效应量化的成功。