Eskisehir Osmangazi University, Electrical and Electronics Engineering Department, Eskisehir 26480, Turkey.
Comput Intell Neurosci. 2022 Sep 28;2022:5186144. doi: 10.1155/2022/5186144. eCollection 2022.
Internet environments such as social networks, news sites, and blogs are the platforms where people can share their ideas and opinions. Many people share their comments instantly on the internet, which results in creating large volumes of entries. It is important for institutions and organizations to analyze this big data in an efficient and rapid manner to produce summary information about the feelings or opinions of individuals. In this study, we propose a scalable framework that makes sentiment classification by evaluating the compound probability scores of the most widely used methods in sentiment analysis through a fuzzy inference mechanism in an ensemble manner. The designed fuzzy inference system makes the sentiment estimation by evaluating the compound scores of valance aware dictionary, word embedding, and count vectorization processes. The difference of the proposed method from the classical ensemble methods is that it allows weighting of base learners and combines the strengths of each algorithm through fuzzy rules. The sentiment estimation process from text data can be managed either as a 2-class (positive and negative) or as a 3-class (positive, neutral, and negative) problem. We performed the experimental work on four available tagged social network data sets for both 2-class and 3-class classifications and observed that the proposed method provides improvements in accuracy.
互联网环境,如社交网络、新闻网站和博客,是人们分享想法和意见的平台。许多人会在互联网上即时分享他们的评论,从而产生大量的条目。对于机构和组织来说,以高效和快速的方式分析这些大数据,以生成有关个人感受或意见的总结信息非常重要。在这项研究中,我们提出了一个可扩展的框架,通过模糊推理机制以集成方式评估情感分析中最广泛使用的方法的复合概率得分,从而进行情感分类。设计的模糊推理系统通过评估情感感知词典、词嵌入和计数向量化过程的复合得分来进行情感估计。与经典集成方法的不同之处在于,它允许对基础学习者进行加权,并通过模糊规则结合每个算法的优势。可以将文本数据的情感估计过程管理为 2 类(正和负)或 3 类(正、中、负)问题。我们在四个可用的标记社交网络数据集上进行了 2 类和 3 类分类的实验工作,观察到所提出的方法提高了准确性。