• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

加权联合情感-主题模型在情感分析中的应用比较 与 ALGA 相比:基于遗传算法的自适应词汇学习。

Weighted Joint Sentiment-Topic Model for Sentiment Analysis Compared to ALGA: Adaptive Lexicon Learning Using Genetic Algorithm.

机构信息

Department of Computer Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran.

Department of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia, Iran.

出版信息

Comput Intell Neurosci. 2022 Jul 31;2022:7612276. doi: 10.1155/2022/7612276. eCollection 2022.

DOI:10.1155/2022/7612276
PMID:35965748
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9374039/
Abstract

Latent Dirichlet Allocation (LDA) is an approach to unsupervised learning that aims to investigate the semantics among words in a document as well as the influence of a subject on a word. As an LDA-based model, Joint Sentiment-Topic (JST) examines the impact of topics and emotions on words. The emotion parameter is insufficient, and additional parameters may play valuable roles in achieving better performance. In this study, two new topic models, Weighted Joint Sentiment-Topic (WJST) and Weighted Joint Sentiment-Topic 1 (WJST1), have been presented to extend and improve JST through two new parameters that can generate a sentiment dictionary. In the proposed methods, each word in a document affects its neighbors, and different words in the document may be affected simultaneously by several neighbor words. Therefore, proposed models consider the effect of words on each other, which, from our view, is an important factor and can increase the performance of baseline methods. Regarding evaluation results, the new parameters have an immense effect on model accuracy. While not requiring labeled data, the proposed methods are more accurate than discriminative models such as SVM and logistic regression in accordance with evaluation results. The proposed methods are simple with a low number of parameters. While providing a broad perception of connections between different words in documents of a single collection (single-domain) or multiple collections (multidomain), the proposed methods have prepared solutions for two different situations (single-domain and multidomain). WJST is suitable for multidomain datasets, and WJST1 is a version of WJST which is suitable for single-domain datasets. While being able to detect emotion at the level of the document, the proposed models improve the evaluation outcomes of the baseline approaches. Thirteen datasets with different sizes have been used in implementations. In this study, perplexity, opinion mining at the level of the document, and topic_coherency are employed for assessment. Also, a statistical test called Friedman test is used to check whether the results of the proposed models are statistically different from the results of other algorithms. As can be seen from results, the accuracy of proposed methods is above 80% for most of the datasets. WJST1 achieves the highest accuracy on Movie dataset with 97 percent, and WJST achieves the highest accuracy on Electronic dataset with 86 percent. The proposed models obtain better results compared to Adaptive Lexicon learning using Genetic Algorithm (ALGA), which employs an evolutionary approach to make an emotion dictionary. Results show that the proposed methods perform better with different topic number settings, especially for WJST1 with 97% accuracy at || = 5 on the Movie dataset.

摘要

潜在狄利克雷分配 (LDA) 是一种无监督学习方法,旨在研究文档中单词之间的语义以及主题对单词的影响。作为基于 LDA 的模型,联合情感主题 (JST) 检查主题和情感对单词的影响。情感参数不足,并且其他参数可能在实现更好的性能方面发挥有价值的作用。在这项研究中,提出了两种新的主题模型,加权联合情感主题 (WJST) 和加权联合情感主题 1 (WJST1),通过两个新的参数来扩展和改进 JST,这些参数可以生成情感词典。在所提出的方法中,文档中的每个单词都会影响其邻居,并且文档中的不同单词可能会同时受到几个邻居单词的影响。因此,所提出的模型考虑了单词之间的相互影响,这从我们的角度来看是一个重要因素,可以提高基线方法的性能。关于评估结果,新参数对模型准确性有巨大影响。虽然不需要标记数据,但与 SVM 和逻辑回归等判别模型相比,所提出的方法根据评估结果更准确。所提出的方法简单,参数数量少。在为单域和多域两种情况(单域和多域)提供解决方案的同时,所提出的方法提供了对单个集合(单域)或多个集合(多域)中不同单词之间连接的广泛认识。WJST 适用于多域数据集,而 WJST1 是适用于单域数据集的 WJST 版本。虽然能够在文档级别检测情感,但所提出的模型提高了基线方法的评估结果。在实现中使用了 13 个具有不同大小的数据集。在这项研究中,使用困惑度、文档级别的意见挖掘和主题一致性进行评估。此外,还使用称为 Friedman 检验的统计检验来检查所提出的模型的结果是否与其他算法的结果在统计上有显著差异。从结果可以看出,在所提出的方法中,大多数数据集的准确率都在 80%以上。WJST1 在 Movie 数据集上的准确率最高,为 97%,WJST 在 Electronic 数据集上的准确率最高,为 86%。与使用遗传算法 (ALGA) 进行自适应词汇学习的方法相比,所提出的方法具有更好的效果,该方法采用进化方法制作情感词典。结果表明,在所提出的方法中,不同主题数量的设置效果更好,特别是对于 WJST1,在 Movie 数据集上的||=5 时准确率为 97%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/09534cd87800/CIN2022-7612276.017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/a6bc46021f83/CIN2022-7612276.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/a48641b0358c/CIN2022-7612276.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/af77d883fc04/CIN2022-7612276.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/ce19e3626575/CIN2022-7612276.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/fff3d34c1591/CIN2022-7612276.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/13776bccc0f7/CIN2022-7612276.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/fc593213299e/CIN2022-7612276.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/0050729618a3/CIN2022-7612276.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/284962871c5a/CIN2022-7612276.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/4a3bd784a06a/CIN2022-7612276.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/b6723c770f2c/CIN2022-7612276.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/30740510e4de/CIN2022-7612276.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/fb3e78d48afd/CIN2022-7612276.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/ebec722c8be2/CIN2022-7612276.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/1ecbce0f4e16/CIN2022-7612276.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/6261b406514d/CIN2022-7612276.016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/09534cd87800/CIN2022-7612276.017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/a6bc46021f83/CIN2022-7612276.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/a48641b0358c/CIN2022-7612276.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/af77d883fc04/CIN2022-7612276.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/ce19e3626575/CIN2022-7612276.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/fff3d34c1591/CIN2022-7612276.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/13776bccc0f7/CIN2022-7612276.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/fc593213299e/CIN2022-7612276.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/0050729618a3/CIN2022-7612276.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/284962871c5a/CIN2022-7612276.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/4a3bd784a06a/CIN2022-7612276.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/b6723c770f2c/CIN2022-7612276.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/30740510e4de/CIN2022-7612276.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/fb3e78d48afd/CIN2022-7612276.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/ebec722c8be2/CIN2022-7612276.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/1ecbce0f4e16/CIN2022-7612276.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/6261b406514d/CIN2022-7612276.016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6f4/9374039/09534cd87800/CIN2022-7612276.017.jpg

相似文献

1
Weighted Joint Sentiment-Topic Model for Sentiment Analysis Compared to ALGA: Adaptive Lexicon Learning Using Genetic Algorithm.加权联合情感-主题模型在情感分析中的应用比较 与 ALGA 相比:基于遗传算法的自适应词汇学习。
Comput Intell Neurosci. 2022 Jul 31;2022:7612276. doi: 10.1155/2022/7612276. eCollection 2022.
2
Neural Networks with Emotion Associations, Topic Modeling and Supervised Term Weighting for Sentiment Analysis.用于情感分析的具有情感关联、主题建模和监督词加权的神经网络
Int J Neural Syst. 2021 Oct;31(10):2150013. doi: 10.1142/S0129065721500131. Epub 2021 Feb 10.
3
Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm.基于组合分类方法和 Senti-lexicon 算法的马来语情感分析。
PLoS One. 2018 Apr 23;13(4):e0194852. doi: 10.1371/journal.pone.0194852. eCollection 2018.
4
Two-Level LSTM for Sentiment Analysis With Lexicon Embedding and Polar Flipping.基于词典嵌入和极性翻转的两层长短时记忆网络情感分析
IEEE Trans Cybern. 2022 May;52(5):3867-3879. doi: 10.1109/TCYB.2020.3017378. Epub 2022 May 19.
5
Attention-Emotion-Enhanced Convolutional LSTM for Sentiment Analysis.用于情感分析的注意力-情感增强卷积长短期记忆网络
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4332-4345. doi: 10.1109/TNNLS.2021.3056664. Epub 2022 Aug 31.
6
Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data.机器学习和情感分析在不同类型数据中检测可疑在线评论者的比较。
Sensors (Basel). 2021 Dec 27;22(1):155. doi: 10.3390/s22010155.
7
Construction of an Emotional Lexicon of Patients With Breast Cancer: Development and Sentiment Analysis.构建乳腺癌患者情绪词汇库:编制与情感分析
J Med Internet Res. 2023 Sep 12;25:e44897. doi: 10.2196/44897.
8
Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations.基于通用语音表示的语音情感分析和情绪识别。
Sensors (Basel). 2022 Aug 24;22(17):6369. doi: 10.3390/s22176369.
9
Sentiment analysis of clinical narratives: A scoping review.临床叙事的情感分析:一项范围综述。
J Biomed Inform. 2023 Apr;140:104336. doi: 10.1016/j.jbi.2023.104336. Epub 2023 Mar 22.
10
EMFSA: Emoji-based multifeature fusion sentiment analysis.EMFSA:基于表情符号的多特征融合情感分析。
PLoS One. 2024 Sep 19;19(9):e0310715. doi: 10.1371/journal.pone.0310715. eCollection 2024.

引用本文的文献

1
Mining LDA topics on construction engineering change risks based on graded evidence.基于分级证据挖掘建筑工程变更风险的 LDA 主题。
PLoS One. 2024 Jun 20;19(6):e0303424. doi: 10.1371/journal.pone.0303424. eCollection 2024.
2
Sentiment analysis of tweets on alopecia areata, hidradenitis suppurativa, and psoriasis: Revealing the patient experience.斑秃、化脓性汗腺炎和银屑病推文的情感分析:揭示患者体验。
Front Med (Lausanne). 2022 Oct 31;9:996378. doi: 10.3389/fmed.2022.996378. eCollection 2022.

本文引用的文献

1
Topic2features: a novel framework to classify noisy and sparse textual data using LDA topic distributions.主题2特征:一种使用LDA主题分布对噪声和稀疏文本数据进行分类的新颖框架。
PeerJ Comput Sci. 2021 Aug 11;7:e677. doi: 10.7717/peerj-cs.677. eCollection 2021.
2
Design of adaptive ensemble classifier for online sentiment analysis and opinion mining.用于在线情感分析和观点挖掘的自适应集成分类器设计
PeerJ Comput Sci. 2021 Aug 5;7:e660. doi: 10.7717/peerj-cs.660. eCollection 2021.
3
TopicBERT: A Topic-Enhanced Neural Language Model Fine-Tuned for Sentiment Classification.
TopicBERT:一种基于主题增强的神经语言模型,经过情感分类微调。
IEEE Trans Neural Netw Learn Syst. 2023 Jan;34(1):380-393. doi: 10.1109/TNNLS.2021.3094987. Epub 2023 Jan 5.
4
Attention-Emotion-Enhanced Convolutional LSTM for Sentiment Analysis.用于情感分析的注意力-情感增强卷积长短期记忆网络
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4332-4345. doi: 10.1109/TNNLS.2021.3056664. Epub 2022 Aug 31.
5
HCET: Hierarchical Clinical Embedding With Topic Modeling on Electronic Health Records for Predicting Future Depression.基于电子健康记录的分层临床嵌入与主题建模预测未来抑郁
IEEE J Biomed Health Inform. 2021 Apr;25(4):1265-1272. doi: 10.1109/JBHI.2020.3004072. Epub 2021 Apr 6.
6
Statistical inferences for polarity identification in natural language.自然语言中极性识别的统计推断。
PLoS One. 2018 Dec 21;13(12):e0209323. doi: 10.1371/journal.pone.0209323. eCollection 2018.
7
Finding scientific topics.寻找科学主题。
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5228-35. doi: 10.1073/pnas.0307752101. Epub 2004 Feb 10.