• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于信息论方法的中文文本情感分析主题发现与热点分析

Topic Discovery and Hotspot Analysis of Sentiment Analysis of Chinese Text Using Information-Theoretic Method.

作者信息

Zhang Changlu, Fan Haojie, Zhang Jian, Yang Qiong, Tang Liqian

机构信息

School of Economics & Management, Beijing Information Science & Technology University, Beijing 100192, China.

Beijing Key Lab of Green Development Decision Based on Big Data, Beijing 100192, China.

出版信息

Entropy (Basel). 2023 Jun 13;25(6):935. doi: 10.3390/e25060935.

DOI:10.3390/e25060935
PMID:37372279
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10296934/
Abstract

Currently, sentiment analysis is a research hotspot in many fields such as computer science and statistical science. Topic discovery of the literature in the field of text sentiment analysis aims to provide scholars with a quick and effective understanding of its research trends. In this paper, we propose a new model for the topic discovery analysis of literature. Firstly, the FastText model is applied to calculate the word vector of literature keywords, based on which cosine similarity is applied to calculate keyword similarity, to carry out the merging of synonymous keywords. Secondly, the hierarchical clustering method based on the Jaccard coefficient is used to cluster the domain literature and count the literature volume of each topic. Thirdly, the information gain method is applied to extract the high information gain characteristic words of various topics, based on which the connotation of each topic is condensed. Finally, by conducting a time series analysis of the literature, a four-quadrant matrix of topic distribution is constructed to compare the research trends of each topic within different stages. The 1186 articles in the field of text sentiment analysis from 2012 to 2022 can be divided into 12 categories. By comparing and analyzing the topic distribution matrices of the two phases of 2012 to 2016 and 2017 to 2022, it is found that the various categories of topics have obvious research development changes in different phases. The results show that: ① Among the 12 categories, online opinion analysis of social media comments represented by microblogs is one of the current hot topics. ② The integration and application of methods such as sentiment lexicon, traditional machine learning and deep learning should be enhanced. ③ Semantic disambiguation of aspect-level sentiment analysis is one of the current difficult problems this field faces. ④ Research on multimodal sentiment analysis and cross-modal sentiment analysis should be promoted.

摘要

目前,情感分析是计算机科学和统计学等许多领域的研究热点。文本情感分析领域文献的主题发现旨在为学者提供对其研究趋势的快速有效理解。在本文中,我们提出了一种用于文献主题发现分析的新模型。首先,应用FastText模型计算文献关键词的词向量,在此基础上应用余弦相似度计算关键词相似度,以进行同义词关键词的合并。其次,使用基于杰卡德系数的层次聚类方法对领域文献进行聚类,并统计每个主题的文献量。第三,应用信息增益方法提取各主题的高信息增益特征词,在此基础上凝练各主题的内涵。最后,通过对文献进行时间序列分析,构建主题分布的四象限矩阵,以比较不同阶段各主题的研究趋势。2012年至2022年文本情感分析领域的1186篇文章可分为12类。通过比较和分析2012年至2016年和2017年至2022年两个阶段的主题分布矩阵,发现各主题类别在不同阶段有明显的研究发展变化。结果表明:①在这12类中,以微博为代表的社交媒体评论的在线意见分析是当前的热点话题之一。②应加强情感词典、传统机器学习和深度学习等方法的融合与应用。③方面级情感分析的语义消歧是该领域当前面临的难题之一。④应推动多模态情感分析和跨模态情感分析的研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b428/10296934/caaa861a9772/entropy-25-00935-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b428/10296934/09364adde602/entropy-25-00935-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b428/10296934/3c8463da3662/entropy-25-00935-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b428/10296934/5fa3a557d64c/entropy-25-00935-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b428/10296934/caaa861a9772/entropy-25-00935-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b428/10296934/09364adde602/entropy-25-00935-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b428/10296934/3c8463da3662/entropy-25-00935-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b428/10296934/5fa3a557d64c/entropy-25-00935-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b428/10296934/caaa861a9772/entropy-25-00935-g004.jpg

相似文献

1
Topic Discovery and Hotspot Analysis of Sentiment Analysis of Chinese Text Using Information-Theoretic Method.基于信息论方法的中文文本情感分析主题发现与热点分析
Entropy (Basel). 2023 Jun 13;25(6):935. doi: 10.3390/e25060935.
2
Detecting Topic and Sentiment Trends in Physician Rating Websites: Analysis of Online Reviews Using 3-Wave Datasets.检测医生评级网站的主题和情感趋势:使用三波数据集的在线评论分析
Int J Environ Res Public Health. 2021 Apr 29;18(9):4743. doi: 10.3390/ijerph18094743.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Construction of an Emotional Lexicon of Patients With Breast Cancer: Development and Sentiment Analysis.构建乳腺癌患者情绪词汇库:编制与情感分析
J Med Internet Res. 2023 Sep 12;25:e44897. doi: 10.2196/44897.
5
Brand Potential User Identification Algorithm Based on Sentiment Analysis.基于情感分析的品牌潜在用户识别算法
Front Psychol. 2022 May 30;13:906928. doi: 10.3389/fpsyg.2022.906928. eCollection 2022.
6
Public sense of dental implants on social media: A cross-sectional study based on text analysis of comments.社交媒体上公众对种植牙的认知:基于评论文本分析的横断面研究。
J Dent. 2023 Oct;137:104671. doi: 10.1016/j.jdent.2023.104671. Epub 2023 Aug 20.
7
Sentiment analysis and prediction model based on Chinese government affairs microblogs.基于中国政务微博的情感分析与预测模型
Heliyon. 2023 Aug 12;9(8):e19091. doi: 10.1016/j.heliyon.2023.e19091. eCollection 2023 Aug.
8
The impact factors of social media users' forwarding behavior of COVID-19 vaccine topic: Based on empirical analysis of Chinese Weibo users.社交媒体用户转发新冠疫苗话题的影响因素:基于中国微博用户的实证分析。
Front Public Health. 2022 Sep 14;10:871722. doi: 10.3389/fpubh.2022.871722. eCollection 2022.
9
Opinion mining for national security: techniques, domain applications, challenges and research opportunities.国家安全的观点挖掘:技术、领域应用、挑战与研究机遇
J Big Data. 2021;8(1):150. doi: 10.1186/s40537-021-00536-5. Epub 2021 Dec 4.
10
Hierarchical Fusion Network with Enhanced Knowledge and Contrastive Learning for Multimodal Aspect-Based Sentiment Analysis on Social Media.基于增强知识和对比学习的层次融合网络用于社交媒体上基于多模态方面的情感分析
Sensors (Basel). 2023 Aug 22;23(17):7330. doi: 10.3390/s23177330.

引用本文的文献

1
How perceived sustainability influences consumers' clothing preferences.消费者感知的可持续性如何影响其服装偏好。
Sci Rep. 2024 Nov 19;14(1):28672. doi: 10.1038/s41598-024-80279-4.

本文引用的文献

1
Topic modeling and sentiment analysis of Chinese people's attitudes toward volunteerism amid the COVID-19 pandemic.新冠疫情期间中国人对志愿服务态度的主题建模与情感分析
Front Psychol. 2022 Nov 4;13:1064372. doi: 10.3389/fpsyg.2022.1064372. eCollection 2022.