Suppr超能文献

机器学习方法预测社交媒体灾难谣言辟谣者。

Machine Learning Methods to Predict Social Media Disaster Rumor Refuters.

机构信息

Business School, Sichuan University, Chengdu 610064, China.

College of Movie and Media, Sichuan Normal University, Chengdu 610064, China.

出版信息

Int J Environ Res Public Health. 2019 Apr 24;16(8):1452. doi: 10.3390/ijerph16081452.

Abstract

This research provides a general methodology for distinguishing disaster-related anti-rumor spreaders from a non-ignorant population base, with strong connections in their social circle. Several important influencing factors are examined and illustrated. User information from the most recent posted microblog content of 3793 Sina Weibo users was collected. Natural language processing (NLP) was used for the sentiment and short text similarity analyses, and four machine learning techniques, i.e., logistic regression (LR), support vector machines (SVM), random forest (RF), and extreme gradient boosting (XGBoost) were compared on different rumor refuting microblogs; after which a valid and robust distinguishing XGBoost model was trained and validated to predict who would retweet disaster-related rumor refuting microblogs. Compared with traditional prediction variables that only access user information, the similarity and sentiment analyses of the most recent user microblog contents were found to significantly improve prediction precision and robustness. The number of user microblogs also proved to be a valuable reference for all samples during the prediction process. This prediction methodology could be possibly more useful for WeChat or Facebook as these have relatively stable closed-loop communication channels, which means that rumors are more likely to be refuted by acquaintances. Therefore, the methodology is going to be further optimized and validated on WeChat-like channels in the future. The novel rumor refuting approach presented in this research harnessed NLP for the user microblog content analysis and then used the analysis results of NLP as additional prediction variables to identify the anti-rumor spreaders. Therefore, compared to previous studies, this study presents a new and effective decision support for rumor countermeasures.

摘要

本研究提供了一种从具有强社交联系的非无知人群中区分与灾害相关的反谣言传播者的一般方法。研究考察并说明了几个重要的影响因素。收集了 3793 名新浪微博用户最近发布的微博内容中的用户信息。采用自然语言处理(NLP)进行情感和短文本相似度分析,并比较了逻辑回归(LR)、支持向量机(SVM)、随机森林(RF)和极端梯度提升(XGBoost)等四种机器学习技术在不同的辟谣微博上的表现;然后,针对预测谁会转发与灾害相关的辟谣微博,训练和验证了一个有效的、稳健的 XGBoost 区分模型。与仅访问用户信息的传统预测变量相比,最近用户微博内容的相似度和情感分析被发现显著提高了预测精度和稳健性。用户微博数量也被证明是预测过程中所有样本的一个有价值的参考。由于微信或 Facebook 等社交媒体具有相对稳定的闭环通信渠道,谣言更有可能被熟人辟谣,因此这种预测方法可能对微信或 Facebook 等社交媒体更有用。本研究提出的新颖的辟谣方法利用 NLP 对用户微博内容进行分析,然后将 NLP 的分析结果作为附加预测变量来识别反谣言传播者。因此,与以往的研究相比,本研究为谣言对策提供了一种新的、有效的决策支持。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验