推特上与情绪相关的时空因素建模：综合分析及改善局部偏差识别的建议

Modeling Spatiotemporal Factors Associated With Sentiment on Twitter: Synthesis and Suggestions for Improving the Identification of Localized Deviations.

作者信息

Shah Zubair, Martin Paige, Coiera Enrico, Mandl Kenneth D, Dunn Adam G

机构信息

Centre for Health Informatics, Australian Institute for Health Innovation, Macquarie University, Sydney, Australia.

Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States.

出版信息

J Med Internet Res. 2019 May 8;21(5):e12881. doi: 10.2196/12881.

DOI:10.2196/12881

PMID:31344669

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6682275/

Abstract

BACKGROUND

Studies examining how sentiment on social media varies depending on timing and location appear to produce inconsistent results, making it hard to design systems that use sentiment to detect localized events for public health applications.

OBJECTIVE

The aim of this study was to measure how common timing and location confounders explain variation in sentiment on Twitter.

METHODS

Using a dataset of 16.54 million English-language tweets from 100 cities posted between July 13 and November 30, 2017, we estimated the positive and negative sentiment for each of the cities using a dictionary-based sentiment analysis and constructed models to explain the differences in sentiment using time of day, day of week, weather, city, and interaction type (conversations or broadcasting) as factors and found that all factors were independently associated with sentiment.

RESULTS

In the full multivariable model of positive (Pearson r in test data 0.236; 95% CI 0.231-0.241) and negative (Pearson r in test data 0.306; 95% CI 0.301-0.310) sentiment, the city and time of day explained more of the variance than weather and day of week. Models that account for these confounders produce a different distribution and ranking of important events compared with models that do not account for these confounders.

CONCLUSIONS

In public health applications that aim to detect localized events by aggregating sentiment across populations of Twitter users, it is worthwhile accounting for baseline differences before looking for unexpected changes.

摘要

背景

关于社交媒体上的情绪如何随时间和地点变化的研究似乎产生了不一致的结果，这使得设计利用情绪来检测公共卫生应用中的局部事件的系统变得困难。

目的

本研究的目的是衡量常见的时间和地点混杂因素如何解释推特上的情绪变化。

方法

我们使用了一个包含2017年7月13日至11月30日期间100个城市发布的1654万条英语推文的数据集，使用基于词典的情绪分析方法估计每个城市的积极和消极情绪，并构建模型，以一天中的时间、一周中的日期、天气、城市和互动类型（对话或广播）作为因素来解释情绪差异，发现所有因素都与情绪独立相关。

结果

在积极情绪（测试数据中的皮尔逊r为0.236；95%置信区间为0.231 - 0.241）和消极情绪（测试数据中的皮尔逊r为0.306；95%置信区间为0.301 - 0.310）的完整多变量模型中，城市和一天中的时间比天气和一周中的日期解释了更多的方差。与不考虑这些混杂因素的模型相比，考虑这些混杂因素的模型会产生不同的重要事件分布和排名。

结论

在旨在通过汇总推特用户群体的情绪来检测局部事件的公共卫生应用中，在寻找意外变化之前考虑基线差异是值得的。

相似文献

Modeling Spatiotemporal Factors Associated With Sentiment on Twitter: Synthesis and Suggestions for Improving the Identification of Localized Deviations.推特上与情绪相关的时空因素建模：综合分析及改善局部偏差识别的建议

J Med Internet Res. 2019 May 8;21(5):e12881. doi: 10.2196/12881.

Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter.应用多种数据收集工具量化推特上的人乳头瘤病毒疫苗传播情况

J Med Internet Res. 2016 Dec 5;18(12):e318. doi: 10.2196/jmir.6670.

Using Twitter to Better Understand the Spatiotemporal Patterns of Public Sentiment: A Case Study in Massachusetts, USA.利用 Twitter 更好地了解公众情绪的时空模式：以美国马萨诸塞州为例。

Int J Environ Res Public Health. 2018 Feb 2;15(2):250. doi: 10.3390/ijerph15020250.

Temporal and spatiotemporal investigation of tourist attraction visit sentiment on Twitter.基于 Twitter 的旅游景点访问情绪的时间和时空调查。

PLoS One. 2018 Jun 14;13(6):e0198857. doi: 10.1371/journal.pone.0198857. eCollection 2018.

Emotions and Topics Expressed on Twitter During the COVID-19 Pandemic in the United Kingdom: Comparative Geolocation and Text Mining Analysis.在英国 COVID-19 大流行期间在 Twitter 上表达的情绪和主题：比较地理定位和文本挖掘分析。

J Med Internet Res. 2022 Oct 5;24(10):e40323. doi: 10.2196/40323.

Geographic Differences in Cannabis Conversations on Twitter: Infodemiology Study.推特上关于大麻的讨论存在地域差异：一项信息流行病学研究。

JMIR Public Health Surveill. 2020 Oct 5;6(4):e18540. doi: 10.2196/18540.

Using Twitter to Examine Web-Based Patient Experience Sentiments in the United States: Longitudinal Study.利用推特研究美国基于网络的患者体验情绪：纵向研究。

J Med Internet Res. 2018 Oct 12;20(10):e10043. doi: 10.2196/10043.

Perceptions of Menthol Cigarettes Among Twitter Users: Content and Sentiment Analysis.推特用户对薄荷醇香烟的认知：内容与情感分析

J Med Internet Res. 2017 Feb 27;19(2):e56. doi: 10.2196/jmir.5694.

Topics, Trends, and Sentiments of Tweets About the COVID-19 Pandemic: Temporal Infoveillance Study.关于新冠疫情的推文主题、趋势和情绪：时间信息监测研究

J Med Internet Res. 2020 Oct 23;22(10):e22624. doi: 10.2196/22624.

Using Twitter Comments to Understand People's Experiences of UK Health Care During the COVID-19 Pandemic: Thematic and Sentiment Analysis.利用推特评论了解新冠疫情期间英国人对英国医疗保健的体验：主题和情感分析。

J Med Internet Res. 2021 Oct 25;23(10):e31101. doi: 10.2196/31101.

引用本文的文献

Top-k sentiment analysis over spatio-temporal data.基于时空数据的Top-k情感分析。

PeerJ Comput Sci. 2024 Sep 10;10:e2297. doi: 10.7717/peerj-cs.2297. eCollection 2024.

Extracting factors associated with vaccination from Twitter data and mapping to behavioral models.从 Twitter 数据中提取与疫苗接种相关的因素，并映射到行为模型中。

Hum Vaccin Immunother. 2023 Dec 15;19(3):2281729. doi: 10.1080/21645515.2023.2281729. Epub 2023 Nov 27.

The impact of exogenous shocks on national wellbeing. New Zealanders' reaction to COVID-19.外部冲击对国民福祉的影响。新西兰人对新冠疫情的反应。

Appl Res Qual Life. 2022;17(3):1787-1812. doi: 10.1007/s11482-021-09977-9. Epub 2021 Oct 6.

Spatiotemporal data mining: a survey on challenges and open problems.时空数据挖掘：关于挑战与开放问题的综述

Artif Intell Rev. 2022;55(2):1441-1488. doi: 10.1007/s10462-021-09994-y. Epub 2021 Apr 15.

Top Concerns of Tweeters During the COVID-19 Pandemic: Infoveillance Study.新冠疫情期间推特用户的主要担忧：信息监测研究

J Med Internet Res. 2020 Apr 21;22(4):e19016. doi: 10.2196/19016.

本文引用的文献

Us and them: identifying cyber hate on Twitter across multiple protected characteristics.我们与他们：识别推特上针对多种受保护特征的网络仇恨言论。

EPJ Data Sci. 2016;5(1):11. doi: 10.1140/epjds/s13688-016-0072-6. Epub 2016 Mar 23.

Social media interventions for precision public health: promises and risks.用于精准公共卫生的社交媒体干预措施：前景与风险。

NPJ Digit Med. 2018;1. doi: 10.1038/s41746-018-0054-0. Epub 2018 Sep 19.

Temporal and spatiotemporal investigation of tourist attraction visit sentiment on Twitter.基于 Twitter 的旅游景点访问情绪的时间和时空调查。

PLoS One. 2018 Jun 14;13(6):e0198857. doi: 10.1371/journal.pone.0198857. eCollection 2018.

Enhancing disease surveillance with novel data streams: challenges and opportunities.利用新型数据流加强疾病监测：挑战与机遇

EPJ Data Sci. 2015;4(1). doi: 10.1140/epjds/s13688-015-0054-0. Epub 2015 Oct 16.

Who Tweets with Their Location? Understanding the Relationship between Demographic Characteristics and the Use of Geoservices and Geotagging on Twitter.哪些人会在推特上标注自己的位置？了解人口统计学特征与推特上地理服务和地理标签使用之间的关系。

PLoS One. 2015 Nov 6;10(11):e0142209. doi: 10.1371/journal.pone.0142209. eCollection 2015.

You Are What You Tweet: Connecting the Geographic Variation in America's Obesity Rate to Twitter Content.人如其言：将美国肥胖率的地理差异与推特内容联系起来。

PLoS One. 2015 Sep 2;10(9):e0133505. doi: 10.1371/journal.pone.0133505. eCollection 2015.

A Study of the Demographics of Web-Based Health-Related Social Media Users.基于网络的健康相关社交媒体用户的人口统计学研究。

J Med Internet Res. 2015 Aug 6;17(8):e194. doi: 10.2196/jmir.4308.

Who tweets? Deriving the demographic characteristics of age, occupation and social class from twitter user meta-data.谁会发推文？从推特用户元数据中推导年龄、职业和社会阶层的人口统计学特征。

PLoS One. 2015 Mar 2;10(3):e0115545. doi: 10.1371/journal.pone.0115545. eCollection 2015.

We feel: mapping emotion on Twitter.我们的感受：在 Twitter 上绘制情绪图谱。

IEEE J Biomed Health Inform. 2015 Jul;19(4):1246-52. doi: 10.1109/JBHI.2015.2403839. Epub 2015 Feb 13.

Happiness and the patterns of life: a study of geolocated tweets.幸福与生活模式：一项关于地理位置定位推文的研究。

Sci Rep. 2013;3:2625. doi: 10.1038/srep02625.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

推特上与情绪相关的时空因素建模：综合分析及改善局部偏差识别的建议

Modeling Spatiotemporal Factors Associated With Sentiment on Twitter: Synthesis and Suggestions for Improving the Identification of Localized Deviations.

作者信息

机构信息

出版信息

BACKGROUND

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

背景

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献