Kaur Mahakprit, Cargill Taylor, Hui Kevin, Vu Minh, Bragazzi Nicola Luigi, Kong Jude Dzevela
Department of Biology, Faculty of Science, York University, Toronto, ON, Canada.
Africa-Canada Artificial Intelligence and Data Innovation Consortium, Toronto, ON, Canada.
JMIR Form Res. 2024 Jan 29;8:e46087. doi: 10.2196/46087.
The COVID-19 pandemic has highlighted gaps in the current handling of medical resource demand surges and the need for prioritizing scarce medical resources to mitigate the risk of health care facilities becoming overwhelmed.
During a health care emergency, such as the COVID-19 pandemic, the public often uses social media to express negative sentiment (eg, urgency, fear, and frustration) as a real-time response to the evolving crisis. The sentiment expressed in COVID-19 posts may provide valuable real-time information about the relative severity of medical resource demand in different regions of a country. In this study, Twitter (subsequently rebranded as X) sentiment analysis was used to investigate whether an increase in negative sentiment COVID-19 tweets corresponded to a greater demand for hospital intensive care unit (ICU) beds in specific regions of the United States, Brazil, and India.
Tweets were collected from a publicly available data set containing COVID-19 tweets with sentiment labels and geolocation information posted between February 1, 2020, and March 31, 2021. Regional medical resource shortage data were gathered from publicly available data sets reporting a time series of ICU bed demand across each country. Negative sentiment tweets were analyzed using the Granger causality test and convergent cross-mapping (CCM) analysis to assess the utility of the time series of negative sentiment tweets in forecasting ICU bed shortages.
For the United States (30,742,934 negative sentiment tweets), the results of the Granger causality test (for whether negative sentiment COVID-19 tweets forecast ICU bed shortage, assuming a stochastic system) were significant (P<.05) for 14 (28%) of the 50 states that passed the augmented Dickey-Fuller test at lag 2, and the results of the CCM analysis (for whether negative sentiment COVID-19 tweets forecast ICU bed shortage, assuming a dynamic system) were significant (P<.05) for 46 (92%) of the 50 states. For Brazil (3,004,039 negative sentiment tweets), the results of the Granger causality test were significant (P<.05) for 6 (22%) of the 27 federative units, and the results of the CCM analysis were significant (P<.05) for 26 (96%) of the 27 federative units. For India (4,199,151 negative sentiment tweets), the results of the Granger causality test were significant (P<.05) for 6 (23%) of the 26 included regions (25 states and the national capital region of Delhi), and the results of the CCM analysis were significant (P<.05) for 26 (100%) of the 26 included regions.
This study provides a novel approach for identifying the regions of high hospital bed demand during a health care emergency scenario by analyzing Twitter sentiment data. Leveraging analyses that take advantage of natural language processing-driven tweet extraction systems has the potential to be an effective method for the early detection of medical resource demand surges.
新冠疫情凸显了当前应对医疗资源需求激增方面的差距,以及优先分配稀缺医疗资源以降低医疗机构不堪重负风险的必要性。
在诸如新冠疫情这样的医疗紧急情况期间,公众常常利用社交媒体来表达负面情绪(如紧迫感、恐惧和沮丧),作为对不断演变的危机的实时反应。新冠疫情相关推文所表达的情绪可能会提供有关一个国家不同地区医疗资源需求相对严重程度的有价值的实时信息。在本研究中,运用推特(后更名为X)情绪分析来调查新冠疫情负面情绪推文的增加是否与美国、巴西和印度特定地区对医院重症监护病房(ICU)床位的更大需求相对应。
推文是从一个公开可用的数据集中收集的,该数据集包含2020年2月1日至2021年3月31日期间发布的带有情绪标签和地理位置信息的新冠疫情相关推文。区域医疗资源短缺数据是从公开可用的数据集中收集的,这些数据集报告了每个国家ICU床位需求的时间序列。使用格兰杰因果检验和收敛交叉映射(CCM)分析对负面情绪推文进行分析,以评估负面情绪推文时间序列在预测ICU床位短缺方面的效用。
对于美国(30,742,934条负面情绪推文),格兰杰因果检验(关于新冠疫情负面情绪推文是否预测ICU床位短缺,假设为一个随机系统)的结果在滞后2时通过增广迪基 - 富勒检验的50个州中的14个(28%)是显著的(P<.05),而CCM分析(关于新冠疫情负面情绪推文是否预测ICU床位短缺,假设为一个动态系统)的结果在50个州中的46个(92%)是显著的(P<.05)。对于巴西(3,004,039条负面情绪推文),格兰杰因果检验的结果在27个联邦单位中的6个(22%)是显著的(P<.05),而CCM分析的结果在27个联邦单位中的26个(96%)是显著的(P<.05)。对于印度(4,199,151条负面情绪推文),格兰杰因果检验的结果在26个纳入地区(25个邦和德里国家首都辖区)中的6个(23%)是显著的(P<.05),而CCM分析的结果在26个纳入地区中的26个(100%)是显著的(P<.05)。
本研究提供了一种通过分析推特情绪数据来识别医疗紧急情况期间医院床位高需求地区的新方法。利用基于自然语言处理驱动的推文提取系统的分析方法有可能成为早期检测医疗资源需求激增的有效方法。