Yuan Xiaoling, Xu Jie, Hussain Sabiha, Wang He, Gao Nan, Zhang Lanjing
Department of Infectious Disease, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Department of Biological Sciences, Rutgers University Newark, NJ, USA.
medRxiv. 2020 Apr 20:2020.04.15.20064485. doi: 10.1101/2020.04.15.20064485.
The coronavirus disease 2019 (COVID-19) infected more than 586,000 patients in the U.S. However, its daily incidence and deaths in the U.S. are poorly understood. Internet search interest was found correlated with COVID-19 daily incidence in China, but not yet applied to the U.S. Therefore, we examined the association of internet search-interest with COVID-19 daily incidence and deaths in the U.S.
We extracted the COVDI-19 daily incidence and death data in the U.S. from two population-based datasets. The search interest of COVID-19 related terms was obtained using Google Trends. Pearson correlation test and general linear model were used to examine correlations and predict future trends, respectively.
There were 555,245 new cases and 22,019 deaths of COVID-19 reported in the U.S. from March 1 to April 12, 2020. The search interest of COVID, "COVID pneumonia," and "COVID heart" were correlated with COVDI-19 daily incidence with ~12-day of delay (Pearson's r=0.978, 0.978 and 0.979, respectively) and deaths with 19-day of delay (Pearson's r=0.963, 0.958 and 0.970, respectively). The COVID-19 daily incidence and deaths appeared to both peak on April 10. The 4-day follow-up with prospectively collected data showed moderate to good accuracies for predicting new cases (Pearson's r=-0.641 to -0.833) and poor to good accuracies for daily new deaths (Pearson's r=0.365 to 0.935).
Search terms related to COVID-19 are highly correlated with the trends in COVID-19 daily incidence and deaths in the U.S. The prediction-models based on the search interest trend reached moderate to good accuracies.
2019年冠状病毒病(COVID-19)在美国感染了超过58.6万名患者。然而,其在美国的每日发病率和死亡情况仍知之甚少。研究发现,在中国,互联网搜索兴趣与COVID-19每日发病率相关,但尚未应用于美国。因此,我们研究了美国互联网搜索兴趣与COVID-19每日发病率和死亡之间的关联。
我们从两个人口数据集提取了美国COVID-19的每日发病率和死亡数据。使用谷歌趋势获取与COVID-19相关术语的搜索兴趣。分别使用Pearson相关检验和一般线性模型来检验相关性并预测未来趋势。
2020年3月1日至4月12日,美国报告了555245例COVID-19新病例和22019例死亡。“COVID”“COVID肺炎”和“COVID心脏”的搜索兴趣与COVID-19每日发病率相关,延迟约12天(Pearson相关系数分别为0.978、0.978和0.979),与死亡相关,延迟19天(Pearson相关系数分别为0.963、0.958和0.970)。COVID-19每日发病率和死亡似乎均在4月10日达到峰值。对前瞻性收集的数据进行4天随访显示,预测新病例的准确性为中等至良好(Pearson相关系数为-0.641至-0.833),预测每日新死亡的准确性为差至良好(Pearson相关系数为0.365至0.935)。
与COVID-19相关的搜索词与美国COVID-19每日发病率和死亡趋势高度相关。基于搜索兴趣趋势的预测模型准确性达到中等至良好。