Department of Translational Data Science and Informatics, Geisinger, Danville, PA 17822, USA.
Department of General Internal Medicine, Geisinger, Danville, PA 17822, USA.
Int J Environ Res Public Health. 2021 Apr 25;18(9):4560. doi: 10.3390/ijerph18094560.
We utilize functional data analysis techniques to investigate patterns of COVID-19 positivity and mortality in the US and their associations with Google search trends for COVID-19-related symptoms. Specifically, we represent state-level time series data for COVID-19 and Google search trends for symptoms as smoothed functional curves. Given these functional data, we explore the modes of variation in the data using functional principal component analysis (FPCA). We also apply functional clustering analysis to identify patterns of COVID-19 confirmed case and death trajectories across the US. Moreover, we quantify the associations between Google COVID-19 search trends for symptoms and COVID-19 confirmed case and death trajectories using dynamic correlation. Finally, we examine the dynamics of correlations for the top nine Google search trends of symptoms commonly associated with COVID-19 confirmed case and death trajectories. Our results reveal and characterize distinct patterns for COVID-19 spread and mortality across the US. The dynamics of these correlations suggest the feasibility of using Google queries to forecast COVID-19 cases and mortality for up to three weeks in advance. Our results and analysis framework set the stage for the development of predictive models for forecasting COVID-19 confirmed cases and deaths using historical data and Google search trends for nine symptoms associated with both outcomes.
我们利用功能数据分析技术研究了美国 COVID-19 阳性率和死亡率的模式及其与 COVID-19 相关症状的谷歌搜索趋势之间的关联。具体来说,我们将 COVID-19 的州级时间序列数据和 COVID-19 相关症状的谷歌搜索趋势表示为平滑的功能曲线。考虑到这些功能数据,我们使用功能主成分分析(FPCA)探索数据的变化模式。我们还应用功能聚类分析来识别美国 COVID-19 确诊病例和死亡轨迹的模式。此外,我们使用动态相关系数来量化症状的谷歌 COVID-19 搜索趋势与 COVID-19 确诊病例和死亡轨迹之间的关联。最后,我们研究了与 COVID-19 确诊病例和死亡轨迹相关的前九个常见症状的谷歌搜索趋势的相关性的动态变化。我们的研究结果揭示并描述了美国 COVID-19 传播和死亡率的不同模式。这些相关性的动态变化表明,使用谷歌查询提前三到四周预测 COVID-19 病例和死亡率是可行的。我们的研究结果和分析框架为使用历史数据和与这两个结果都相关的九个症状的谷歌搜索趋势开发预测模型来预测 COVID-19 确诊病例和死亡人数奠定了基础。