Quintanilha Dayanna, Moura Eduardo, Xavier Danielly
Research and Innovation Center, Afya Educação Médica, Alameda Lorena, 01.424-001, São Paulo, São Paulo, Brazil.
BMC Public Health. 2025 Aug 18;25(1):2827. doi: 10.1186/s12889-025-24188-9.
Public health surveillance depends on continuous monitoring to guide interventions and allocate resources effectively. This study aimed to evaluate whether structured medical search data from the Afya Whitebook®, a clinical decision-support platform, can serve as exogenous variables to enhance the explanatory capacity of time series models characterising hospitalisation patterns within Brazil's public health system.
An ecological time series analysis was conducted using hospitalisation data (SIH/SUS) and Afya Whitebook® search volumes from 2021 to 2024. SARIMAX models assessed temporal associations between search activity and hospital admissions across Brazilian states, compared to univariate SARIMA models to evaluate the added value of search data.
In 278 of the 478 time series, SARIMAX models provided a better fit than univariate SARIMA models, particularly for conditions such as chronic obstructive pulmonary disease, dengue, urinary tract infections, type 2 diabetes, asthma, depression, and chronic kidney disease. Model fit varied by disease and region, underscoring the influence of contextual factors in the association between search behaviour and hospital admissions.
This study demonstrates that structured medical search data can serve as exogenous variables to improve the explanatory capacity of time series models of hospitalisation patterns. Despite variation between diseases and regions, this approach shows promise in supporting public health surveillance and could be strengthened by incorporating contextual data in future studies.
公共卫生监测依赖持续监测来指导干预措施并有效分配资源。本研究旨在评估来自临床决策支持平台Afya Whitebook®的结构化医学搜索数据是否可作为外生变量,以增强表征巴西公共卫生系统内住院模式的时间序列模型的解释能力。
使用2021年至2024年的住院数据(SIH/SUS)和Afya Whitebook®搜索量进行生态时间序列分析。与单变量SARIMA模型相比,SARIMAX模型评估了巴西各州搜索活动与住院之间的时间关联,以评估搜索数据的附加值。
在478个时间序列中的278个中,SARIMAX模型比单变量SARIMA模型拟合得更好,特别是对于慢性阻塞性肺疾病、登革热、尿路感染、2型糖尿病、哮喘、抑郁症和慢性肾脏病等疾病。模型拟合因疾病和地区而异,突显了背景因素对搜索行为与住院之间关联的影响。
本研究表明,结构化医学搜索数据可作为外生变量来提高住院模式时间序列模型的解释能力。尽管疾病和地区之间存在差异,但这种方法在支持公共卫生监测方面显示出前景,并且可以通过在未来研究中纳入背景数据来加强。