Andreano Anita, Lepore Vito, Magnoni Pietro, Milanese Alberto, Fanizza Caterina, Testa Deborah, Musa Alessandro, Zanfino Adele, Rebora Paola, Bisceglia Lucia, Russo Antonio Giampiero
SC Unità Di Epidemiologia, Agenzia Di Tutela Della Salute Della Città Metropolitana Di Milano, Via Conca del Naviglio 45, Milan, 20123, Italy.
Area Epidemiologia E Care Intelligence, Agenzia Regionale Strategica Per La Salute Ed Il Sociale (AReSS) Puglia, Lungomare Nazario Sauro 33, Bari, 70121, Italy.
Syst Rev. 2024 Dec 24;13(1):313. doi: 10.1186/s13643-024-02717-8.
Heart failure (HF), affecting 1-4% of adults in industrialized countries, is a major public health priority. Several algorithms based on administrative health data (HAD) have been developed to detect patients with HF in a timely and inexpensive manner, in order to perform real-world studies at the population level. However, their reported diagnostic accuracy is highly variable.
To assess the diagnostic accuracy of validated HAD-based algorithms for detecting HF, compared to clinical diagnosis, and to investigate causes of heterogeneity.
We included all diagnostic accuracy studies that utilized HAD for the diagnosis of congestive HF in the general adult population, using clinical examination or chart review as the reference standard. A systematic search of MEDLINE (1946-2023) and Embase (1947-2023) was conducted, without restrictions. The QUADAS-2 tool was employed to assess the risk of bias and concerns regarding applicability. Due to low-quality issues of the primary studies, associated with both the index test and the reference standard definition and conduct, and to the high level of clinical heterogeneity, a quantitative synthesis was not performed. Measures of diagnostic accuracy of the included algorithms were summarized narratively and presented graphically, by population subgroups.
We included 24 studies (161,524 patients) and extracted 36 algorithms. Algorithm selection was based on type of administrative data and DOR. Six studies (103,018 patients, 14 algorithms) were performed in the general outpatient population, with sensitivities ranging from 24.8 to 97.3% and specificities ranging from 35.6 to 99.5%. Eight studies (14,957 patients, 10 algorithms) included hospitalized patients with sensitivities ranging from 29.0 to 96.0% and specificities ranging from 65.8 to 99.2%. The remaining studies included subgroups of the general population or hospitalized patients with cardiologic conditions and were analyzed separately. Fourteen studies had one or more domains at high risk of bias, and there were concerns regarding applicability in 9 studies.
The considerable percentage of studies with a high risk of bias, together with the high clinical heterogeneity among different studies, did not allow to generate a pooled estimate of diagnostic accuracy for HAD-based algorithms to be used in an unselected general adult population.
PROSPERO CRD42023487565.
心力衰竭(HF)影响着工业化国家1%-4%的成年人,是一项重大的公共卫生优先事项。已经开发了几种基于行政卫生数据(HAD)的算法,以便及时、低成本地检测HF患者,从而在人群层面开展真实世界研究。然而,报告的这些算法的诊断准确性差异很大。
评估经过验证的基于HAD的HF检测算法相对于临床诊断的诊断准确性,并探究异质性的原因。
我们纳入了所有利用HAD对普通成年人群充血性HF进行诊断的诊断准确性研究,以临床检查或病历审查作为参考标准。对MEDLINE(1946 - 2023年)和Embase(1947 - 2023年)进行了无限制的系统检索。采用QUADAS-2工具评估偏倚风险和适用性方面的问题。由于主要研究存在质量较低的问题,涉及指标测试以及参考标准的定义和实施,且临床异质性程度较高,因此未进行定量综合分析。纳入算法的诊断准确性指标以叙述方式进行总结,并按人群亚组以图表形式呈现。
我们纳入了24项研究(161,524名患者),并提取了36种算法。算法选择基于行政数据类型和诊断比值比(DOR)。六项研究(103,018名患者,14种算法)在普通门诊人群中进行,敏感性范围为24.8%至97.3%,特异性范围为35.6%至99.5%。八项研究(14,957名患者,10种算法)纳入了住院患者,敏感性范围为29.0%至96.0%,特异性范围为65.8%至99.2%。其余研究纳入了普通人群亚组或患有心脏病的住院患者,并分别进行了分析。14项研究存在一个或多个高偏倚风险领域,9项研究存在适用性方面的问题。
大量研究存在高偏倚风险,同时不同研究之间临床异质性较高,这使得无法对用于未经过筛选的普通成年人群的基于HAD的算法生成综合诊断准确性估计值。
PROSPERO CRD42023487565。