China Medical University, Shenyang, 110122, China.
Digital Health China Co. Ltd, Beijing, 100089, China.
BMC Med. 2024 Feb 5;22(1):56. doi: 10.1186/s12916-024-03273-7.
A comprehensive overview of artificial intelligence (AI) for cardiovascular disease (CVD) prediction and a screening tool of AI models (AI-Ms) for independent external validation are lacking. This systematic review aims to identify, describe, and appraise AI-Ms of CVD prediction in the general and special populations and develop a new independent validation score (IVS) for AI-Ms replicability evaluation.
PubMed, Web of Science, Embase, and IEEE library were searched up to July 2021. Data extraction and analysis were performed for the populations, distribution, predictors, algorithms, etc. The risk of bias was evaluated with the prediction risk of bias assessment tool (PROBAST). Subsequently, we designed IVS for model replicability evaluation with five steps in five items, including transparency of algorithms, performance of models, feasibility of reproduction, risk of reproduction, and clinical implication, respectively. The review is registered in PROSPERO (No. CRD42021271789).
In 20,887 screened references, 79 articles (82.5% in 2017-2021) were included, which contained 114 datasets (67 in Europe and North America, but 0 in Africa). We identified 486 AI-Ms, of which the majority were in development (n = 380), but none of them had undergone independent external validation. A total of 66 idiographic algorithms were found; however, 36.4% were used only once and only 39.4% over three times. A large number of different predictors (range 5-52,000, median 21) and large-span sample size (range 80-3,660,000, median 4466) were observed. All models were at high risk of bias according to PROBAST, primarily due to the incorrect use of statistical methods. IVS analysis confirmed only 10 models as "recommended"; however, 281 and 187 were "not recommended" and "warning," respectively.
AI has led the digital revolution in the field of CVD prediction, but is still in the early stage of development as the defects of research design, report, and evaluation systems. The IVS we developed may contribute to independent external validation and the development of this field.
目前缺乏对心血管疾病(CVD)预测的人工智能(AI)进行全面综述,以及用于独立外部验证的 AI 模型(AI-M)筛选工具。本系统综述旨在确定、描述和评估一般人群和特殊人群中 CVD 预测的 AI-M,并为 AI-M 的可重复性评估开发新的独立验证评分(IVS)。
检索了 2021 年 7 月前的 PubMed、Web of Science、Embase 和 IEEE 文库。对人群、分布、预测因子、算法等进行了数据提取和分析。采用预测风险偏倚评估工具(PROBAST)评估偏倚风险。随后,我们设计了 IVS,用于模型可重复性评估,共包含五个项目的五个步骤,分别是算法透明度、模型性能、可复制性、复制风险和临床意义。该综述已在 PROSPERO(编号:CRD42021271789)上注册。
在 20887 篇筛选文献中,共纳入 79 篇文章(2017-2021 年占 82.5%),其中包含 114 个数据集(67 个来自欧洲和北美,但没有来自非洲的)。我们共确定了 486 个 AI-M,其中大多数处于开发阶段(n=380),但没有一个经过独立的外部验证。共发现 66 个个体化算法,但其中 36.4%仅使用过一次,只有 39.4%使用过三次以上。观察到大量不同的预测因子(范围为 5-52000,中位数为 21)和跨度较大的样本量(范围为 80-3660000,中位数为 4466)。根据 PROBAST,所有模型的偏倚风险均较高,主要是由于统计方法使用不正确。IVS 分析仅确认了 10 个模型为“推荐”,但 281 个和 187 个分别为“不推荐”和“警告”。
人工智能引领了 CVD 预测领域的数字化革命,但由于研究设计、报告和评估系统的缺陷,仍处于早期发展阶段。我们开发的 IVS 可能有助于独立的外部验证和该领域的发展。