Evidence-Based Nursing Center, School of Nursing, Lanzhou University, Lanzhou, China.
Department of Obstetrics and Gynecology, The Second School of Clinical Medicine, Shanxi University of Chinese Medicine, Shanxi, China.
Acta Obstet Gynecol Scand. 2023 Jan;102(1):7-14. doi: 10.1111/aogs.14475. Epub 2022 Nov 17.
There was limited evidence on the quality of reporting and methodological quality of prediction models using machine learning methods in preterm birth. This systematic review aimed to assess the reporting quality and risk of bias of a machine learning-based prediction model in preterm birth.
We conducted a systematic review, searching the PubMed, Embase, the Cochrane Library, China National Knowledge Infrastructure, China Biology Medicine disk, VIP Database, and WanFang Data from inception to September 27, 2021. Studies that developed (validated) a prediction model using machine learning methods in preterm birth were included. We used the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement and Prediction model Risk of Bias Assessment Tool (PROBAST) to evaluate the reporting quality and the risk of bias of included studies, respectively. Findings were summarized using descriptive statistics and visual plots. The protocol was registered in PROSPERO (no. CRD 42022301623).
Twenty-nine studies met the inclusion criteria, with 24 development-only studies and 5 development-with-validation studies. Overall, TRIPOD adherence per study ranged from 17% to 79%, with a median adherence of 49%. The reporting of title, abstract, blinding of predictors, sample size justification, explanation of model, and model performance were mostly poor, with TRIPOD adherence ranging from 4% to 17%. For all included studies, 79% had a high overall risk of bias, and 21% had an unclear overall risk of bias. The analysis domain was most commonly rated as high risk of bias in included studies, mainly as a result of small effective sample size, selection of predictors based on univariable analysis, and lack of calibration evaluation.
Reporting and methodological quality of machine learning-based prediction models in preterm birth were poor. It is urgent to improve the design, conduct, and reporting of such studies to boost the application of machine learning-based prediction models in preterm birth in clinical practice.
使用机器学习方法预测早产的文献中,关于报告质量和方法学质量的证据有限。本系统评价旨在评估基于机器学习的早产预测模型的报告质量和偏倚风险。
我们进行了系统评价,检索了从建库至 2021 年 9 月 27 日的 PubMed、Embase、Cochrane 图书馆、中国知网、中国生物医学文献数据库、维普数据库和万方数据。纳入使用机器学习方法开发(验证)预测模型的研究。我们使用多变量预测模型个体预后或诊断的透明报告(TRIPOD)声明和预测模型风险偏倚评估工具(PROBAST)分别评估纳入研究的报告质量和偏倚风险,结果使用描述性统计和可视化图进行总结。方案在 PROSPERO(编号:CRD42022301623)注册。
29 项研究符合纳入标准,其中 24 项为仅开发研究,5 项为开发与验证研究。总体而言,每项研究的 TRIPOD 依从性从 17%到 79%不等,中位数为 49%。标题、摘要、预测因素的盲法、样本量依据说明、模型解释和模型性能的报告大多较差,TRIPOD 依从性为 4%至 17%。所有纳入研究中,79%的研究整体偏倚风险高,21%的研究整体偏倚风险不明确。分析域在纳入研究中最常被评为高偏倚风险,主要是因为有效样本量小、基于单变量分析选择预测因素以及缺乏校准评估。
基于机器学习的早产预测模型的报告和方法学质量较差。迫切需要改进此类研究的设计、实施和报告,以促进基于机器学习的预测模型在早产临床实践中的应用。