Suppr超能文献

构建基于机器学习的中期流产风险预测模型。

Building a machine learning-based risk prediction model for second-trimester miscarriage.

机构信息

Department of Obstetrics and Gynecology, Women and Children's Hospital of Ningbo University, No. 339 Liuting Street, Haishu District, Ningbo, 315012, Zhejiang, China.

出版信息

BMC Pregnancy Childbirth. 2024 Nov 9;24(1):738. doi: 10.1186/s12884-024-06942-w.

Abstract

BACKGROUND

Second-trimester miscarriage is a common adverse pregnancy outcome that imposes substantial economic and psychological pressures on both the physical and mental well-being of patients and their families. Currently, there is a scarcity of research on predictive models for the risk of second-trimester miscarriage.

METHODS

Clinical data were retrospectively collected from patients who were in the second trimester of pregnancy (between 14+0 and 27+6 weeks gestation), whose main diagnosis was "threatened abortion" and who were hospitalized at the Women and Children's Hospital of Ningbo University from January 2020 to October 2023. Following preliminary data processing, the patient cohort was randomly stratified into a training cohort and a validation cohort at proportions of 70% and 30%, respectively. The Boruta algorithm and multifactor analysis were used to refine feature factors and determine the optimal features linked to second-trimester miscarriages. The imbalanced dataset from the training cohort was rectified by applying the SMOTE oversampling approach. Seven machine-learning models were built and subjected to a comprehensive analysis to validate and evaluate their predictive capabilities. Through this rigorous assessment, the optimal model was selected. Shapley additive explanations (SHAP) were generated to provide insights into the model's predictions, and a visual representation of the predictive model was built.

RESULTS

A total of 2006 patients were included in the study; 395 (19.69%) of them had second-trimester miscarriages. XGBoost was shown to be the optimal model after a comparison of seven different models utilizing metrics such as accuracy, precision, recall, the F1 score, precision-recall average precision, the receiver operating characteristic-area under the curve, decision curve analysis, and the calibration curve. The most significant feature was cervical length, and the top ten features of second-trimester miscarriage were found using the SHAP technique based on relevance rankings.

CONCLUSION

The risk of a second-trimester miscarriage can be accurately predicted by the visual risk prediction model, which is based on the machine learning mentioned above.

摘要

背景

中期流产是一种常见的不良妊娠结局,给患者及其家庭的身心健康带来了巨大的经济和心理压力。目前,对于中期流产风险的预测模型研究较少。

方法

回顾性收集了 2020 年 1 月至 2023 年 10 月在宁波大学妇女儿童医院住院的孕中期(14+0 周至 27+6 周)、主要诊断为“先兆流产”的患者的临床资料。对患者资料进行初步数据处理后,将患者队列按 70%和 30%的比例随机分层为训练队列和验证队列。采用 Boruta 算法和多因素分析筛选特征因素,并确定与中期流产相关的最优特征因素。应用 SMOTE 过采样方法对训练队列的不平衡数据集进行校正。构建了 7 种机器学习模型,并对其进行综合分析,以验证和评估其预测能力。通过严格评估,选择了最优模型。生成 Shapley 加法解释(SHAP),以了解模型的预测情况,并构建预测模型的可视化表示。

结果

共纳入 2006 例患者,其中 395 例(19.69%)发生中期流产。通过比较 7 种不同模型的准确率、精确率、召回率、F1 评分、精确召回平均精度、受试者工作特征曲线下面积、决策曲线分析和校准曲线等指标,XGBoost 是最优模型。基于相关性排序,使用 SHAP 技术发现了与中期流产相关的前 10 个特征因素。

结论

基于上述机器学习方法构建的可视化风险预测模型可以准确预测中期流产的风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3253/11550545/dcaa1cd8afa0/12884_2024_6942_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验