• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较机器学习算法预测慢性髓性白血病患者 5 年生存率。

Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia.

机构信息

Department of Health Information Technology, Faculty of Paramedical, Ilam University of Medical Sciences, Ilam, Iran.

Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.

出版信息

BMC Med Inform Decis Mak. 2022 Sep 6;22(1):236. doi: 10.1186/s12911-022-01980-w.

DOI:10.1186/s12911-022-01980-w
PMID:36068539
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9450320/
Abstract

INTRODUCTION

Chronic myeloid leukemia (CML) is a myeloproliferative disorder resulting from the translocation of chromosomes 19 and 22. CML includes 15-20% of all cases of leukemia. Although bone marrow transplant and, more recently, tyrosine kinase inhibitors (TKIs) as a first-line treatment have significantly prolonged survival in CML patients, accurate prediction using available patient-level factors can be challenging. We intended to predict 5-year survival among CML patients via eight machine learning (ML) algorithms and compare their performance.

METHODS

The data of 837 CML patients were retrospectively extracted and randomly split into training and test segments (70:30 ratio). The outcome variable was 5-year survival with potential values of alive or deceased. The dataset for the full features and important features selected by minimal redundancy maximal relevance (mRMR) feature selection were fed into eight ML techniques, including eXtreme gradient boosting (XGBoost), multilayer perceptron (MLP), pattern recognition network, k-nearest neighborhood (KNN), probabilistic neural network, support vector machine (SVM) (kernel = linear), SVM (kernel = RBF), and J-48. The scikit-learn library in Python was used to implement the models. Finally, the performance of the developed models was measured using some evaluation criteria with 95% confidence intervals (CI).

RESULTS

Spleen palpable, age, and unexplained hemorrhage were identified as the top three effective features affecting CML 5-year survival. The performance of ML models using the selected-features was superior to that of the full-features dataset. Among the eight ML algorithms, SVM (kernel = RBF) had the best performance in tenfold cross-validation with an accuracy of 85.7%, specificity of 85%, sensitivity of 86%, F-measure of 87%, kappa statistic of 86.1%, and area under the curve (AUC) of 85% for the selected-features. Using the full-features dataset yielded an accuracy of 69.7%, specificity of 69.1%, sensitivity of 71.3%, F-measure of 72%, kappa statistic of 75.2%, and AUC of 70.1%.

CONCLUSIONS

Accurate prediction of the survival likelihood of CML patients can inform caregivers to promote patient prognostication and choose the best possible treatment path. While external validation is required, our developed models will offer customized treatment and may guide the prescription of personalized medicine for CML patients.

摘要

简介

慢性髓性白血病(CML)是一种骨髓增生性疾病,源于染色体 19 和 22 的易位。CML 占所有白血病病例的 15-20%。尽管骨髓移植和最近的酪氨酸激酶抑制剂(TKI)作为一线治疗方法显著延长了 CML 患者的生存时间,但使用现有患者水平的因素进行准确预测可能具有挑战性。我们旨在通过 8 种机器学习(ML)算法预测 CML 患者的 5 年生存率,并比较它们的性能。

方法

回顾性提取 837 例 CML 患者的数据,并将其随机分为训练和测试段(70:30 比例)。结局变量为 5 年生存率,可能的值为存活或死亡。将全特征数据集和通过最小冗余最大相关性(mRMR)特征选择选择的重要特征数据集输入到 8 种 ML 技术中,包括极端梯度增强(XGBoost)、多层感知机(MLP)、模式识别网络、k-最近邻(KNN)、概率神经网络、支持向量机(SVM)(核=线性)、SVM(核=RBF)和 J-48。使用 Python 中的 scikit-learn 库实现模型。最后,使用 95%置信区间(CI)的一些评估标准来衡量开发模型的性能。

结果

脾肿大、年龄和不明原因出血被确定为影响 CML 5 年生存率的前三个有效特征。使用选定特征的 ML 模型的性能优于全特征数据集。在 8 种 ML 算法中,SVM(核=RBF)在 10 倍交叉验证中的表现最佳,准确率为 85.7%,特异性为 85%,灵敏度为 86%,F 度量为 87%,kappa 统计量为 86.1%,选定特征的曲线下面积(AUC)为 85%。使用全特征数据集的准确率为 69.7%,特异性为 69.1%,灵敏度为 71.3%,F 度量为 72%,kappa 统计量为 75.2%,AUC 为 70.1%。

结论

准确预测 CML 患者的生存可能性可以为护理人员提供信息,以促进患者预后判断,并选择最佳的治疗途径。虽然需要外部验证,但我们开发的模型将提供定制的治疗,并可能指导 CML 患者的个性化药物处方。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/c50290c0209d/12911_2022_1980_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/ceb9dc2373c3/12911_2022_1980_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/5b87d84addbe/12911_2022_1980_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/628c624b0f63/12911_2022_1980_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/cfbb6782993c/12911_2022_1980_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/c50290c0209d/12911_2022_1980_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/ceb9dc2373c3/12911_2022_1980_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/5b87d84addbe/12911_2022_1980_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/628c624b0f63/12911_2022_1980_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/cfbb6782993c/12911_2022_1980_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9313/9450320/c50290c0209d/12911_2022_1980_Fig5_HTML.jpg

相似文献

1
Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia.比较机器学习算法预测慢性髓性白血病患者 5 年生存率。
BMC Med Inform Decis Mak. 2022 Sep 6;22(1):236. doi: 10.1186/s12911-022-01980-w.
2
Prediction of acute organophosphate poisoning severity using machine learning techniques.利用机器学习技术预测急性有机磷中毒的严重程度。
Toxicology. 2023 Mar 1;486:153431. doi: 10.1016/j.tox.2023.153431. Epub 2023 Jan 19.
3
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
4
Predicting BRAFV600E mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastography.基于超声弹性成像的 6 种机器学习算法预测甲状腺乳头状癌 BRAFV600E 突变。
Sci Rep. 2023 Aug 3;13(1):12604. doi: 10.1038/s41598-023-39747-6.
5
Machine learning model based on enhanced CT radiomics for the preoperative prediction of lymphovascular invasion in esophageal squamous cell carcinoma.基于增强CT影像组学的机器学习模型用于术前预测食管鳞状细胞癌的淋巴管侵犯
Front Oncol. 2024 Feb 23;14:1308317. doi: 10.3389/fonc.2024.1308317. eCollection 2024.
6
Seminal quality prediction using data mining methods.使用数据挖掘方法进行精液质量预测。
Technol Health Care. 2014;22(4):531-45. doi: 10.3233/THC-140816.
7
Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.机器学习模型在预测髋部骨折手术后输血可能性中的应用。
Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.
8
Predicting hospital readmission risk in patients with COVID-19: A machine learning approach.预测新冠病毒肺炎患者的医院再入院风险:一种机器学习方法。
Inform Med Unlocked. 2022;30:100908. doi: 10.1016/j.imu.2022.100908. Epub 2022 Mar 8.
9
Optimizing prognostic factors of five-year survival in gastric cancer patients using feature selection techniques with machine learning algorithms: a comparative study.使用机器学习算法进行特征选择技术优化胃癌患者五年生存率的预后因素:一项比较研究。
BMC Med Inform Decis Mak. 2023 Apr 6;23(1):54. doi: 10.1186/s12911-023-02154-y.
10
Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets.我们是否需要不同的机器学习算法来进行定量构效关系建模?对 16 种机器学习算法在 14 个定量构效关系数据集上的综合评估。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa321.

引用本文的文献

1
Development and validation of an interpretable machine learning model for predicting Philadelphia chromosome-positive acute lymphoblastic leukaemia using clinical and laboratory parameters: a single-centre retrospective study.利用临床和实验室参数预测费城染色体阳性急性淋巴细胞白血病的可解释机器学习模型的开发与验证:一项单中心回顾性研究
BMJ Open. 2025 Jun 27;15(6):e097526. doi: 10.1136/bmjopen-2024-097526.
2
Identification of relevant features using SEQENS to improve supervised machine learning models predicting AML treatment outcome.使用SEQENS识别相关特征以改进预测急性髓系白血病治疗结果的监督式机器学习模型。
BMC Med Inform Decis Mak. 2025 May 1;25(1):179. doi: 10.1186/s12911-025-03001-y.
3

本文引用的文献

1
Prediction of complete remission and survival in acute myeloid leukemia using supervised machine learning.利用监督机器学习预测急性髓细胞白血病的完全缓解和生存。
Haematologica. 2023 Mar 1;108(3):690-704. doi: 10.3324/haematol.2021.280027.
2
Predicting patient-level new-onset atrial fibrillation from population-based nationwide electronic health records: protocol of FIND-AF for developing a precision medicine prediction model using artificial intelligence.从基于人群的全国性电子健康记录中预测患者新发心房颤动:使用人工智能开发精准医学预测模型的 FIND-AF 研究方案。
BMJ Open. 2021 Nov 2;11(11):e052887. doi: 10.1136/bmjopen-2021-052887.
3
Utilization of Machine Learning in the Prediction, Diagnosis, Prognosis, and Management of Chronic Myeloid Leukemia.
机器学习在慢性髓性白血病的预测、诊断、预后及管理中的应用
Int J Mol Sci. 2025 Mar 12;26(6):2535. doi: 10.3390/ijms26062535.
4
Biological Data Resources and Machine Learning Frameworks for Hematology Research.用于血液学研究的生物数据资源和机器学习框架
Genomics Proteomics Bioinformatics. 2025 May 30;23(2). doi: 10.1093/gpbjnl/qzaf021.
5
Application of artificial intelligence in chronic myeloid leukemia (CML) disease prediction and management: a scoping review.人工智能在慢性髓细胞白血病(CML)疾病预测和管理中的应用:范围综述。
BMC Cancer. 2024 Aug 20;24(1):1026. doi: 10.1186/s12885-024-12764-y.
6
Artificial Intelligence-Based Management of Adult Chronic Myeloid Leukemia: Where Are We and Where Are We Going?基于人工智能的成人慢性髓性白血病管理:我们现状如何,又将走向何方?
Cancers (Basel). 2024 Feb 20;16(5):848. doi: 10.3390/cancers16050848.
7
Artificial Intelligence-Assisted Diagnostic Cytology and Genomic Testing for Hematologic Disorders.人工智能辅助诊断细胞学和血液病基因组检测。
Cells. 2023 Jun 30;12(13):1755. doi: 10.3390/cells12131755.
8
Predicting the Activity of Oral Lichen Planus with Glycolysis-related Molecules: A Scikit-learn-based Function.基于 Sklearn 的算法预测口腔扁平苔藓的活性:与糖酵解相关分子的研究
Curr Med Sci. 2023 Jun;43(3):602-608. doi: 10.1007/s11596-023-2716-7. Epub 2023 Apr 28.
9
Applications of Machine Learning in Chronic Myeloid Leukemia.机器学习在慢性髓性白血病中的应用
Diagnostics (Basel). 2023 Apr 3;13(7):1330. doi: 10.3390/diagnostics13071330.
Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.
采用机器学习技术的急性髓系白血病患者生存预后因素。
PLoS One. 2021 Jul 21;16(7):e0254976. doi: 10.1371/journal.pone.0254976. eCollection 2021.
4
A clinical prediction model identifies a subgroup with inferior survival within intermediate risk acute myeloid leukemia.一种临床预测模型在中危急性髓系白血病中识别出了生存期较差的亚组。
J Cancer. 2021 Jun 11;12(16):4912-4923. doi: 10.7150/jca.57231. eCollection 2021.
5
A Machine Learning Model to Successfully Predict Future Diagnosis of Chronic Myelogenous Leukemia With Retrospective Electronic Health Records Data.机器学习模型成功预测慢性髓性白血病的未来诊断,使用回顾性电子健康记录数据。
Am J Clin Pathol. 2021 Nov 8;156(6):1142-1148. doi: 10.1093/ajcp/aqab086.
6
Personalized Survival Prediction of Patients With Acute Myeloblastic Leukemia Using Gene Expression Profiling.利用基因表达谱对急性髓系白血病患者进行个性化生存预测
Front Oncol. 2021 Mar 29;11:657191. doi: 10.3389/fonc.2021.657191. eCollection 2021.
7
A Heterogeneous Ensemble Learning Method For Neuroblastoma Survival Prediction.一种用于神经母细胞瘤生存预测的异构集成学习方法。
IEEE J Biomed Health Inform. 2022 Apr;26(4):1472-1483. doi: 10.1109/JBHI.2021.3073056. Epub 2022 Apr 14.
8
Clinical features and short-term outcomes of COVID-19 in Tehran, Iran: An analysis of mortality and hospital stay.伊朗德黑兰 COVID-19 的临床特征和短期结局:死亡率和住院时间分析。
Acta Biomed. 2020 Nov 10;91(4):e2020147. doi: 10.23750/abm.v91i4.10206.
9
How Do Machines Learn? Artificial Intelligence as a New Era in Medicine.机器如何学习?人工智能作为医学的新时代。
J Pers Med. 2021 Jan 7;11(1):32. doi: 10.3390/jpm11010032.
10
Mortality rate in patients with chronic myeloid leukemia in chronic phase treated with frontline second generation tyrosine kinase inhibitors: a retrospective analysis by the monitoring registries of the Italian Medicines Agency (AIFA).一线第二代酪氨酸激酶抑制剂治疗的慢性期慢性髓性白血病患者的死亡率:意大利药品管理局(AIFA)监测登记处的回顾性分析。
Ann Hematol. 2021 Feb;100(2):481-485. doi: 10.1007/s00277-021-04406-1. Epub 2021 Jan 7.