• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

为何在临床中实施机器学习算法并非即插即用的解决方案:一项针对急性白血病亚型诊断的机器学习算法的模拟研究

Why implementing machine learning algorithms in the clinic is not a plug-and-play solution: a simulation study of a machine learning algorithm for acute leukaemia subtype diagnosis.

作者信息

Pucher Gernot, Rostalski Till, Nensa Felix, Kleesiek Jens, Reinhardt Hans Christian, Sauer Christopher Martin

机构信息

Department of Haematology & Stem Cell Transplantation, West German Cancer Center, University Hospital Essen, Essen, Germany; Laboratory for Clinical Research and Real-World Evidence, Institute for Artificial Intelligence in Medicine, University Hospital Essen, Essen, Germany.

Laboratory for Clinical Research and Real-World Evidence, Institute for Artificial Intelligence in Medicine, University Hospital Essen, Essen, Germany.

出版信息

EBioMedicine. 2025 Jan;111:105526. doi: 10.1016/j.ebiom.2024.105526. Epub 2024 Dec 24.

DOI:10.1016/j.ebiom.2024.105526
PMID:39721215
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11732467/
Abstract

BACKGROUND

Artificial intelligence (AI) and machine learning (ML) algorithms have shown great promise in clinical medicine. Despite the increasing number of published algorithms, most remain unvalidated in real-world clinical settings. This study aims to simulate the practical implementation challenges of a recently developed ML algorithm, AI-PAL, designed for the diagnosis of acute leukaemia and report on its performance.

METHODS

We conducted a detailed simulation of the AI-PAL algorithm's implementation at the University Hospital Essen. Cohort building was performed using our Fast Healthcare Interoperability Resources (FHIR) database, identifying all initially diagnosed patients with acute leukaemia and selected differential diagnoses. The algorithm's performance was assessed by reproducing the original study's results.

FINDINGS

The AI-PAL algorithm demonstrated significantly lower performance in our simulated clinical implementation compared to prior published results. The area under the receiver operating characteristic curve for acute lymphoblastic leukaemia dropped to 0.67 (95% CI: 0.61-0.73) and for acute myeloid leukaemia to 0.71 (95% CI: 0.65-0.76). The recalibration of probability cutoffs determining confident diagnoses increased the number of confident positive diagnosis for acute leukaemia from 98 to 160, highlighting the necessity of local validation and adjustments.

INTERPRETATION

The findings underscore the challenges of implementing ML algorithms in clinical practice. Despite robust development and validation in research settings, ML models like AI-PAL may require significant adjustments and recalibration to maintain performance in different clinical settings. Our results suggest that clinical decision support algorithms should undergo local performance validation before integration into routine care to ensure reliability and safety.

FUNDING

This study was supported by the DFG-cofounded UMEA Clinician Scientist Program and the Ministry of Culture and Science of the State of North Rhine-Westphalia.

摘要

背景

人工智能(AI)和机器学习(ML)算法在临床医学中显示出巨大潜力。尽管已发表的算法数量不断增加,但大多数在实际临床环境中仍未得到验证。本研究旨在模拟一种最近开发的用于诊断急性白血病的ML算法AI-PAL在实际应用中的挑战,并报告其性能。

方法

我们在埃森大学医院对AI-PAL算法的实施进行了详细模拟。使用我们的快速医疗保健互操作性资源(FHIR)数据库构建队列,识别所有最初诊断为急性白血病的患者以及选定的鉴别诊断。通过重现原始研究的结果来评估该算法的性能。

结果

与先前发表的结果相比,AI-PAL算法在我们模拟的临床应用中表现出明显较低的性能。急性淋巴细胞白血病的受试者操作特征曲线下面积降至0.67(95%置信区间:0.61-0.73),急性髓细胞白血病降至0.71(95%置信区间:0.65-0.76)。重新校准确定确诊诊断的概率阈值后,急性白血病的确诊阳性诊断数量从98增加到160,突出了进行本地验证和调整的必要性。

解读

研究结果强调了在临床实践中实施ML算法的挑战。尽管在研究环境中进行了稳健的开发和验证,但像AI-PAL这样的ML模型在不同临床环境中可能需要进行重大调整和重新校准以维持性能。我们的结果表明,临床决策支持算法在整合到常规护理之前应进行本地性能验证,以确保可靠性和安全性。

资金

本研究得到了德国研究基金会共同资助的UME临床科学家计划以及北莱茵-威斯特法伦州文化和科学部的支持。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dfe/11732467/b06b5705805e/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dfe/11732467/734b8d0a8aa9/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dfe/11732467/1b64dbe51a1a/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dfe/11732467/b06b5705805e/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dfe/11732467/734b8d0a8aa9/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dfe/11732467/1b64dbe51a1a/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dfe/11732467/b06b5705805e/gr3.jpg

相似文献

1
Why implementing machine learning algorithms in the clinic is not a plug-and-play solution: a simulation study of a machine learning algorithm for acute leukaemia subtype diagnosis.为何在临床中实施机器学习算法并非即插即用的解决方案:一项针对急性白血病亚型诊断的机器学习算法的模拟研究
EBioMedicine. 2025 Jan;111:105526. doi: 10.1016/j.ebiom.2024.105526. Epub 2024 Dec 24.
2
Evaluation of a machine-learning model based on laboratory parameters for the prediction of acute leukaemia subtypes: a multicentre model development and validation study in France.基于实验室参数的机器学习模型对急性白血病亚型预测的评估:法国多中心模型开发和验证研究。
Lancet Digit Health. 2024 May;6(5):e323-e333. doi: 10.1016/S2589-7500(24)00044-X.
3
Prediction of Allogeneic Hematopoietic Stem-Cell Transplantation Mortality 100 Days After Transplantation Using a Machine Learning Algorithm: A European Group for Blood and Marrow Transplantation Acute Leukemia Working Party Retrospective Data Mining Study.基于机器学习算法的移植后 100 天异基因造血干细胞移植死亡率预测:欧洲血液和骨髓移植协会急性白血病工作组回顾性数据挖掘研究。
J Clin Oncol. 2015 Oct 1;33(28):3144-51. doi: 10.1200/JCO.2014.59.1339. Epub 2015 Aug 3.
4
AAPM task group report 273: Recommendations on best practices for AI and machine learning for computer-aided diagnosis in medical imaging.AAPM 工作组报告 273:关于医学影像计算机辅助诊断中人工智能和机器学习的最佳实践建议。
Med Phys. 2023 Feb;50(2):e1-e24. doi: 10.1002/mp.16188. Epub 2023 Jan 6.
5
IoMT-Based Automated Detection and Classification of Leukemia Using Deep Learning.基于物联网的深度学习白血病自动检测与分类
J Healthc Eng. 2020 Dec 3;2020:6648574. doi: 10.1155/2020/6648574. eCollection 2020.
6
Optimizing Initial Vancomycin Dosing in Hospitalized Patients Using Machine Learning Approach for Enhanced Therapeutic Outcomes: Algorithm Development and Validation Study.使用机器学习方法优化住院患者初始万古霉素剂量以提高治疗效果:算法开发与验证研究
J Med Internet Res. 2025 Mar 31;27:e63983. doi: 10.2196/63983.
7
Comparison of Chest Radiograph Interpretations by Artificial Intelligence Algorithm vs Radiology Residents.人工智能算法与放射科住院医师对胸部 X 线片解读的比较。
JAMA Netw Open. 2020 Oct 1;3(10):e2022779. doi: 10.1001/jamanetworkopen.2020.22779.
8
Complete Blood Count and Monocyte Distribution Width-Based Machine Learning Algorithms for Sepsis Detection: Multicentric Development and External Validation Study.基于全血细胞计数和单核细胞分布宽度的脓毒症检测机器学习算法:多中心开发与外部验证研究
J Med Internet Res. 2025 Feb 26;27:e55492. doi: 10.2196/55492.
9
Artificial intelligence for breast cancer detection and its health technology assessment: A scoping review.用于乳腺癌检测的人工智能及其健康技术评估:一项范围综述。
Comput Biol Med. 2025 Jan;184:109391. doi: 10.1016/j.compbiomed.2024.109391. Epub 2024 Nov 22.
10
The development an artificial intelligence algorithm for early sepsis diagnosis in the intensive care unit.开发一种用于重症监护病房早期脓毒症诊断的人工智能算法。
Int J Med Inform. 2020 Sep;141:104176. doi: 10.1016/j.ijmedinf.2020.104176. Epub 2020 May 21.

本文引用的文献

1
Generalization-a key challenge for responsible AI in patient-facing clinical applications.泛化——面向患者的临床应用中负责任人工智能的关键挑战。
NPJ Digit Med. 2024 May 21;7(1):126. doi: 10.1038/s41746-024-01127-3.
2
Evaluation of a machine-learning model based on laboratory parameters for the prediction of acute leukaemia subtypes: a multicentre model development and validation study in France.基于实验室参数的机器学习模型对急性白血病亚型预测的评估:法国多中心模型开发和验证研究。
Lancet Digit Health. 2024 May;6(5):e323-e333. doi: 10.1016/S2589-7500(24)00044-X.
3
All models are wrong and yours are useless: making clinical prediction models impactful for patients.
所有模型都是有缺陷的,而你的模型毫无用处:让临床预测模型对患者产生影响。
NPJ Precis Oncol. 2024 Feb 28;8(1):54. doi: 10.1038/s41698-024-00553-6.
4
Illusory generalizability of clinical prediction models.临床预测模型的虚幻泛化性。
Science. 2024 Jan 12;383(6679):164-167. doi: 10.1126/science.adg8538. Epub 2024 Jan 11.
5
Computerized cognitive training for memory functions in mild cognitive impairment or dementia: a systematic review and meta-analysis.针对轻度认知障碍或痴呆患者记忆功能的计算机化认知训练:一项系统评价与荟萃分析。
NPJ Digit Med. 2024 Jan 3;7(1):1. doi: 10.1038/s41746-023-00987-5.
6
Exploiting tumor aneuploidy as a biomarker and therapeutic target in patients treated with immune checkpoint blockade.将肿瘤非整倍体作为生物标志物和治疗靶点应用于接受免疫检查点阻断治疗的患者。
NPJ Precis Oncol. 2024 Jan 2;8(1):1. doi: 10.1038/s41698-023-00492-8.
7
A Systematic Review of the Barriers to the Implementation of Artificial Intelligence in Healthcare.医疗保健领域人工智能实施障碍的系统评价
Cureus. 2023 Oct 4;15(10):e46454. doi: 10.7759/cureus.46454. eCollection 2023 Oct.
8
FHIR-PYrate: a data science friendly Python package to query FHIR servers.FHIR-PYrate:一个面向数据科学的 Python 包,用于查询 FHIR 服务器。
BMC Health Serv Res. 2023 Jul 6;23(1):734. doi: 10.1186/s12913-023-09498-1.
9
Systematic Review and Comparison of Publicly Available ICU Data Sets-A Decision Guide for Clinicians and Data Scientists.系统综述和比较公开可用的 ICU 数据集——临床医生和数据科学家的决策指南。
Crit Care Med. 2022 Jun 1;50(6):e581-e588. doi: 10.1097/CCM.0000000000005517. Epub 2022 Mar 2.
10
AI in health and medicine.人工智能在医疗中的应用。
Nat Med. 2022 Jan;28(1):31-38. doi: 10.1038/s41591-021-01614-0. Epub 2022 Jan 20.