跨医疗环境的机器学习可推广性：来自多地点新冠病毒筛查的见解

Machine learning generalizability across healthcare settings: insights from multi-site COVID-19 screening.

作者信息

Yang Jenny, Soltan Andrew A S, Clifton David A

机构信息

Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford, Oxford, UK.

John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.

出版信息

NPJ Digit Med. 2022 Jun 7;5(1):69. doi: 10.1038/s41746-022-00614-9.

DOI:10.1038/s41746-022-00614-9

PMID:35672368

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9174159/

Abstract

As patient health information is highly regulated due to privacy concerns, most machine learning (ML)-based healthcare studies are unable to test on external patient cohorts, resulting in a gap between locally reported model performance and cross-site generalizability. Different approaches have been introduced for developing models across multiple clinical sites, however less attention has been given to adopting ready-made models in new settings. We introduce three methods to do this-(1) applying a ready-made model "as-is" (2); readjusting the decision threshold on the model's output using site-specific data and (3); finetuning the model using site-specific data via transfer learning. Using a case study of COVID-19 diagnosis across four NHS Hospital Trusts, we show that all methods achieve clinically-effective performances (NPV > 0.959), with transfer learning achieving the best results (mean AUROCs between 0.870 and 0.925). Our models demonstrate that site-specific customization improves predictive performance when compared to other ready-made approaches.

摘要

由于隐私问题，患者健康信息受到严格监管，大多数基于机器学习（ML）的医疗保健研究无法在外部患者队列上进行测试，导致本地报告的模型性能与跨站点通用性之间存在差距。已经引入了不同的方法来在多个临床站点开发模型，然而，在新环境中采用现成模型的关注度较低。我们介绍了三种方法来做到这一点——（1）直接应用现成模型；（2）使用特定于站点的数据重新调整模型输出的决策阈值；（3）通过迁移学习使用特定于站点的数据对模型进行微调。通过对四个英国国民保健服务（NHS）医院信托机构的新冠肺炎诊断进行案例研究，我们表明所有方法都能实现临床有效的性能（阴性预测值>0.959），迁移学习取得了最佳结果（平均曲线下面积在0.870至0.925之间）。我们的模型表明，与其他现成方法相比，特定于站点的定制提高了预测性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b95b/9174159/21fba14b2c8f/41746_2022_614_Fig1_HTML.jpg

相似文献

Machine learning generalizability across healthcare settings: insights from multi-site COVID-19 screening.

NPJ Digit Med. 2022 Jun 7;5(1):69. doi: 10.1038/s41746-022-00614-9.

A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals.

Lancet Digit Health. 2024 Feb;6(2):e93-e104. doi: 10.1016/S2589-7500(23)00226-1.

Improving prediction for medical institution with limited patient data: Leveraging hospital-specific data based on multicenter collaborative research network.

Artif Intell Med. 2021 Mar;113:102024. doi: 10.1016/j.artmed.2021.102024. Epub 2021 Jan 23.

An adversarial training framework for mitigating algorithmic biases in clinical machine learning.

NPJ Digit Med. 2023 Mar 29;6(1):55. doi: 10.1038/s41746-023-00805-y.

Deep convolutional neural network and IoT technology for healthcare.

Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.

Towards global model generalizability: independent cross-site feature evaluation for patient-level risk prediction models using the OHDSI network.

J Am Med Inform Assoc. 2024 Apr 19;31(5):1051-1061. doi: 10.1093/jamia/ocae028.

Can Machine Learning Algorithms Predict Which Patients Will Achieve Minimally Clinically Important Differences From Total Joint Arthroplasty?

Clin Orthop Relat Res. 2019 Jun;477(6):1267-1279. doi: 10.1097/CORR.0000000000000687.

Generalizability of machine learning for classification of schizophrenia based on resting-state functional MRI data.

Hum Brain Mapp. 2020 Jan;41(1):172-184. doi: 10.1002/hbm.24797. Epub 2019 Oct 1.

Information extraction from multi-institutional radiology reports.

Artif Intell Med. 2016 Jan;66:29-39. doi: 10.1016/j.artmed.2015.09.007. Epub 2015 Oct 3.

Assessing the Generalizability of a Clinical Machine Learning Model Across Multiple Emergency Departments.

Mayo Clin Proc Innov Qual Outcomes. 2022 Apr 26;6(3):193-199. doi: 10.1016/j.mayocpiqo.2022.03.003. eCollection 2022 Jun.

引用本文的文献

Multiplex Targeted Proteomic Analysis of Cytokine Ratios for ICU Mortality in Severe COVID-19.

Proteomes. 2025 Aug 2;13(3):35. doi: 10.3390/proteomes13030035.

What's next for computational systems biology?

Front Syst Biol. 2023 Sep 19;3:1250228. doi: 10.3389/fsysb.2023.1250228. eCollection 2023.

Statistical variability in comparing accuracy of neuroimaging based classification models via cross validation.

Sci Rep. 2025 Aug 6;15(1):28745. doi: 10.1038/s41598-025-12026-2.

Prognostic prediction of dengue hemorrhagic fever in pediatric patients with suspected dengue infection: A multi-site study.

PLoS One. 2025 Aug 4;20(8):e0327360. doi: 10.1371/journal.pone.0327360. eCollection 2025.

Multicenter Validation of a Machine Learning Model for Surgical Transfusion Risk at 45 US Hospitals.

JAMA Netw Open. 2025 Jun 2;8(6):e2517760. doi: 10.1001/jamanetworkopen.2025.17760.

Artificial Intelligence Algorithm Predicts Response to Immune Checkpoint Inhibitors.

Clin Cancer Res. 2025 Aug 14;31(16):3526-3536. doi: 10.1158/1078-0432.CCR-24-3720.

Advancing Musculoskeletal Care Using AI and Digital Health Applications: A Review of Commercial Solutions.

HSS J. 2025 May 30:15563316251341321. doi: 10.1177/15563316251341321.

Hybrid machine learning for real-time prediction of edema trajectory in large middle cerebral artery stroke.

NPJ Digit Med. 2025 May 17;8(1):288. doi: 10.1038/s41746-025-01687-y.

Scientific Evidence for Clinical Text Summarization Using Large Language Models: Scoping Review.

J Med Internet Res. 2025 May 15;27:e68998. doi: 10.2196/68998.

Application of machine learning for the analysis of peripheral blood biomarkers in oral mucosal diseases: a cross-sectional study.

BMC Oral Health. 2025 May 10;25(1):703. doi: 10.1186/s12903-025-06095-y.

本文引用的文献

Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence.

Nat Mach Intell. 2021 Dec;3(12):1081-1089. doi: 10.1038/s42256-021-00421-z. Epub 2021 Dec 15.

Real-world evaluation of rapid and laboratory-free COVID-19 triage for emergency care: external validation and pilot deployment of artificial intelligence driven screening.

Lancet Digit Health. 2022 Apr;4(4):e266-e278. doi: 10.1016/S2589-7500(21)00272-7. Epub 2022 Mar 9.

Prediction across healthcare settings: a case study in predicting emergency department disposition.

NPJ Digit Med. 2021 Dec 15;4(1):169. doi: 10.1038/s41746-021-00537-x.

Federated learning for predicting clinical outcomes in patients with COVID-19.

Nat Med. 2021 Oct;27(10):1735-1743. doi: 10.1038/s41591-021-01506-3. Epub 2021 Sep 15.

Novel deep transfer learning model for COVID-19 patient detection using X-ray chest images.

J Ambient Intell Humaniz Comput. 2023;14(1):469-478. doi: 10.1007/s12652-021-03306-6. Epub 2021 May 15.

Reproducibility in machine learning for health research: Still a ways to go.

Sci Transl Med. 2021 Mar 24;13(586). doi: 10.1126/scitranslmed.abb1655.

CNN-based transfer learning-BiLSTM network: A novel approach for COVID-19 infection detection.

Appl Soft Comput. 2021 Jan;98:106912. doi: 10.1016/j.asoc.2020.106912. Epub 2020 Nov 18.

Fostering reproducibility and generalizability in machine learning for clinical prediction modeling in spine surgery.

Spine J. 2021 Oct;21(10):1610-1616. doi: 10.1016/j.spinee.2020.10.006. Epub 2020 Oct 13.

Transparency and reproducibility in artificial intelligence.

Nature. 2020 Oct;586(7829):E14-E16. doi: 10.1038/s41586-020-2766-y. Epub 2020 Oct 14.

Laboratory diagnosis of COVID-19.

J Pediatr (Rio J). 2021 Jan-Feb;97(1):7-12. doi: 10.1016/j.jped.2020.08.001. Epub 2020 Aug 31.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

跨医疗环境的机器学习可推广性：来自多地点新冠病毒筛查的见解

Machine learning generalizability across healthcare settings: insights from multi-site COVID-19 screening.

作者信息

Yang Jenny, Soltan Andrew A S, Clifton David A

机构信息

Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford, Oxford, UK.

John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.

出版信息

NPJ Digit Med. 2022 Jun 7;5(1):69. doi: 10.1038/s41746-022-00614-9.

DOI:10.1038/s41746-022-00614-9

PMID:35672368

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9174159/

Abstract

摘要

跨医疗环境的机器学习可推广性：来自多地点新冠病毒筛查的见解

Machine learning generalizability across healthcare settings: insights from multi-site COVID-19 screening.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

跨医疗环境的机器学习可推广性：来自多地点新冠病毒筛查的见解

Machine learning generalizability across healthcare settings: insights from multi-site COVID-19 screening.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献