• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

跨医疗环境的机器学习可推广性:来自多地点新冠病毒筛查的见解

Machine learning generalizability across healthcare settings: insights from multi-site COVID-19 screening.

作者信息

Yang Jenny, Soltan Andrew A S, Clifton David A

机构信息

Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford, Oxford, UK.

John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.

出版信息

NPJ Digit Med. 2022 Jun 7;5(1):69. doi: 10.1038/s41746-022-00614-9.

DOI:10.1038/s41746-022-00614-9
PMID:35672368
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9174159/
Abstract

As patient health information is highly regulated due to privacy concerns, most machine learning (ML)-based healthcare studies are unable to test on external patient cohorts, resulting in a gap between locally reported model performance and cross-site generalizability. Different approaches have been introduced for developing models across multiple clinical sites, however less attention has been given to adopting ready-made models in new settings. We introduce three methods to do this-(1) applying a ready-made model "as-is" (2); readjusting the decision threshold on the model's output using site-specific data and (3); finetuning the model using site-specific data via transfer learning. Using a case study of COVID-19 diagnosis across four NHS Hospital Trusts, we show that all methods achieve clinically-effective performances (NPV > 0.959), with transfer learning achieving the best results (mean AUROCs between 0.870 and 0.925). Our models demonstrate that site-specific customization improves predictive performance when compared to other ready-made approaches.

摘要

由于隐私问题,患者健康信息受到严格监管,大多数基于机器学习(ML)的医疗保健研究无法在外部患者队列上进行测试,导致本地报告的模型性能与跨站点通用性之间存在差距。已经引入了不同的方法来在多个临床站点开发模型,然而,在新环境中采用现成模型的关注度较低。我们介绍了三种方法来做到这一点——(1)直接应用现成模型;(2)使用特定于站点的数据重新调整模型输出的决策阈值;(3)通过迁移学习使用特定于站点的数据对模型进行微调。通过对四个英国国民保健服务(NHS)医院信托机构的新冠肺炎诊断进行案例研究,我们表明所有方法都能实现临床有效的性能(阴性预测值>0.959),迁移学习取得了最佳结果(平均曲线下面积在0.870至0.925之间)。我们的模型表明,与其他现成方法相比,特定于站点的定制提高了预测性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b95b/9174159/3d3b246407a1/41746_2022_614_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b95b/9174159/21fba14b2c8f/41746_2022_614_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b95b/9174159/c6c64ab18a3b/41746_2022_614_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b95b/9174159/3d3b246407a1/41746_2022_614_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b95b/9174159/21fba14b2c8f/41746_2022_614_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b95b/9174159/c6c64ab18a3b/41746_2022_614_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b95b/9174159/3d3b246407a1/41746_2022_614_Fig3_HTML.jpg

相似文献

1
Machine learning generalizability across healthcare settings: insights from multi-site COVID-19 screening.跨医疗环境的机器学习可推广性:来自多地点新冠病毒筛查的见解
NPJ Digit Med. 2022 Jun 7;5(1):69. doi: 10.1038/s41746-022-00614-9.
2
A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals.一种使用低成本微型计算机的二级医疗可扩展联邦学习解决方案:英国医院 COVID-19 筛查测试的隐私保护开发和评估。
Lancet Digit Health. 2024 Feb;6(2):e93-e104. doi: 10.1016/S2589-7500(23)00226-1.
3
Improving prediction for medical institution with limited patient data: Leveraging hospital-specific data based on multicenter collaborative research network.利用多中心协作研究网络的医院特有数据提高有限患者数据医疗机构的预测能力。
Artif Intell Med. 2021 Mar;113:102024. doi: 10.1016/j.artmed.2021.102024. Epub 2021 Jan 23.
4
An adversarial training framework for mitigating algorithmic biases in clinical machine learning.一种用于减轻临床机器学习中算法偏差的对抗训练框架。
NPJ Digit Med. 2023 Mar 29;6(1):55. doi: 10.1038/s41746-023-00805-y.
5
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
6
Towards global model generalizability: independent cross-site feature evaluation for patient-level risk prediction models using the OHDSI network.迈向全球模型通用性:使用 OHDSI 网络进行患者水平风险预测模型的独立跨站点特征评估。
J Am Med Inform Assoc. 2024 Apr 19;31(5):1051-1061. doi: 10.1093/jamia/ocae028.
7
Can Machine Learning Algorithms Predict Which Patients Will Achieve Minimally Clinically Important Differences From Total Joint Arthroplasty?机器学习算法能否预测哪些患者将从全关节置换术中获得最小临床重要差异?
Clin Orthop Relat Res. 2019 Jun;477(6):1267-1279. doi: 10.1097/CORR.0000000000000687.
8
Generalizability of machine learning for classification of schizophrenia based on resting-state functional MRI data.基于静息态功能磁共振成像数据的机器学习在精神分裂症分类中的泛化能力。
Hum Brain Mapp. 2020 Jan;41(1):172-184. doi: 10.1002/hbm.24797. Epub 2019 Oct 1.
9
Information extraction from multi-institutional radiology reports.从多机构放射学报告中提取信息。
Artif Intell Med. 2016 Jan;66:29-39. doi: 10.1016/j.artmed.2015.09.007. Epub 2015 Oct 3.
10
Assessing the Generalizability of a Clinical Machine Learning Model Across Multiple Emergency Departments.评估临床机器学习模型在多个急诊科的可推广性。
Mayo Clin Proc Innov Qual Outcomes. 2022 Apr 26;6(3):193-199. doi: 10.1016/j.mayocpiqo.2022.03.003. eCollection 2022 Jun.

引用本文的文献

1
Multiplex Targeted Proteomic Analysis of Cytokine Ratios for ICU Mortality in Severe COVID-19.用于重症新型冠状病毒肺炎患者重症监护病房死亡率的细胞因子比值多重靶向蛋白质组学分析
Proteomes. 2025 Aug 2;13(3):35. doi: 10.3390/proteomes13030035.
2
What's next for computational systems biology?计算系统生物学的下一步是什么?
Front Syst Biol. 2023 Sep 19;3:1250228. doi: 10.3389/fsysb.2023.1250228. eCollection 2023.
3
Statistical variability in comparing accuracy of neuroimaging based classification models via cross validation.通过交叉验证比较基于神经影像学的分类模型准确性时的统计变异性。

本文引用的文献

1
Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence.通过人工智能中的隐私保护协作推进新冠病毒疾病诊断
Nat Mach Intell. 2021 Dec;3(12):1081-1089. doi: 10.1038/s42256-021-00421-z. Epub 2021 Dec 15.
2
Real-world evaluation of rapid and laboratory-free COVID-19 triage for emergency care: external validation and pilot deployment of artificial intelligence driven screening.真实世界环境下的 COVID-19 快速、无实验室分诊用于紧急护理的评估:人工智能驱动的筛查的外部验证和试点部署。
Lancet Digit Health. 2022 Apr;4(4):e266-e278. doi: 10.1016/S2589-7500(21)00272-7. Epub 2022 Mar 9.
3
Sci Rep. 2025 Aug 6;15(1):28745. doi: 10.1038/s41598-025-12026-2.
4
Prognostic prediction of dengue hemorrhagic fever in pediatric patients with suspected dengue infection: A multi-site study.疑似登革热感染的儿科患者登革出血热的预后预测:一项多中心研究。
PLoS One. 2025 Aug 4;20(8):e0327360. doi: 10.1371/journal.pone.0327360. eCollection 2025.
5
Multicenter Validation of a Machine Learning Model for Surgical Transfusion Risk at 45 US Hospitals.美国45家医院手术输血风险机器学习模型的多中心验证
JAMA Netw Open. 2025 Jun 2;8(6):e2517760. doi: 10.1001/jamanetworkopen.2025.17760.
6
Artificial Intelligence Algorithm Predicts Response to Immune Checkpoint Inhibitors.人工智能算法可预测对免疫检查点抑制剂的反应。
Clin Cancer Res. 2025 Aug 14;31(16):3526-3536. doi: 10.1158/1078-0432.CCR-24-3720.
7
Advancing Musculoskeletal Care Using AI and Digital Health Applications: A Review of Commercial Solutions.利用人工智能和数字健康应用推进肌肉骨骼护理:商业解决方案综述
HSS J. 2025 May 30:15563316251341321. doi: 10.1177/15563316251341321.
8
Hybrid machine learning for real-time prediction of edema trajectory in large middle cerebral artery stroke.用于实时预测大脑中动脉大面积卒中水肿轨迹的混合机器学习
NPJ Digit Med. 2025 May 17;8(1):288. doi: 10.1038/s41746-025-01687-y.
9
Scientific Evidence for Clinical Text Summarization Using Large Language Models: Scoping Review.使用大语言模型进行临床文本摘要的科学证据:范围综述
J Med Internet Res. 2025 May 15;27:e68998. doi: 10.2196/68998.
10
Application of machine learning for the analysis of peripheral blood biomarkers in oral mucosal diseases: a cross-sectional study.机器学习在口腔黏膜疾病外周血生物标志物分析中的应用:一项横断面研究。
BMC Oral Health. 2025 May 10;25(1):703. doi: 10.1186/s12903-025-06095-y.
Prediction across healthcare settings: a case study in predicting emergency department disposition.
跨医疗环境的预测:急诊科处置预测的案例研究
NPJ Digit Med. 2021 Dec 15;4(1):169. doi: 10.1038/s41746-021-00537-x.
4
Federated learning for predicting clinical outcomes in patients with COVID-19.基于联邦学习的 COVID-19 患者临床结局预测
Nat Med. 2021 Oct;27(10):1735-1743. doi: 10.1038/s41591-021-01506-3. Epub 2021 Sep 15.
5
Novel deep transfer learning model for COVID-19 patient detection using X-ray chest images.用于使用胸部X光图像检测新冠肺炎患者的新型深度迁移学习模型。
J Ambient Intell Humaniz Comput. 2023;14(1):469-478. doi: 10.1007/s12652-021-03306-6. Epub 2021 May 15.
6
Reproducibility in machine learning for health research: Still a ways to go.机器学习在健康研究中的可重复性:仍有很长的路要走。
Sci Transl Med. 2021 Mar 24;13(586). doi: 10.1126/scitranslmed.abb1655.
7
CNN-based transfer learning-BiLSTM network: A novel approach for COVID-19 infection detection.基于卷积神经网络的迁移学习-双向长短期记忆网络:一种用于新冠病毒感染检测的新方法。
Appl Soft Comput. 2021 Jan;98:106912. doi: 10.1016/j.asoc.2020.106912. Epub 2020 Nov 18.
8
Fostering reproducibility and generalizability in machine learning for clinical prediction modeling in spine surgery.促进机器学习在脊柱外科临床预测建模中的可重复性和泛化能力。
Spine J. 2021 Oct;21(10):1610-1616. doi: 10.1016/j.spinee.2020.10.006. Epub 2020 Oct 13.
9
Transparency and reproducibility in artificial intelligence.人工智能中的透明度和可重复性。
Nature. 2020 Oct;586(7829):E14-E16. doi: 10.1038/s41586-020-2766-y. Epub 2020 Oct 14.
10
Laboratory diagnosis of COVID-19.新型冠状病毒肺炎的实验室诊断。
J Pediatr (Rio J). 2021 Jan-Feb;97(1):7-12. doi: 10.1016/j.jped.2020.08.001. Epub 2020 Aug 31.