A study in transfer learning: leveraging data from multiple hospitals to enhance hospital-specific predictions.

Affiliations

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.

Microsoft Research, Redmond, Washington, USA.

Publication information

J Am Med Inform Assoc. 2014 Jul-Aug;21(4):699-706. doi: 10.1136/amiajnl-2013-002162. Epub 2014 Jan 30.

DOI:10.1136/amiajnl-2013-002162

PMID:24481703

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4078276/

Abstract

BACKGROUND

Data-driven risk stratification models built using data from a single hospital often suffer from a paucity of training data. However, leveraging data from other hospitals can be challenging owing to institutional differences in patient populations and in how data are coded and captured.

OBJECTIVE

To investigate three approaches to learning hospital-specific predictions about the risk of hospital-associated infection with Clostridium difficile, and perform a comparative analysis of the value of different ways of using external data to enhance hospital-specific predictions.

MATERIALS AND METHODS

We evaluated each approach on 132 853 admissions from three hospitals, varying in size and location. The first approach was a single-task approach, in which only training data from the target hospital (ie, the hospital for which the model was intended) were used. The second used only data from the other two hospitals. The third approach jointly incorporated data from all hospitals while seeking a solution in the target space.
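The three training regimes described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the study: the synthetic two-feature "admissions", the helper names (`fit_logreg`, `make_hospital`, `predict`), the plain gradient-descent logistic regression, and in particular the use of simple instance weighting (target admissions weighted 5x) as a stand-in for "jointly incorporating data from all hospitals while seeking a solution in the target space". The authors' actual formulation may differ.

```python
import math
import random


def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logreg(rows, labels, weights=None, lr=0.1, epochs=200, l2=0.01):
    """Weighted, L2-regularized logistic regression via batch gradient descent."""
    n_features = len(rows[0])
    if weights is None:
        weights = [1.0] * len(rows)
    w = [0.0] * n_features
    b = 0.0
    total = sum(weights)
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y, s in zip(rows, labels, weights):
            score = sum(wi * xi for wi, xi in zip(w, x)) + b
            err = s * (sigmoid(score) - y)
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
            grad_b += err
        w = [wi - lr * (g / total + l2 * wi) for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / total
    return w, b


def predict(model, x):
    # Predicted risk for one feature vector.
    w, b = model
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def make_hospital(rng, n, shift):
    """Hypothetical synthetic site: the effect of feature 2 varies by hospital."""
    rows, labels = [], []
    for _ in range(n):
        x = [rng.gauss(0, 1), rng.gauss(0, 1)]
        p = sigmoid(1.5 * x[0] + shift * x[1] - 1.0)
        rows.append(x)
        labels.append(1 if rng.random() < p else 0)
    return rows, labels


rng = random.Random(0)
target_X, target_y = make_hospital(rng, 50, shift=1.0)    # small target site
source_X, source_y = make_hospital(rng, 400, shift=0.2)   # pooled other sites

# Approach 1: target-only (single-task) training.
target_only = fit_logreg(target_X, target_y)

# Approach 2: source-only training (data from the other hospitals).
source_only = fit_logreg(source_X, source_y)

# Approach 3 (assumed variant): pool all data, upweighting target admissions.
joint_weights = [5.0] * len(target_X) + [1.0] * len(source_X)
joint = fit_logreg(target_X + source_X, target_y + source_y, weights=joint_weights)
```

Each fitted model can then be scored on held-out target-hospital admissions, for example with `predict(joint, x)`, to compare the regimes for a given target site. The weight of 5 is arbitrary here and would normally be chosen by validation on target data.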

RESULTS

The relative performance of the three different approaches was found to be sensitive to the hospital selected as the target. However, incorporating data from all hospitals consistently had the highest performance.

DISCUSSION

The results characterize the challenges and opportunities that come with (1) using data or models from collections of hospitals without adapting them to the site at which the model will be used, and (2) using only local data to build models for small institutions or rare events.

CONCLUSIONS

We show how external data from other hospitals can be successfully and efficiently incorporated into hospital-specific models.
