• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
COMMUTE: Communication-efficient transfer learning for multi-site risk prediction.通勤:面向多站点风险预测的通信高效迁移学习。
J Biomed Inform. 2023 Jan;137:104243. doi: 10.1016/j.jbi.2022.104243. Epub 2022 Nov 18.
2
TARGETING UNDERREPRESENTED POPULATIONS IN PRECISION MEDICINE: A FEDERATED TRANSFER LEARNING APPROACH.精准医学中针对代表性不足人群:一种联邦迁移学习方法。
Ann Appl Stat. 2023 Dec;17(4):2970-2992. doi: 10.1214/23-AOAS1747. Epub 2023 Oct 30.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Federated learning of predictive models from federated Electronic Health Records.从联邦电子健康记录中联合学习预测模型。
Int J Med Inform. 2018 Apr;112:59-67. doi: 10.1016/j.ijmedinf.2018.01.007. Epub 2018 Jan 12.
5
Predicting treatment response in multicenter non-small cell lung cancer patients based on federated learning.基于联邦学习预测多中心非小细胞肺癌患者的治疗反应。
BMC Cancer. 2024 Jun 5;24(1):688. doi: 10.1186/s12885-024-12456-7.
6
Predicting the risk of emergency admission with machine learning: Development and validation using linked electronic health records.使用机器学习预测急诊入院风险:基于电子健康记录的开发和验证。
PLoS Med. 2018 Nov 20;15(11):e1002695. doi: 10.1371/journal.pmed.1002695. eCollection 2018 Nov.
7
Learning from electronic health records across multiple sites: A communication-efficient and privacy-preserving distributed algorithm.从多个站点的电子健康记录中学习:一种通信高效且隐私保护的分布式算法。
J Am Med Inform Assoc. 2020 Mar 1;27(3):376-385. doi: 10.1093/jamia/ocz199.
8
FedSepsis: A Federated Multi-Modal Deep Learning-Based Internet of Medical Things Application for Early Detection of Sepsis from Electronic Health Records Using Raspberry Pi and Jetson Nano Devices.FedSepsis:一种基于联邦多模态深度学习的医疗物联网应用,使用树莓派和 Jetson Nano 设备从电子健康记录中早期检测败血症。
Sensors (Basel). 2023 Jan 14;23(2):970. doi: 10.3390/s23020970.
9
A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals.一种使用低成本微型计算机的二级医疗可扩展联邦学习解决方案:英国医院 COVID-19 筛查测试的隐私保护开发和评估。
Lancet Digit Health. 2024 Feb;6(2):e93-e104. doi: 10.1016/S2589-7500(23)00226-1.
10
Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study.基于电子病历中的屈光数据预测中国学龄儿童近视进展:一项回顾性、多中心机器学习研究。
PLoS Med. 2018 Nov 6;15(11):e1002674. doi: 10.1371/journal.pmed.1002674. eCollection 2018 Nov.

引用本文的文献

1
Bridging Data Gaps in Healthcare: A Scoping Review of Transfer Learning in Structured Data Analysis.弥合医疗保健领域的数据差距:结构化数据分析中迁移学习的范围综述
Health Data Sci. 2025 Sep 3;5:0321. doi: 10.34133/hds.0321. eCollection 2025.
2
Robust angle-based transfer learning in high dimensions.高维空间中基于稳健角度的迁移学习
J R Stat Soc Series B Stat Methodol. 2024 Dec 3;87(3):723-745. doi: 10.1093/jrsssb/qkae111. eCollection 2025 Jul.
3
Doubly Robust Augmented Model Accuracy Transfer Inference with High Dimensional Features.具有高维特征的双稳健增强模型精度转移推断
J Am Stat Assoc. 2025;120(549):524-534. doi: 10.1080/01621459.2024.2356291. Epub 2024 Jun 24.
4
Multi-Task Learning with Summary Statistics.基于汇总统计量的多任务学习
Adv Neural Inf Process Syst. 2023;36:54020-54031. Epub 2024 May 30.
5
Learning across diverse biomedical data modalities and cohorts: Challenges and opportunities for innovation.跨多种生物医学数据模式和队列的学习:创新面临的挑战与机遇
Patterns (N Y). 2024 Jan 17;5(2):100913. doi: 10.1016/j.patter.2023.100913. eCollection 2024 Feb 9.
6
A synthetic data integration framework to leverage external summary-level information from heterogeneous populations.一种综合数据集成框架,用于利用来自异构人群的外部汇总级信息。
Biometrics. 2023 Dec;79(4):3831-3845. doi: 10.1111/biom.13852. Epub 2023 Apr 4.

本文引用的文献

1
Transfer Learning under High-dimensional Generalized Linear Models.高维广义线性模型下的迁移学习
J Am Stat Assoc. 2023;118(544):2684-2697. doi: 10.1080/01621459.2022.2071278. Epub 2022 Jun 27.
2
A Tree-based Model Averaging Approach for Personalized Treatment Effect Estimation from Heterogeneous Data Sources.一种基于树的模型平均方法,用于从异构数据源估计个性化治疗效果。
Proc Mach Learn Res. 2022 Jul;162:21013-21036.
3
Global Biobank Meta-analysis Initiative: Powering genetic discovery across human disease.全球生物样本库荟萃分析计划:推动人类疾病的基因发现
Cell Genom. 2022 Oct 12;2(10):100192. doi: 10.1016/j.xgen.2022.100192.
4
SCEHR: Supervised Contrastive Learning for Clinical Risk Prediction using Electronic Health Records.SCEHR:使用电子健康记录进行临床风险预测的监督对比学习
Proc IEEE Int Conf Data Min. 2021 Dec;2021:857-866. doi: 10.1109/icdm51629.2021.00097.
5
SurvMaximin: Robust federated approach to transporting survival risk prediction models.SurvMaximin:稳健的联邦式方法,用于传输生存风险预测模型。
J Biomed Inform. 2022 Oct;134:104176. doi: 10.1016/j.jbi.2022.104176. Epub 2022 Aug 23.
6
Preparing for the next pandemic via transfer learning from existing diseases with hierarchical multi-modal BERT: a study on COVID-19 outcome prediction.通过使用层次化多模态 BERT 从现有疾病进行迁移学习,为下一次大流行做准备:一项关于 COVID-19 结果预测的研究。
Sci Rep. 2022 Jun 24;12(1):10748. doi: 10.1038/s41598-022-13072-w.
7
Transfer Learning for High-Dimensional Linear Regression: Prediction, Estimation and Minimax Optimality.高维线性回归的迁移学习:预测、估计与极小极大最优性
J R Stat Soc Series B Stat Methodol. 2022 Feb;84(1):149-173. doi: 10.1111/rssb.12479. Epub 2021 Nov 16.
8
Use of the PsycheMERGE Network to Investigate the Association Between Depression Polygenic Scores and White Blood Cell Count.利用 PsycheMERGE 网络研究抑郁多基因评分与白细胞计数之间的关联。
JAMA Psychiatry. 2021 Dec 1;78(12):1365-1374. doi: 10.1001/jamapsychiatry.2021.2959.
9
Differential privacy in health research: A scoping review.健康研究中的差分隐私:范围综述。
J Am Med Inform Assoc. 2021 Sep 18;28(10):2269-2276. doi: 10.1093/jamia/ocab135.
10
A unified framework identifies new links between plasma lipids and diseases from electronic medical records across large-scale cohorts.一个统一的框架从大规模队列的电子病历中识别出血浆脂质与疾病之间的新联系。
Nat Genet. 2021 Jul;53(7):972-981. doi: 10.1038/s41588-021-00879-y. Epub 2021 Jun 17.

通勤:面向多站点风险预测的通信高效迁移学习。

COMMUTE: Communication-efficient transfer learning for multi-site risk prediction.

机构信息

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, United States.

Department of Psychiatry, Harvard Medical School, Boston, MA, United States; Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, United States; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, United States.

出版信息

J Biomed Inform. 2023 Jan;137:104243. doi: 10.1016/j.jbi.2022.104243. Epub 2022 Nov 18.

DOI:10.1016/j.jbi.2022.104243
PMID:36403757
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9868117/
Abstract

OBJECTIVES

We propose a communication-efficient transfer learning approach (COMMUTE) that effectively incorporates multi-site healthcare data for training a risk prediction model in a target population of interest, accounting for challenges including population heterogeneity and data sharing constraints across sites.

METHODS

We first train population-specific source models locally within each site. Using data from a given target population, COMMUTE learns a calibration term for each source model, which adjusts for potential data heterogeneity through flexible distance-based regularizations. In a centralized setting where multi-site data can be directly pooled, all data are combined to train the target model after calibration. When individual-level data are not shareable in some sites, COMMUTE requests only the locally trained models from these sites, with which, COMMUTE generates heterogeneity-adjusted synthetic data for training the target model. We evaluate COMMUTE via extensive simulation studies and an application to multi-site data from the electronic Medical Records and Genomics (eMERGE) Network to predict extreme obesity.

RESULTS

Simulation studies show that COMMUTE outperforms methods without adjusting for population heterogeneity and methods trained in a single population over a broad spectrum of settings. Using eMERGE data, COMMUTE achieves an area under the receiver operating characteristic curve (AUC) around 0.80, which outperforms other benchmark methods with AUC ranging from 0.51 to 0.70.

CONCLUSION

COMMUTE improves the risk prediction in a target population with limited samples and safeguards against negative transfer when some source populations are highly different from the target. In a federated setting, it is highly communication efficient as it only requires each site to share model parameter estimates once, and no iterative communication or higher-order terms are needed.

摘要

目的

我们提出了一种高效的通信转移学习方法(COMMUTE),可以有效地整合多站点医疗保健数据,以在目标人群中训练风险预测模型,同时考虑到包括人群异质性和站点间数据共享限制在内的挑战。

方法

我们首先在每个站点内进行特定人群的本地训练。使用来自给定目标人群的数据,COMMUTE 为每个源模型学习校准项,通过灵活的基于距离的正则化来调整潜在的数据异质性。在可以直接汇总多站点数据的集中设置中,在进行校准后,所有数据都被组合在一起训练目标模型。当某些站点的个体水平数据不可共享时,COMMUTE 仅从这些站点请求本地训练的模型,并使用这些模型生成调整后的异质合成数据来训练目标模型。我们通过广泛的模拟研究和对电子病历和基因组学(eMERGE)网络的多站点数据的应用来评估 COMMUTE,以预测极端肥胖。

结果

模拟研究表明,COMMUTE 在广泛的设置范围内优于不调整人群异质性的方法和在单一人群中训练的方法。使用 eMERGE 数据,COMMUTE 的接收器操作特征曲线下面积(AUC)约为 0.80,优于 AUC 范围在 0.51 到 0.70 之间的其他基准方法。

结论

COMMUTE 可以在样本有限的情况下提高目标人群的风险预测能力,并防止当某些源人群与目标人群高度不同时出现负迁移。在联邦设置中,它的通信效率非常高,因为它只需要每个站点共享一次模型参数估计,而不需要迭代通信或更高阶项。