• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

垂直网格逻辑回归置信区间(VERTIGO-CI)。

VERTIcal Grid lOgistic regression with Confidence Intervals (VERTIGO-CI).

机构信息

University of California San Diego Health System Department of Biomedical Informatics, La Jolla, CA 92130, USA.

These authors contributed equally. Corresponding Author: Lucila Ohno-Machado, MD, MBA, PhD (

出版信息

AMIA Jt Summits Transl Sci Proc. 2021 May 17;2021:355-364. eCollection 2021.

PMID:34457150
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8378611/
Abstract

Federated learning of data from multiple participating parties is getting more attention and has many healthcare applications. We have previously developed VERTIGO, a distributed logistic regression model for vertically partitioned data. The model takes advantage of the linear separation property of kernel matrices of a dual space model to harmonize information in a privacy-preserving manner. However, this method does not handle the variance estimation and only provides point estimates: it cannot report test statistics and associated P-values. In this work, we extend VERTIGO by introducing a novel ring-structure protocol to pass on intermediary statistics among clients and successfully reconstructed the covariance matrix in the dual space. This extension, VERTIGO-CI, is a complete protocol to construct a logistic regression model from vertically partitioned datasets as if it is trained on combined data in a centralized setting. We evaluated our results on synthetic and real data, showing the equivalent accuracy and tolerable performance overhead compared to the centralized version. This novel extension can be applied to other types of generalized linear models that have dual objectives.

摘要

多方参与的联邦学习越来越受到关注,并在医疗保健领域有许多应用。我们之前开发了 VERTIGO,这是一种用于垂直分割数据的分布式逻辑回归模型。该模型利用对偶空间模型核矩阵的线性分离特性,以隐私保护的方式协调信息。然而,这种方法不处理方差估计,只提供点估计:它不能报告检验统计量和相关的 P 值。在这项工作中,我们通过引入一种新的环形结构协议来扩展 VERTIGO,该协议可以在客户端之间传递中间统计信息,并成功重建对偶空间中的协方差矩阵。这个扩展名为 VERTIGO-CI,它是一个完整的协议,可以从垂直分割的数据集中构建逻辑回归模型,就好像它是在集中设置下基于组合数据进行训练的。我们在合成数据和真实数据上评估了我们的结果,与集中版本相比,它具有相当的准确性和可接受的性能开销。这个新的扩展可以应用于具有对偶目标的其他类型的广义线性模型。

相似文献

1
VERTIcal Grid lOgistic regression with Confidence Intervals (VERTIGO-CI).垂直网格逻辑回归置信区间(VERTIGO-CI)。
AMIA Jt Summits Transl Sci Proc. 2021 May 17;2021:355-364. eCollection 2021.
2
VERTIcal Grid lOgistic regression (VERTIGO).垂直网格逻辑回归(VERTIGO)。
J Am Med Inform Assoc. 2016 May;23(3):570-9. doi: 10.1093/jamia/ocv146. Epub 2015 Nov 9.
3
Learning from vertically distributed data across multiple sites: An efficient privacy-preserving algorithm for Cox proportional hazards model with variable selection.从多个站点的垂直分布数据中学习:一种用于具有变量选择的Cox比例风险模型的高效隐私保护算法。
J Biomed Inform. 2024 Jan;149:104581. doi: 10.1016/j.jbi.2023.104581. Epub 2023 Dec 23.
4
Privacy-preserving construction of generalized linear mixed model for biomedical computation.用于生物医学计算的广义线性混合模型的隐私保护构建。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i128-i135. doi: 10.1093/bioinformatics/btaa478.
5
Privacy-preserving logistic regression with secret sharing.基于秘密共享的隐私保护逻辑回归。
BMC Med Inform Decis Mak. 2022 Apr 2;22(1):89. doi: 10.1186/s12911-022-01811-y.
6
Privacy-Preserving Analysis of Distributed Biomedical Data: Designing Efficient and Secure Multiparty Computations Using Distributed Statistical Learning Theory.分布式生物医学数据的隐私保护分析:运用分布式统计学习理论设计高效且安全的多方计算
JMIR Med Inform. 2019 Apr 29;7(2):e12702. doi: 10.2196/12702.
7
Privacy-preserving dataset combination and Lasso regression for healthcare predictions.用于医疗保健预测的隐私保护数据集组合和 Lasso 回归。
BMC Med Inform Decis Mak. 2021 Sep 16;21(1):266. doi: 10.1186/s12911-021-01582-y.
8
Learning From Others Without Sacrificing Privacy: Simulation Comparing Centralized and Federated Machine Learning on Mobile Health Data.从他人身上学习而不牺牲隐私:移动健康数据集中式和联邦机器学习的模拟比较。
JMIR Mhealth Uhealth. 2021 Mar 30;9(3):e23728. doi: 10.2196/23728.
9
Preserving Institutional Privacy in Distributed binary Logistic Regression.在分布式二元逻辑回归中保护机构隐私
AMIA Annu Symp Proc. 2012;2012:1450-8. Epub 2012 Nov 3.
10
A multicenter random forest model for effective prognosis prediction in collaborative clinical research network.多中心随机森林模型在协作临床研究网络中的有效预后预测。
Artif Intell Med. 2020 Mar;103:101814. doi: 10.1016/j.artmed.2020.101814. Epub 2020 Feb 5.

引用本文的文献

1
Distributed cross-learning for equitable federated models - privacy-preserving prediction on data from five California hospitals.用于公平联邦模型的分布式交叉学习——对来自加利福尼亚州五家医院的数据进行隐私保护预测。
Nat Commun. 2025 Feb 5;16(1):1371. doi: 10.1038/s41467-025-56510-9.
2
Recent methodological advances in federated learning for healthcare.医疗保健领域联邦学习的最新方法进展。
Patterns (N Y). 2024 Jun 14;5(6):101006. doi: 10.1016/j.patter.2024.101006.
3
COLLAGENE enables privacy-aware federated and collaborative genomic data analysis.COLLAGENE 实现了隐私感知的联邦和协作基因组数据分析。
Genome Biol. 2023 Sep 11;24(1):204. doi: 10.1186/s13059-023-03039-z.

本文引用的文献

1
Public reactions to direct-to-consumer genetic health tests: A comparison across the US, UK, Japan and Australia.公众对直接面向消费者的基因健康测试的反应:美、英、日、澳的比较。
Eur J Hum Genet. 2020 Mar;28(3):339-348. doi: 10.1038/s41431-019-0529-8. Epub 2019 Oct 23.
2
The association of mannose binding lectin genotype and immune response to Chlamydia pneumoniae: The Strong Heart Study.甘露糖结合凝集素基因型与肺炎衣原体免疫反应的关联:“强壮心脏研究”。
PLoS One. 2019 Jan 10;14(1):e0210640. doi: 10.1371/journal.pone.0210640. eCollection 2019.
3
PCORnet's Collaborative Research Groups.PCORnet合作研究小组。
Patient Relat Outcome Meas. 2018 Feb 9;9:91-95. doi: 10.2147/PROM.S141630. eCollection 2018.
4
PRINCESS: Privacy-protecting Rare disease International Network Collaboration via Encryption through Software guard extensionS.公主:通过软件保护扩展进行加密的隐私保护罕见病国际网络协作。
Bioinformatics. 2017 Mar 15;33(6):871-878. doi: 10.1093/bioinformatics/btw758.
5
VERTIcal Grid lOgistic regression (VERTIGO).垂直网格逻辑回归(VERTIGO)。
J Am Med Inform Assoc. 2016 May;23(3):570-9. doi: 10.1093/jamia/ocv146. Epub 2015 Nov 9.
6
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
7
WebDISCO: a web service for distributed cox model learning without patient-level data sharing.WebDISCO:一种用于分布式Cox模型学习且无需患者级数据共享的网络服务。
J Am Med Inform Assoc. 2015 Nov;22(6):1212-9. doi: 10.1093/jamia/ocv083. Epub 2015 Jul 9.
8
The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge.癌症基因组图谱(TCGA):一个不可估量的知识来源。
Contemp Oncol (Pozn). 2015;19(1A):A68-77. doi: 10.5114/wo.2014.47136.
9
Grid Binary LOgistic REgression (GLORE): building shared models without sharing data.网格二进制逻辑回归(GLORE):在不共享数据的情况下构建共享模型。
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):758-64. doi: 10.1136/amiajnl-2012-000862. Epub 2012 Apr 17.
10
Identification of ADAMTS7 as a novel locus for coronary atherosclerosis and association of ABO with myocardial infarction in the presence of coronary atherosclerosis: two genome-wide association studies.鉴定 ADAMTS7 为冠状动脉粥样硬化的新位点,以及在存在冠状动脉粥样硬化的情况下 ABO 与心肌梗死的关联:两项全基因组关联研究。
Lancet. 2011 Jan 29;377(9763):383-92. doi: 10.1016/S0140-6736(10)61996-4. Epub 2011 Jan 14.