• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在联邦环境中保护神经影像学数据集的隐私质量控制。

Privacy-preserving quality control of neuroimaging datasets in federated environments.

机构信息

Georgia Institute of Technology, Atlanta, Georgia, USA.

Tri-Institutional Center for Translational Research in Neuroimaging and Data Science (TReNDS), Georgia State University, Georgia Institute of Technology, and Emory University, Atlanta, Georgia, USA.

出版信息

Hum Brain Mapp. 2022 May;43(7):2289-2310. doi: 10.1002/hbm.25788. Epub 2022 Mar 4.

DOI:10.1002/hbm.25788
PMID:35243723
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8996357/
Abstract

Privacy concerns for rare disease data, institutional or IRB policies, access to local computational or storage resources or download capabilities are among the reasons that may preclude analyses that pool data to a single site. A growing number of multisite projects and consortia were formed to function in the federated environment to conduct productive research under constraints of this kind. In this scenario, a quality control tool that visualizes decentralized data in its entirety via global aggregation of local computations is especially important, as it would allow the screening of samples that cannot be jointly evaluated otherwise. To solve this issue, we present two algorithms: decentralized data stochastic neighbor embedding, dSNE, and its differentially private counterpart, DP-dSNE. We leverage publicly available datasets to simultaneously map data samples located at different sites according to their similarities. Even though the data never leaves the individual sites, dSNE does not provide any formal privacy guarantees. To overcome that, we rely on differential privacy: a formal mathematical guarantee that protects individuals from being identified as contributors to a dataset. We implement DP-dSNE with AdaCliP, a method recently proposed to add less noise to the gradients per iteration. We introduce metrics for measuring the embedding quality and validate our algorithms on these metrics against their centralized counterpart on two toy datasets. Our validation on six multisite neuroimaging datasets shows promising results for the quality control tasks of visualization and outlier detection, highlighting the potential of our private, decentralized visualization approach.

摘要

隐私问题、机构或 IRB 政策、对本地计算或存储资源或下载功能的访问等原因,可能会阻止将数据汇集到单个站点进行分析。为了在这种约束条件下开展富有成效的研究,越来越多的多站点项目和联盟形成了联邦环境下的协作关系。在这种情况下,通过全局聚合本地计算来可视化分散数据的质量控制工具尤为重要,因为它可以筛选否则无法进行联合评估的样本。为了解决这个问题,我们提出了两种算法:去中心化数据随机近邻嵌入(dSNE)及其差分隐私版本 DP-dSNE。我们利用公开数据集,根据数据样本的相似性,同时对位于不同站点的数据样本进行映射。尽管数据从未离开过各个站点,但 dSNE 并未提供任何正式的隐私保证。为了克服这一问题,我们依赖差分隐私:这是一种正式的数学保证,可以保护个人不被识别为数据集的贡献者。我们使用 AdaCliP 实现 DP-dSNE,这是一种最近提出的方法,可以在每次迭代中减少梯度的噪声。我们引入了度量标准来衡量嵌入质量,并在两个玩具数据集上针对其集中式对应物对我们的算法进行了验证。我们在六个多站点神经影像学数据集上的验证结果表明,可视化和异常值检测的质量控制任务具有良好的效果,突出了我们私有、去中心化可视化方法的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/ddc120dcdb67/HBM-43-2289-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/9df07a12b5e0/HBM-43-2289-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/2d89e3745f18/HBM-43-2289-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/446c45a8f48f/HBM-43-2289-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/cff819204f25/HBM-43-2289-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/0a70b5d99c47/HBM-43-2289-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/a8c82bccce1a/HBM-43-2289-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/d61eb5c7fcbc/HBM-43-2289-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/c19b99f2d2ab/HBM-43-2289-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/a9af47a26f5a/HBM-43-2289-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/ddc120dcdb67/HBM-43-2289-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/9df07a12b5e0/HBM-43-2289-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/2d89e3745f18/HBM-43-2289-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/446c45a8f48f/HBM-43-2289-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/cff819204f25/HBM-43-2289-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/0a70b5d99c47/HBM-43-2289-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/a8c82bccce1a/HBM-43-2289-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/d61eb5c7fcbc/HBM-43-2289-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/c19b99f2d2ab/HBM-43-2289-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/a9af47a26f5a/HBM-43-2289-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48f9/8996357/ddc120dcdb67/HBM-43-2289-g013.jpg

相似文献

1
Privacy-preserving quality control of neuroimaging datasets in federated environments.在联邦环境中保护神经影像学数据集的隐私质量控制。
Hum Brain Mapp. 2022 May;43(7):2289-2310. doi: 10.1002/hbm.25788. Epub 2022 Mar 4.
2
Multi-site fMRI analysis using privacy-preserving federated learning and domain adaptation: ABIDE results.使用隐私保护联邦学习和域适应的多站点功能磁共振成像分析:ABIDE研究结果
Med Image Anal. 2020 Oct;65:101765. doi: 10.1016/j.media.2020.101765. Epub 2020 Jul 2.
3
A Correlated Noise-assisted Decentralized Differentially Private Estimation Protocol, and its application to fMRI Source Separation.一种相关噪声辅助的分布式差分隐私估计协议及其在功能磁共振成像源分离中的应用。
IEEE Trans Signal Process. 2021;69:6355-6370. doi: 10.1109/tsp.2021.3126546. Epub 2021 Nov 11.
4
A federated graph neural network framework for privacy-preserving personalization.联邦图神经网络框架用于保护隐私的个性化服务。
Nat Commun. 2022 Jun 2;13(1):3091. doi: 10.1038/s41467-022-30714-9.
5
Differentially private knowledge transfer for federated learning.联邦学习中的差异化隐私知识转移。
Nat Commun. 2023 Jun 24;14(1):3785. doi: 10.1038/s41467-023-38794-x.
6
FedSPL: federated self-paced learning for privacy-preserving disease diagnosis.FedSPL:用于保护隐私的疾病诊断的联邦自步学习。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab498.
7
Decentralized federated learning through proxy model sharing.通过代理模型共享的去中心化联邦学习。
Nat Commun. 2023 May 22;14(1):2899. doi: 10.1038/s41467-023-38569-4.
8
PPML-Omics: A privacy-preserving federated machine learning method protects patients' privacy in omic data.PPML-Omics:一种保护隐私的联邦机器学习方法,保护了组学数据中患者的隐私。
Sci Adv. 2024 Feb 2;10(5):eadh8601. doi: 10.1126/sciadv.adh8601. Epub 2024 Jan 31.
9
Blockchain and homomorphic encryption based privacy-preserving model aggregation for medical images.基于区块链和同态加密的医疗图像隐私保护模型聚合。
Comput Med Imaging Graph. 2022 Dec;102:102139. doi: 10.1016/j.compmedimag.2022.102139. Epub 2022 Nov 3.
10
Generalized genomic data sharing for differentially private federated learning.用于差分隐私联邦学习的广义基因组数据共享
J Biomed Inform. 2022 Aug;132:104113. doi: 10.1016/j.jbi.2022.104113. Epub 2022 Jun 9.

引用本文的文献

1
From Serendipity to Precision: Integrating AI, Multi-Omics, and Human-Specific Models for Personalized Neuropsychiatric Care.从意外发现到精准医疗:整合人工智能、多组学和人类特异性模型以实现个性化神经精神疾病护理。
Biomedicines. 2025 Jan 12;13(1):167. doi: 10.3390/biomedicines13010167.
2
dcSBM: A federated constrained source-based morphometry approach for multivariate brain structure mapping.dcSBM:一种基于联邦约束源的多变量脑结构映射方法。
Hum Brain Mapp. 2023 Dec 1;44(17):5892-5905. doi: 10.1002/hbm.26483. Epub 2023 Oct 14.

本文引用的文献

1
Visualizing structure and transitions in high-dimensional biological data.高维生物数据中的结构和转变可视化。
Nat Biotechnol. 2019 Dec;37(12):1482-1492. doi: 10.1038/s41587-019-0336-3. Epub 2019 Dec 3.
2
Multi-Institutional Deep Learning Modeling Without Sharing Patient Data: A Feasibility Study on Brain Tumor Segmentation.无需共享患者数据的多机构深度学习建模:脑肿瘤分割的可行性研究
Brainlesion. 2019;11383:92-104. doi: 10.1007/978-3-030-11723-8_9. Epub 2019 Jan 26.
3
Altered static and dynamic functional network connectivity in Alzheimer's disease and subcortical ischemic vascular disease: shared and specific brain connectivity abnormalities.
阿尔茨海默病和皮质下缺血性血管病患者的静息态和动态功能网络连接改变:共享和特定的脑连接异常。
Hum Brain Mapp. 2019 Aug 1;40(11):3203-3221. doi: 10.1002/hbm.24591. Epub 2019 Apr 5.
4
Precision medicine and global mental health.精准医学与全球精神卫生
Lancet Glob Health. 2019 Jan;7(1):e32. doi: 10.1016/S2214-109X(18)30406-6.
5
Visual analysis of mass cytometry data by hierarchical stochastic neighbour embedding reveals rare cell types.分层随机邻居嵌入的质谱细胞术数据分析的可视化揭示了稀有细胞类型。
Nat Commun. 2017 Nov 23;8(1):1740. doi: 10.1038/s41467-017-01689-9.
6
COINSTAC: A Privacy Enabled Model and Prototype for Leveraging and Processing Decentralized Brain Imaging Data.COINSTAC:一种用于利用和处理去中心化脑成像数据的隐私保护模型及原型。
Front Neurosci. 2016 Aug 19;10:365. doi: 10.3389/fnins.2016.00365. eCollection 2016.
7
A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets.一种交互式数据可视化工具:应用于一万多个脑成像和体模磁共振成像数据集。
Front Neuroinform. 2016 Mar 15;10:9. doi: 10.3389/fninf.2016.00009. eCollection 2016.
8
ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data.ViPAR:一个用于研究数据虚拟合并与分析的软件平台。
Int J Epidemiol. 2016 Apr;45(2):408-416. doi: 10.1093/ije/dyv193. Epub 2015 Oct 8.
9
The Pediatric Imaging, Neurocognition, and Genetics (PING) Data Repository.儿科影像、神经认知与遗传学(PING)数据存储库
Neuroimage. 2016 Jan 1;124(Pt B):1149-1154. doi: 10.1016/j.neuroimage.2015.04.057. Epub 2015 May 1.
10
DataSHIELD: taking the analysis to the data, not the data to the analysis.数据护盾:将分析带到数据那里,而不是把数据带到分析这边。
Int J Epidemiol. 2014 Dec;43(6):1929-44. doi: 10.1093/ije/dyu188. Epub 2014 Sep 26.