Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.
Computational Bioscience Research Center, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.
Sci Adv. 2024 Feb 2;10(5):eadh8601. doi: 10.1126/sciadv.adh8601. Epub 2024 Jan 31.
Modern machine learning models trained on omic data raise the threat of privacy leakage for the patients represented in those datasets. Here, we propose a secure and privacy-preserving machine learning method (PPML-Omics) built on a decentralized, differentially private federated learning algorithm. We applied PPML-Omics to data from three sequencing technologies, addressing privacy concerns in three major omic data analysis tasks under three representative deep learning models. We examined privacy breaches in depth through privacy attack experiments and demonstrated that PPML-Omics protects patients' privacy. In each of these applications, PPML-Omics outperformed comparison methods under the same level of privacy guarantee, demonstrating its versatility in simultaneously balancing privacy-preserving capability and utility in omic data analysis. Furthermore, we give a theoretical proof of the privacy-preserving capability of PPML-Omics, making it the first mathematically guaranteed method with robust and generalizable empirical performance in protecting patients' privacy in omic data.
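The core ingredients named above — local training, differential privacy via noisy clipped updates, and decentralized aggregation that unlinks updates from client identities — can be sketched as follows. This is a minimal illustration of the general technique, not the paper's actual algorithm; all function names and parameter values (`clip_norm`, `sigma`) are illustrative assumptions.

```python
import numpy as np

def clip_and_noise(update, clip_norm=1.0, sigma=0.5, rng=None):
    """Gaussian-mechanism step used in differentially private learning:
    bound the update's L2 norm by clipping, then add calibrated noise.
    Parameter values here are illustrative, not from PPML-Omics."""
    rng = rng or np.random.default_rng()
    norm = np.linalg.norm(update)
    clipped = update * min(1.0, clip_norm / max(norm, 1e-12))
    return clipped + rng.normal(0.0, sigma * clip_norm, size=update.shape)

def decentralized_round(client_updates, rng=None):
    """One aggregation round: each client privatizes its own update
    locally; a random shuffle stands in for peer-to-peer exchange, so
    no single party can link an update to a client; the mean of the
    privatized updates becomes the shared model update."""
    rng = rng or np.random.default_rng(0)
    private = [clip_and_noise(u, rng=rng) for u in client_updates]
    rng.shuffle(private)  # decouple update order from client identity
    return np.mean(private, axis=0)

# Toy usage: three clients holding 4-dimensional model updates.
updates = [np.ones(4), 2 * np.ones(4), -np.ones(4)]
global_update = decentralized_round(updates)
print(global_update.shape)  # (4,)
```

Because noise is added before any update leaves a client, privacy holds even if the aggregator is untrusted, which is the property a decentralized design aims for.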