• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

个体基因组数据的基因型和表型的推理攻击与控制。

Inference Attacks and Controls on Genotypes and Phenotypes for Individual Genomic Data.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):930-937. doi: 10.1109/TCBB.2018.2810180. Epub 2018 Feb 27.

DOI:10.1109/TCBB.2018.2810180
PMID:29994587
Abstract

The rapid growth of DNA-sequencing technologies motivates more personalized and predictive genetic-oriented services, which further attract individuals to increasingly release their genome information to learn about personalized medicines, disease predispositions, genetic compatibilities, etc. Individual genome information is notoriously privacy-sensitive and highly associated with relatives. In this paper, we present an inference attack algorithm to predict target genotypes and phenotypes based on belief propagation in factor graphs. With this algorithm, an attacker can effectively predict the target genotypes and phenotypes of target individuals based on genome information shared by individuals or their relatives, and genotype and phenotype association from genome-wide association study (GWAS). To address the privacy threats resulted from such inference attacks, we elaborate the metrics to evaluate data utility and privacy and then present a data sanitization method. We evaluate our inference attack algorithm and data sanitization method on real GWAS dataset: Age-related macular degeneration (AMD) case/control dataset. The evaluation results show that our work can effectively defense against genome threats while guaranteeing data utility.

摘要

DNA 测序技术的快速发展推动了更个性化和更具预测性的遗传导向服务,这进一步吸引个人越来越多地发布他们的基因组信息,以了解个性化药物、疾病易感性、基因相容性等。个体基因组信息是众所周知的隐私敏感信息,并且与亲属高度相关。在本文中,我们提出了一种基于因子图中的置信传播进行预测目标基因型和表型的推理攻击算法。利用该算法,攻击者可以有效地根据个体或其亲属共享的基因组信息以及全基因组关联研究(GWAS)中的基因型和表型关联,预测目标个体的目标基因型和表型。为了解决这种推理攻击带来的隐私威胁,我们详细阐述了评估数据效用和隐私的指标,然后提出了一种数据净化方法。我们在真实的 GWAS 数据集上评估了我们的推理攻击算法和数据净化方法:年龄相关性黄斑变性(AMD)病例/对照数据集。评估结果表明,我们的工作可以在保证数据效用的同时,有效地防御基因组威胁。

相似文献

1
Inference Attacks and Controls on Genotypes and Phenotypes for Individual Genomic Data.个体基因组数据的基因型和表型的推理攻击与控制。
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):930-937. doi: 10.1109/TCBB.2018.2810180. Epub 2018 Feb 27.
2
Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism.基于基因型分布推断全基因组关联研究中的集体效应:对年龄相关性黄斑变性疾病机制的见解
BMC Genomics. 2016 Aug 30;17(1):695. doi: 10.1186/s12864-016-2871-3.
3
Deterministic identification of specific individuals from GWAS results.从全基因组关联研究结果中确定性地识别特定个体。
Bioinformatics. 2015 Jun 1;31(11):1701-7. doi: 10.1093/bioinformatics/btv018. Epub 2015 Jan 27.
4
Using encrypted genotypes and phenotypes for collaborative genomic analyses to maintain data confidentiality.利用加密基因型和表型进行合作基因组分析,以维护数据机密性。
Genetics. 2024 Mar 6;226(3). doi: 10.1093/genetics/iyad210.
5
An Inference Attack on Genomic Data Using Kinship, Complex Correlations, and Phenotype Information.利用亲缘关系、复杂相关性和表型信息对基因组数据进行推理攻击。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Jul-Aug;15(4):1333-1343. doi: 10.1109/TCBB.2017.2709740.
6
Inference attacks against differentially private query results from genomic datasets including dependent tuples.针对包含依赖元组的基因组数据集的差分隐私查询结果的推理攻击。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i136-i145. doi: 10.1093/bioinformatics/btaa475.
7
Genomic prediction using preselected DNA variants from a GWAS with whole-genome sequence data in Holstein-Friesian cattle.在荷斯坦-弗里生奶牛中,利用全基因组序列数据,从全基因组关联研究(GWAS)中预先选择的DNA变异进行基因组预测。
Genet Sel Evol. 2016 Dec 1;48(1):95. doi: 10.1186/s12711-016-0274-1.
8
Impact of imputation methods on the amount of genetic variation captured by a single-nucleotide polymorphism panel in soybeans.插补方法对大豆单核苷酸多态性面板捕获的遗传变异量的影响。
BMC Bioinformatics. 2016 Feb 2;17:55. doi: 10.1186/s12859-016-0899-7.
9
Genomic prediction based on selected variants from imputed whole-genome sequence data in Australian sheep populations.基于澳大利亚绵羊群体中推断的全基因组序列数据中选择的变体的基因组预测。
Genet Sel Evol. 2019 Dec 5;51(1):72. doi: 10.1186/s12711-019-0514-2.
10
Secure count query on encrypted genomic data.加密基因组数据上的安全计数查询。
J Biomed Inform. 2018 May;81:41-52. doi: 10.1016/j.jbi.2018.03.003. Epub 2018 Mar 15.

引用本文的文献

1
Federated privacy-protected meta- and mega-omics data analysis in multi-center studies with a fully open-source analytic platform.在多中心研究中,使用完全开源的分析平台进行联合隐私保护的元组学和宏组学数据分析。
PLoS Comput Biol. 2024 Dec 9;20(12):e1012626. doi: 10.1371/journal.pcbi.1012626. eCollection 2024 Dec.
2
Assessing Privacy Vulnerabilities in Genetic Data Sets: Scoping Review.评估基因数据集的隐私漏洞:范围综述
JMIR Bioinform Biotechnol. 2024 May 27;5:e54332. doi: 10.2196/54332.
3
Assessing transcriptomic reidentification risks using discriminative sequence models.
使用判别序列模型评估转录组再识别风险。
Genome Res. 2023 Jul;33(7):1101-1112. doi: 10.1101/gr.277699.123. Epub 2023 Aug 4.