• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过基因组缺失分析功能基因组信号谱中的敏感信息泄露。

Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions.

机构信息

Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA.

Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA.

出版信息

Nat Commun. 2018 Jun 22;9(1):2453. doi: 10.1038/s41467-018-04875-5.

DOI:10.1038/s41467-018-04875-5
PMID:29934598
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6015012/
Abstract

Functional genomics experiments, such as RNA-seq, provide non-individual specific information about gene expression under different conditions such as disease and normal. There is great desire to share these data. However, privacy concerns often preclude sharing of the raw reads. To enable safe sharing, aggregated summaries such as read-depth signal profiles and levels of gene expression are used. Projects such as GTEx and ENCODE share these because they ostensibly do not leak much identifying information. Here, we attempt to quantify the validity of this statement, measuring the leakage of genomic deletions from signal profiles. We present information theoretic measures for the degree to which one can genotype these deletions. We then develop practical genotyping approaches and demonstrate how to use these to identify an individual within a large cohort in the context of linking attacks. Finally, we present an anonymization method removing much of the leakage from signal profiles.

摘要

功能基因组学实验,如 RNA-seq,提供了在不同条件下(如疾病和正常)基因表达的非个体特异性信息。人们非常希望分享这些数据。然而,隐私问题常常排除了原始读数的共享。为了实现安全共享,通常使用聚合摘要,如读取深度信号谱和基因表达水平。GTEx 和 ENCODE 等项目共享这些数据,因为它们表面上不会泄露太多的识别信息。在这里,我们试图量化这种说法的有效性,衡量信号谱中基因组缺失的泄露程度。我们提出了信息论度量标准,用于衡量一个人对这些缺失进行基因分型的程度。然后,我们开发了实用的基因分型方法,并演示了如何在链接攻击的背景下使用这些方法来识别大队列中的个体。最后,我们提出了一种匿名化方法,从信号谱中去除了大部分泄漏。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/bedd9023b97f/41467_2018_4875_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/d1c0a0b41c80/41467_2018_4875_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/a68c82aeeb73/41467_2018_4875_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/5d2525c77ec3/41467_2018_4875_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/dca85d62c8d8/41467_2018_4875_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/bedd9023b97f/41467_2018_4875_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/d1c0a0b41c80/41467_2018_4875_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/a68c82aeeb73/41467_2018_4875_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/5d2525c77ec3/41467_2018_4875_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/dca85d62c8d8/41467_2018_4875_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/555f/6015012/bedd9023b97f/41467_2018_4875_Fig5_HTML.jpg

相似文献

1
Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions.通过基因组缺失分析功能基因组信号谱中的敏感信息泄露。
Nat Commun. 2018 Jun 22;9(1):2453. doi: 10.1038/s41467-018-04875-5.
2
Data Sanitization to Reduce Private Information Leakage from Functional Genomics.数据清洗以减少功能基因组学中的私人信息泄露。
Cell. 2020 Nov 12;183(4):905-917.e16. doi: 10.1016/j.cell.2020.09.036.
3
Addressing Beacon re-identification attacks: quantification and mitigation of privacy risks.应对信标重新识别攻击:隐私风险的量化与缓解
J Am Med Inform Assoc. 2017 Jul 1;24(4):799-805. doi: 10.1093/jamia/ocw167.
4
RSEQtools: a modular framework to analyze RNA-Seq data using compact, anonymized data summaries.RSEQtools:一个使用简洁、匿名数据摘要分析 RNA-Seq 数据的模块化框架。
Bioinformatics. 2011 Jan 15;27(2):281-3. doi: 10.1093/bioinformatics/btq643. Epub 2010 Dec 5.
5
Functional genomics data: privacy risk assessment and technological mitigation.功能基因组学数据:隐私风险评估与技术缓解措施
Nat Rev Genet. 2022 Apr;23(4):245-258. doi: 10.1038/s41576-021-00428-7. Epub 2021 Nov 10.
6
Privacy preserving data anonymization of spontaneous ADE reporting system dataset.自发不良药物事件报告系统数据集的隐私保护数据匿名化
BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):58. doi: 10.1186/s12911-016-0293-4.
7
Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach.在促进隐私保护的同时扩大对大规模基因组数据的访问:一种博弈论方法。
Am J Hum Genet. 2017 Feb 2;100(2):316-322. doi: 10.1016/j.ajhg.2016.12.002. Epub 2017 Jan 5.
8
Cryptographic Strategy Could Deliver Better Genomic Privacy: Method hides parts of the genome of no specific interest.加密策略可提供更好的基因组隐私保护:该方法可隐藏基因组中无特定研究价值的部分。
Am J Med Genet A. 2017 Dec;173(12):3124-3125. doi: 10.1002/ajmg.a.38546.
9
The evolving privacy and security concerns for genomic data analysis and sharing as observed from the iDASH competition.从 iDASH 竞赛中观察到的基因组数据分析和共享的不断发展的隐私和安全问题。
J Am Med Inform Assoc. 2022 Nov 14;29(12):2182-2190. doi: 10.1093/jamia/ocac165.
10
Utility-Preserving Anonymization in a Real-World Scenario: Evidence from the German Chronic Kidney Disease (GCKD) Study.实用匿名化在真实场景中的应用:来自德国慢性肾脏病(GCKD)研究的证据。
Stud Health Technol Inform. 2023 May 18;302:28-32. doi: 10.3233/SHTI230058.

引用本文的文献

1
Proxy panels enable privacy-aware outsourcing of genotype imputation.代理面板实现了基因型填充的隐私保护外包。
Genome Res. 2025 Feb 14;35(2):326-339. doi: 10.1101/gr.278934.124.
2
Private information leakage from single-cell count matrices.单细胞计数矩阵中的隐私信息泄露。
Cell. 2024 Nov 14;187(23):6537-6549.e10. doi: 10.1016/j.cell.2024.09.012. Epub 2024 Oct 2.
3
FedGMMAT: Federated generalized linear mixed model association tests.FedGMMAT:联邦广义线性混合模型关联测试。

本文引用的文献

1
Comparative Approaches to Genetic Discrimination: Chasing Shadows?比较法视角下的基因歧视问题研究:是在捕风捉影吗?
Trends Genet. 2017 May;33(5):299-302. doi: 10.1016/j.tig.2017.02.002. Epub 2017 Mar 30.
2
Are Data Sharing and Privacy Protection Mutually Exclusive?数据共享和隐私保护是否相互排斥?
Cell. 2016 Nov 17;167(5):1150-1154. doi: 10.1016/j.cell.2016.11.004.
3
A U.S. "Cancer Moonshot" to accelerate cancer research.美国发起“抗癌登月计划”以加速癌症研究。
PLoS Comput Biol. 2024 Jul 24;20(7):e1012142. doi: 10.1371/journal.pcbi.1012142. eCollection 2024 Jul.
4
Assessing Privacy Vulnerabilities in Genetic Data Sets: Scoping Review.评估基因数据集的隐私漏洞:范围综述
JMIR Bioinform Biotechnol. 2024 May 27;5:e54332. doi: 10.2196/54332.
5
Assessing transcriptomic reidentification risks using discriminative sequence models.使用判别序列模型评估转录组再识别风险。
Genome Res. 2023 Jul;33(7):1101-1112. doi: 10.1101/gr.277699.123. Epub 2023 Aug 4.
6
Privacy-aware estimation of relatedness in admixed populations.混合人群中具有隐私意识的亲缘关系估计。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac473.
7
SVAT: Secure outsourcing of variant annotation and genotype aggregation.SVAT:变体注释和基因型聚合的安全外包。
BMC Bioinformatics. 2022 Oct 1;23(1):409. doi: 10.1186/s12859-022-04959-6.
8
"Big Data" Approaches for Prevention of the Metabolic Syndrome.预防代谢综合征的“大数据”方法
Front Genet. 2022 Apr 27;13:810152. doi: 10.3389/fgene.2022.810152. eCollection 2022.
9
Sociotechnical safeguards for genomic data privacy.基因组数据隐私的社会技术保障措施。
Nat Rev Genet. 2022 Jul;23(7):429-445. doi: 10.1038/s41576-022-00455-y. Epub 2022 Mar 4.
10
Privacy-Preserving Artificial Intelligence Techniques in Biomedicine.生物医学中的隐私保护人工智能技术。
Methods Inf Med. 2022 Jun;61(S 01):e12-e27. doi: 10.1055/s-0041-1740630. Epub 2022 Jan 21.
Science. 2016 Sep 9;353(6304):1105-6. doi: 10.1126/science.aai7862. Epub 2016 Sep 7.
4
Quantification of private information leakage from phenotype-genotype data: linking attacks.从表型-基因型数据中量化隐私信息泄露:链接攻击
Nat Methods. 2016 Mar;13(3):251-6. doi: 10.1038/nmeth.3746. Epub 2016 Feb 1.
5
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
6
Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans.人类基因组学。基因型-组织表达(GTEx)试点分析:人类多组织基因调控
Science. 2015 May 8;348(6235):648-60. doi: 10.1126/science.1262110. Epub 2015 May 7.
7
Epigenomics: Roadmap for regulation.表观基因组学:调控路线图
Nature. 2015 Feb 19;518(7539):314-6. doi: 10.1038/518314a.
8
A new initiative on precision medicine.一项关于精准医学的新倡议。
N Engl J Med. 2015 Feb 26;372(9):793-5. doi: 10.1056/NEJMp1500523. Epub 2015 Jan 30.
9
A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping.一份碱基对分辨率的人类基因组三维图谱揭示了染色质环化的原理。
Cell. 2014 Dec 18;159(7):1665-80. doi: 10.1016/j.cell.2014.11.021. Epub 2014 Dec 11.
10
MUSIC: identification of enriched regions in ChIP-Seq experiments using a mappability-corrected multiscale signal processing framework.MUSIC:使用映射性校正的多尺度信号处理框架在ChIP-Seq实验中鉴定富集区域。
Genome Biol. 2014;15(10):474. doi: 10.1186/s13059-014-0474-3.