• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Participant identification in genetic association studies: improved methods and practical implications.遗传关联研究中的参与者识别:改进方法及实际意义。
Int J Epidemiol. 2011 Dec;40(6):1629-42. doi: 10.1093/ije/dyr149.
2
On inferring presence of an individual in a mixture: a Bayesian approach.基于贝叶斯方法推断混合物中个体的存在。
Biostatistics. 2010 Oct;11(4):661-73. doi: 10.1093/biostatistics/kxq035. Epub 2010 Jun 3.
3
Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays.使用高密度单核苷酸多态性(SNP)基因分型微阵列解析对高度复杂混合物贡献微量DNA的个体。
PLoS Genet. 2008 Aug 29;4(8):e1000167. doi: 10.1371/journal.pgen.1000167.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Scalable privacy-preserving data sharing methodology for genome-wide association studies.用于全基因组关联研究的可扩展隐私保护数据共享方法
J Biomed Inform. 2014 Aug;50:133-41. doi: 10.1016/j.jbi.2014.01.008. Epub 2014 Feb 6.
6
SecureMA: protecting participant privacy in genetic association meta-analysis.SecureMA:在基因关联荟萃分析中保护参与者隐私
Bioinformatics. 2014 Dec 1;30(23):3334-41. doi: 10.1093/bioinformatics/btu561. Epub 2014 Aug 21.
7
Genome-wide association study meta-analysis identifies three novel loci for circulating anti-Müllerian hormone levels in women.全基因组关联研究荟萃分析确定了女性循环抗苗勒管激素水平的三个新基因座。
Hum Reprod. 2022 May 3;37(5):1069-1082. doi: 10.1093/humrep/deac028.
8
QTL mapping using high-throughput sequencing.利用高通量测序进行数量性状基因座定位。
Methods Mol Biol. 2015;1284:257-85. doi: 10.1007/978-1-4939-2444-8_13.
9
A polymorphism in the promoter of FRAS1 is a candidate SNP associated with metastatic prostate cancer.FRAS1 启动子中的一个多态性是与转移性前列腺癌相关的候选 SNP。
Prostate. 2021 Jul;81(10):683-693. doi: 10.1002/pros.24148. Epub 2021 May 6.
10
Protecting Genomic Data Privacy with Probabilistic Modeling.用概率模型保护基因组数据隐私
Pac Symp Biocomput. 2019;24:403-414.

引用本文的文献

1
Assessing Privacy Vulnerabilities in Genetic Data Sets: Scoping Review.评估基因数据集的隐私漏洞:范围综述
JMIR Bioinform Biotechnol. 2024 May 27;5:e54332. doi: 10.2196/54332.
2
Privacy and ethical challenges in next-generation sequencing.下一代测序中的隐私和伦理挑战。
Expert Rev Precis Med Drug Dev. 2019;4(2):95-104. doi: 10.1080/23808993.2019.1599685. Epub 2019 Apr 8.
3
Bayesian Network Construction and Genotype-Phenotype Inference Using GWAS Statistics.基于 GWAS 统计数据的贝叶斯网络构建和基因型-表型推断。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Mar-Apr;16(2):475-489. doi: 10.1109/TCBB.2017.2779498. Epub 2017 Dec 4.
4
Better governance, better access: practising responsible data sharing in the METADAC governance infrastructure.更好的治理,更好的获取:在 METADAC 治理基础设施中实践负责任的数据共享。
Hum Genomics. 2018 Apr 26;12(1):24. doi: 10.1186/s40246-018-0154-6.
5
Sharing extended summary data from contemporary genetics studies is unlikely to threaten subject privacy.分享当代遗传学研究的扩展摘要数据不太可能威胁到受试者的隐私。
PLoS One. 2017 Jun 29;12(6):e0179504. doi: 10.1371/journal.pone.0179504. eCollection 2017.
6
Gene and Network Analysis of Common Variants Reveals Novel Associations in Multiple Complex Diseases.常见变异的基因与网络分析揭示了多种复杂疾病中的新关联。
Genetics. 2016 Oct;204(2):783-798. doi: 10.1534/genetics.116.188391. Epub 2016 Aug 3.
7
Addressing Benefits, Risks and Consent in Next Generation Sequencing Studies.下一代测序研究中的益处、风险与知情同意
J Clin Res Bioeth. 2015 Dec;6(6). doi: 10.4172/2155-9627.1000249. Epub 2015 Dec 14.
8
Privacy in the Genomic Era.基因组时代的隐私问题。
ACM Comput Surv. 2015 Sep;48(1). doi: 10.1145/2767007.
9
Privacy-Preserving Data Sharing for Genome-Wide Association Studies.用于全基因组关联研究的隐私保护数据共享
J Priv Confid. 2013;5(1):137-166.
10
Informants a potential threat to confidentiality in small studies.在小型研究中,信息提供者对保密性构成潜在威胁。
Med Health Care Philos. 2015 Feb;18(1):149-52. doi: 10.1007/s11019-014-9579-4.

本文引用的文献

1
Complex mixtures: a critical examination of a paper by Homer et al.复杂混合物:对 Homer 等人论文的批判性审视
Forensic Sci Int Genet. 2012 Jan;6(1):64-9. doi: 10.1016/j.fsigen.2011.02.003. Epub 2011 Mar 22.
2
On inferring presence of an individual in a mixture: a Bayesian approach.基于贝叶斯方法推断混合物中个体的存在。
Biostatistics. 2010 Oct;11(4):661-73. doi: 10.1093/biostatistics/kxq035. Epub 2010 Jun 3.
3
Potential for revealing individual-level information in genome-wide association studies.全基因组关联研究中揭示个体水平信息的潜力。
JAMA. 2010 Feb 17;303(7):659-60. doi: 10.1001/jama.2010.120.
4
A new statistic and its power to infer membership in a genome-wide association study using genotype frequencies.一种新的统计量及其在全基因组关联研究中利用基因型频率推断成员关系的能力。
Nat Genet. 2009 Nov;41(11):1253-7. doi: 10.1038/ng.455. Epub 2009 Oct 4.
5
Identifying individuals in a complex mixture of DNA with unknown ancestry.在祖先未知的DNA复杂混合物中识别个体。
Stat Appl Genet Mol Biol. 2009;8(1):Article 37. doi: 10.2202/1544-6115.1469. Epub 2009 Sep 9.
6
Needles in the haystack: identifying individuals present in pooled genomic data.大海捞针:识别混合基因组数据中的个体
PLoS Genet. 2009 Oct;5(10):e1000668. doi: 10.1371/journal.pgen.1000668. Epub 2009 Oct 2.
7
Public access to genome-wide data: five views on balancing research with privacy and protection.公众对全基因组数据的访问:关于平衡研究与隐私及保护的五种观点。
PLoS Genet. 2009 Oct;5(10):e1000665. doi: 10.1371/journal.pgen.1000665. Epub 2009 Oct 2.
8
The limits of individual identification from sample allele frequencies: theory and statistical analysis.根据样本等位基因频率进行个体识别的局限性:理论与统计分析
PLoS Genet. 2009 Oct;5(10):e1000628. doi: 10.1371/journal.pgen.1000628. Epub 2009 Oct 2.
9
Genomic privacy and limits of individual detection in a pool.基因组隐私与混合样本中个体检测的局限性
Nat Genet. 2009 Sep;41(9):965-7. doi: 10.1038/ng.436. Epub 2009 Aug 23.
10
Data sharing in genomics--re-shaping scientific practice.基因组学中的数据共享——重塑科学实践。
Nat Rev Genet. 2009 May;10(5):331-5. doi: 10.1038/nrg2573.

遗传关联研究中的参与者识别:改进方法及实际意义。

Participant identification in genetic association studies: improved methods and practical implications.

机构信息

Department of Health Sciences, University of Leicester, Leicester, UK.

出版信息

Int J Epidemiol. 2011 Dec;40(6):1629-42. doi: 10.1093/ije/dyr149.

DOI:10.1093/ije/dyr149
PMID:22158671
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3235023/
Abstract

BACKGROUND

In a recent paper by Homer et al. (Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays. PLoS Genet 2008;4:e1000167), a method for detecting whether a given individual is a contributor to a particular genomic mixture was proposed. This prompted grave concern about the public dissemination of aggregate statistics from genome-wide association studies. It is of clear scientific importance that such data be shared widely, but the confidentiality of study participants must not be compromised. The issue of what summary genomic data can safely be posted on the web is only addressed satisfactorily when the theoretical underpinnings of the proposed method are clarified and its performance evaluated in terms of dependence on underlying assumptions.

METHODS

The original method raised a number of concerns and several alternatives have since been proposed, including a simple linear regression approach. In our proposed generalized estimating equation approach, we maintain the simplicity of the linear regression model but obtain inferences that are more robust to approximation of the variance/covariance structure and can accommodate linkage disequilibrium.

RESULTS

We affirm that, in principle, it is possible to determine that a 'candidate' individual has participated in a study, given a subset of aggregate statistics from that study. However, the methods depend critically on a number of key factors including: the ancestry of participants in the study; the absolute and relative numbers of cases and controls; and the number of single nucleotide polymorphisms.

CONCLUSIONS

Simple guidelines for publication that are based on a single criterion are therefore unlikely to suffice. In particular, 'directed' summary statistics should not be posted openly on the web but could be protected by an internet-based access check as proposed by the P3G_Consortium et al. (Public access to genome-wide data: five views on balancing research with privacy and protection. PLoS Genet 2009;5:e1000665).

摘要

背景

在 Homer 等人最近的一篇论文中(使用高密度 SNP 基因分型微阵列解决痕量 DNA 对高度复杂混合物的个体贡献问题。PLoS Genet 2008;4:e1000167),提出了一种检测特定个体是否为特定基因组混合物贡献者的方法。这引发了人们对全基因组关联研究汇总统计数据公开传播的严重关注。显然,这些数据需要广泛共享,但研究参与者的机密性不得受到损害。只有当提出的方法的理论基础得到澄清,并根据其对基本假设的依赖性来评估其性能时,才能满意地解决可安全发布到网络上的摘要基因组数据的问题。

方法

原始方法引起了一些关注,此后提出了几种替代方法,包括简单的线性回归方法。在我们提出的广义估计方程方法中,我们保持线性回归模型的简单性,但获得的推断结果更能抵抗方差/协方差结构的近似,并且可以适应连锁不平衡。

结果

我们确认,原则上,给定研究的汇总统计数据的一个子集,就有可能确定一个“候选”个体是否参与了该研究。然而,这些方法严重依赖于一些关键因素,包括:研究参与者的祖源;病例和对照的绝对和相对数量;以及单核苷酸多态性的数量。

结论

因此,基于单一标准的简单发布指南不太可能足够。特别是,“定向”汇总统计数据不应公开发布到网络上,但可以通过基于互联网的访问检查来保护,正如 P3G_Consortium 等人所提出的(Public access to genome-wide data: five views on balancing research with privacy and protection. PLoS Genet 2009;5:e1000665)。