Participant identification in genetic association studies: improved methods and practical implications.

Affiliation

Department of Health Sciences, University of Leicester, Leicester, UK.

Publication information

Int J Epidemiol. 2011 Dec;40(6):1629-42. doi: 10.1093/ije/dyr149.

Abstract

BACKGROUND

In a recent paper by Homer et al. (Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays. PLoS Genet 2008;4:e1000167), a method for detecting whether a given individual is a contributor to a particular genomic mixture was proposed. This prompted grave concern about the public dissemination of aggregate statistics from genome-wide association studies. It is of clear scientific importance that such data be shared widely, but the confidentiality of study participants must not be compromised. The issue of what summary genomic data can safely be posted on the web is only addressed satisfactorily when the theoretical underpinnings of the proposed method are clarified and its performance evaluated in terms of dependence on underlying assumptions.
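For orientation, the distance statistic from Homer et al. can be sketched as below. For each SNP j it compares how far the candidate's allele frequency Y_j lies from a reference population frequency Pop_j and from the mixture frequency M_j, using D_j = |Y_j - Pop_j| - |Y_j - M_j|, and then applies a one-sample t statistic to the mean of D_j. The simulated genotypes, the sample sizes, and the names `distance_statistic` and `draw_people` are illustrative assumptions, not taken from the paper. The SNPs are simulated as independent and drawn from a single population, which is exactly the kind of assumption the abstract says the method's performance depends on.

```python
import numpy as np

def distance_statistic(candidate, pool_freq, ref_freq):
    """Return per-SNP distances D_j and a one-sample t statistic for mean(D) = 0."""
    y = np.asarray(candidate, dtype=float)
    # Positive D_j means the candidate is closer to the pool than to the reference.
    d = np.abs(y - ref_freq) - np.abs(y - pool_freq)
    t = d.mean() / (d.std(ddof=1) / np.sqrt(d.size))
    return d, t

# Toy data (all values hypothetical): independent SNPs, one homogeneous population.
rng = np.random.default_rng(0)
n_snps, n_pool, n_ref = 20_000, 100, 100
maf = rng.uniform(0.05, 0.5, size=n_snps)

def draw_people(n):
    # Genotypes coded as per-person allele frequencies 0, 0.5 or 1 at each SNP.
    return rng.binomial(2, maf, size=(n, n_snps)) / 2.0

pool = draw_people(n_pool)        # study participants (the "mixture")
reference = draw_people(n_ref)    # independent reference sample
outsider = draw_people(1)[0]      # a person in neither sample

pool_freq = pool.mean(axis=0)
ref_freq = reference.mean(axis=0)

d_member, t_member = distance_statistic(pool[0], pool_freq, ref_freq)
d_outsider, t_outsider = distance_statistic(outsider, pool_freq, ref_freq)
print(f"member t = {t_member:.2f}, outsider t = {t_outsider:.2f}")
```

In this idealized setting the participant's statistic should be clearly positive while the outsider's should look like a draw from a standard normal distribution. Real data with population structure or linkage disequilibrium need not behave this way.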

METHODS

The original method raised a number of concerns and several alternatives have since been proposed, including a simple linear regression approach. In our proposed generalized estimating equation approach, we maintain the simplicity of the linear regression model but obtain inferences that are more robust to approximation of the variance/covariance structure and can accommodate linkage disequilibrium.
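A minimal sketch of the regression idea follows; it is not the authors' implementation. It regresses the pool's deviation from the reference frequency on the candidate's deviation, so the slope is expected to be about 1/n for a participant in a pool of n people and about 0 for a non-participant. As a rough stand-in for the generalized estimating equation approach, it also reports a block-wise sandwich standard error, which is what a GEE with an identity link and an independence working correlation would give when blocks of neighbouring SNPs are treated as clusters. The function name `regression_test`, the `block_size` parameter, the simulated data, and the assumption that reference frequencies are known exactly are all hypothetical choices made for illustration.

```python
import numpy as np

def regression_test(candidate, pool_freq, ref_freq, block_size=1):
    """OLS slope of pool deviation on candidate deviation, with naive and
    block-robust (sandwich) standard errors."""
    x = np.asarray(candidate, dtype=float) - ref_freq
    y = np.asarray(pool_freq, dtype=float) - ref_freq
    xc, yc = x - x.mean(), y - y.mean()
    sxx = np.sum(xc ** 2)
    slope = np.sum(xc * yc) / sxx
    resid = yc - slope * xc
    # Naive SE: assumes independent, equal-variance residuals across SNPs.
    naive_se = np.sqrt(np.sum(resid ** 2) / (x.size - 2) / sxx)
    # Sandwich SE: sum the score contributions within each block of
    # consecutive SNPs, so correlation inside a block is tolerated.
    block_ids = np.arange(x.size) // block_size
    block_scores = np.bincount(block_ids, weights=xc * resid)
    robust_se = np.sqrt(np.sum(block_scores ** 2)) / sxx
    return slope, naive_se, robust_se

# Toy data (hypothetical): independent SNPs, reference frequencies known exactly.
rng = np.random.default_rng(1)
n_snps, n_pool = 20_000, 100
maf = rng.uniform(0.05, 0.5, size=n_snps)
pool = rng.binomial(2, maf, size=(n_pool, n_snps)) / 2.0
outsider = rng.binomial(2, maf, size=n_snps) / 2.0
pool_freq = pool.mean(axis=0)

slope_in, naive_in, robust_in = regression_test(pool[0], pool_freq, maf, block_size=50)
slope_out, naive_out, robust_out = regression_test(outsider, pool_freq, maf, block_size=50)
print(f"participant: slope = {slope_in:.4f}, z = {slope_in / robust_in:.1f}")
print(f"outsider:    slope = {slope_out:.4f}, z = {slope_out / robust_out:.1f}")
```

Because the simulated SNPs are independent, the naive and robust standard errors should be close here; the robust version matters when neighbouring SNPs are correlated or the residual variance differs across SNPs.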

RESULTS

We affirm that, in principle, it is possible to determine that a 'candidate' individual has participated in a study, given a subset of aggregate statistics from that study. However, the methods depend critically on a number of key factors including: the ancestry of participants in the study; the absolute and relative numbers of cases and controls; and the number of single nucleotide polymorphisms.

CONCLUSIONS

Simple guidelines for publication that are based on a single criterion are therefore unlikely to suffice. In particular, 'directed' summary statistics should not be posted openly on the web but could be protected by an internet-based access check as proposed by the P3G Consortium et al. (Public access to genome-wide data: five views on balancing research with privacy and protection. PLoS Genet 2009;5:e1000665).
