Suppr超能文献

全基因组关联研究中的稳健关系推断。

Robust relationship inference in genome-wide association studies.

机构信息

Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA.

出版信息

Bioinformatics. 2010 Nov 15;26(22):2867-73. doi: 10.1093/bioinformatics/btq559. Epub 2010 Oct 5.

Abstract

MOTIVATION

Genome-wide association studies (GWASs) have been widely used to map loci contributing to variation in complex traits and risk of diseases in humans. Accurate specification of familial relationships is crucial for family-based GWAS, as well as in population-based GWAS with unknown (or unrecognized) family structure. The family structure in a GWAS should be routinely investigated using the SNP data prior to the analysis of population structure or phenotype. Existing algorithms for relationship inference have a major weakness of estimating allele frequencies at each SNP from the entire sample, under a strong assumption of homogeneous population structure. This assumption is often untenable.

RESULTS

Here, we present a rapid algorithm for relationship inference using high-throughput genotype data typical of GWAS that allows the presence of unknown population substructure. The relationship of any pair of individuals can be precisely inferred by robust estimation of their kinship coefficient, independent of sample composition or population structure (sample invariance). We present simulation experiments to demonstrate that the algorithm has sufficient power to provide reliable inference on millions of unrelated pairs and thousands of relative pairs (up to 3rd-degree relationships). Application of our robust algorithm to HapMap and GWAS datasets demonstrates that it performs properly even under extreme population stratification, while algorithms assuming a homogeneous population give systematically biased results. Our extremely efficient implementation performs relationship inference on millions of pairs of individuals in a matter of minutes, dozens of times faster than the most efficient existing algorithm known to us.

AVAILABILITY

Our robust relationship inference algorithm is implemented in a freely available software package, KING, available for download at http://people.virginia.edu/∼wc9c/KING.

摘要

动机

全基因组关联研究(GWAS)已被广泛用于绘制与人类复杂性状和疾病风险相关的基因座。在基于家族的 GWAS 中,以及在基于人群的 GWAS 中(具有未知(或未被识别)的家族结构),准确指定家族关系至关重要。在分析群体结构或表型之前,应使用 SNP 数据常规调查 GWAS 中的家族结构。用于推断关系的现有算法存在一个主要弱点,即根据群体结构均匀的强烈假设,从整个样本中估算每个 SNP 的等位基因频率。这种假设通常是站不住脚的。

结果

在这里,我们提出了一种使用典型 GWAS 的高通量基因型数据进行关系推断的快速算法,该算法允许存在未知的群体亚结构。通过稳健估计其亲缘系数,可以精确推断任何两个人之间的关系,而与样本组成或群体结构无关(样本不变)。我们进行了模拟实验,以证明该算法具有足够的能力为数百万个无关对和数千个相对对(最多 3 级关系)提供可靠的推断。我们稳健算法在 HapMap 和 GWAS 数据集上的应用表明,即使在极端群体分层的情况下,它也能正常运行,而假设群体均匀的算法会给出系统偏置的结果。我们极其高效的实现可以在几分钟内对数百万对个体进行关系推断,比我们所知的最快的现有算法快几十倍。

可用性

我们的稳健关系推断算法已在一个免费提供的软件包 KING 中实现,可在 http://people.virginia.edu/∼wc9c/KING 下载。

相似文献

1
Robust relationship inference in genome-wide association studies.全基因组关联研究中的稳健关系推断。
Bioinformatics. 2010 Nov 15;26(22):2867-73. doi: 10.1093/bioinformatics/btq559. Epub 2010 Oct 5.
3
Estimating kinship in admixed populations.估算混合人群中的亲属关系。
Am J Hum Genet. 2012 Jul 13;91(1):122-38. doi: 10.1016/j.ajhg.2012.05.024. Epub 2012 Jun 28.
6
Identifying Cryptic Relationships.识别隐藏关系。
Methods Mol Biol. 2017;1666:45-60. doi: 10.1007/978-1-4939-7274-6_4.

引用本文的文献

本文引用的文献

7

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验