Correlation-based inference for linkage disequilibrium with multiple alleles.

Authors

Zaykin Dmitri V, Pudovkin Alexander, Weir Bruce S

Affiliation

National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina 27709, USA.

Publication details

Genetics. 2008 Sep;180(1):533-45. doi: 10.1534/genetics.108.089409. Epub 2008 Aug 30.

DOI: 10.1534/genetics.108.089409

PMID: 18757931

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2535703/

Abstract

The correlation between alleles at a pair of genetic loci is a measure of linkage disequilibrium. The square of the sample correlation multiplied by sample size provides the usual test statistic for the hypothesis of no disequilibrium for loci with two alleles, and this relation has proved useful for study design and marker selection. Nevertheless, this relation holds only in a diallelic case, and an extension to multiple alleles has not been made. Here we introduce a similar statistic, \(R^2\), which leads to a correlation-based test for loci with multiple alleles: for a pair of loci with \(k\) and \(m\) alleles, and a sample of \(n\) individuals, the approximate distribution of \(n \frac{(k-1)(m-1)}{km} R^2\) under independence between loci is \(\chi^2_{(k-1)(m-1)}\). One advantage of this statistic is that it can be interpreted as the total correlation between a pair of loci. When the phase of two-locus genotypes is known, the approach is equivalent to a test for the overall correlation between rows and columns in a contingency table. In the phase-known case, \(R^2\) is the sum of the squared sample correlations for all \(km\) \(2 \times 2\) subtables formed by collapsing to one allele vs. the rest at each locus. We examine the approximate distribution under the null of independence for \(R^2\) and report its close agreement with the exact distribution obtained by permutation. The test for independence using \(R^2\) is a strong competitor to approaches such as Pearson's chi-square, Fisher's exact test, and a test based on Cressie and Read's power divergence statistic. We combine this approach with our previous composite-disequilibrium measures to address the case when the genotypic phase is unknown. Calculation of the new multiallele test statistic and its P-value is very simple and utilizes the approximate distribution of \(R^2\). We provide a computer program that evaluates approximate as well as "exact" permutational P-values.
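As a minimal sketch of the phase-known calculation described in the abstract, the Python below computes \(R^2\) as the sum of squared correlations over all \(km\) one-vs-rest \(2 \times 2\) subtables of a haplotype count table, scales it by \(n(k-1)(m-1)/(km)\), and looks up an approximate chi-square P-value. The function names `r2_statistic` and `chi2_sf` and the example counts are illustrative assumptions, not the authors' program. The sketch also assumes that \(n\) is the total count in the table and that every allele is observed but not fixed.

```python
import math

def r2_statistic(table):
    """Return (R2, T, df) for a k x m table of haplotype counts.

    Assumption: n is the total count in the table and every row and
    column allele has a frequency strictly between 0 and 1.
    """
    k, m = len(table), len(table[0])
    n = sum(sum(row) for row in table)
    p = [sum(row) / n for row in table]                       # allele freqs, locus 1
    q = [sum(row[j] for row in table) / n for j in range(m)]  # allele freqs, locus 2
    if any(f <= 0 or f >= 1 for f in p + q):
        raise ValueError("each allele frequency must lie strictly between 0 and 1")
    r2_total = 0.0
    for i in range(k):
        for j in range(m):
            # Collapse to allele i vs. rest and allele j vs. rest.
            d = table[i][j] / n - p[i] * q[j]
            r2_total += d * d / (p[i] * (1 - p[i]) * q[j] * (1 - q[j]))
    t = n * (k - 1) * (m - 1) / (k * m) * r2_total
    return r2_total, t, (k - 1) * (m - 1)

def chi2_sf(x, df):
    """Upper tail of the chi-square distribution for integer df >= 1."""
    if x <= 0:
        return 1.0
    h = x / 2.0
    if df % 2 == 0:
        a, tail = 1.0, math.exp(-h)
    else:
        a, tail = 0.5, math.erfc(math.sqrt(h))
    # Recurrence Q(a + 1, h) = Q(a, h) + h**a * exp(-h) / Gamma(a + 1).
    while a < df / 2.0:
        tail += h ** a * math.exp(-h) / math.gamma(a + 1)
        a += 1.0
    return tail

# Hypothetical 2 x 2 haplotype counts; here T reduces to n * r^2.
r2_total, t, df = r2_statistic([[30, 10], [10, 30]])
p_value = chi2_sf(t, df)
```

In the diallelic case all four subtables give the same squared correlation \(r^2\), so \(R^2 = 4r^2\) and the scaled statistic equals \(n r^2\), the usual test statistic mentioned at the start of the abstract.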

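The permutational P-value mentioned in the abstract can be sketched along the following lines for phase-known data. This is an illustration under stated assumptions, not the authors' program: the names `r2_from_pairs` and `permutation_p_value`, the allele labels, the number of permutations, and the `(hits + 1) / (n_perm + 1)` Monte Carlo estimate are all choices made here. Because shuffling one locus keeps both sets of allele frequencies fixed, ranking permutations by \(R^2\) is the same as ranking them by the scaled statistic.

```python
import random

def r2_from_pairs(a, b):
    """R^2 summed over all one-vs-rest 2 x 2 subtables of paired alleles.

    a[i] and b[i] are the alleles at the two loci on haplotype i.
    Assumption: both loci are polymorphic in the sample.
    """
    n = len(a)
    alleles_a, alleles_b = sorted(set(a)), sorted(set(b))
    if len(alleles_a) < 2 or len(alleles_b) < 2:
        raise ValueError("both loci must carry at least two alleles")
    counts = {}
    for x, y in zip(a, b):
        counts[(x, y)] = counts.get((x, y), 0) + 1
    p = {x: a.count(x) / n for x in alleles_a}
    q = {y: b.count(y) / n for y in alleles_b}
    total = 0.0
    for x in alleles_a:
        for y in alleles_b:
            d = counts.get((x, y), 0) / n - p[x] * q[y]
            total += d * d / (p[x] * (1 - p[x]) * q[y] * (1 - q[y]))
    return total

def permutation_p_value(a, b, n_perm=999, seed=0):
    """Monte Carlo permutation P-value for independence between loci."""
    rng = random.Random(seed)
    observed = r2_from_pairs(a, b)
    shuffled = list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break the pairing between the two loci
        # Small tolerance so floating-point ties count as "at least as extreme".
        if r2_from_pairs(a, shuffled) >= observed - 1e-12:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Hypothetical sample of 80 haplotypes with strong association.
locus_a = ["A"] * 40 + ["a"] * 40
locus_b = ["B"] * 30 + ["b"] * 10 + ["B"] * 10 + ["b"] * 30
p_perm = permutation_p_value(locus_a, locus_b)
```

The smallest value this estimate can return is `1 / (n_perm + 1)`, so more permutations are needed when very small P-values must be resolved.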
