Mathematical properties of the r^2 measure of linkage disequilibrium.

Authors

VanLiere Jenna M, Rosenberg Noah A

Affiliation

Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109, USA.

Publication information

Theor Popul Biol. 2008 Aug;74(1):130-7. doi: 10.1016/j.tpb.2008.05.006. Epub 2008 Jun 1.

Abstract

Statistics for linkage disequilibrium (LD), the non-random association of alleles at two loci, depend on the frequencies of the alleles at the loci under consideration. Here, we examine the r^2 measure of LD and its mathematical relationship to allele frequencies, quantifying the constraints on its maximum value. Assuming independent uniform distributions for the allele frequencies of two biallelic loci, we find that the mean maximum value of r^2 is approximately 0.43051, and that r^2 can exceed a threshold of 4/5 in only approximately 14.232% of the allele frequency space. If one locus is assumed to have known allele frequencies (the situation in an association study in which LD between a known marker locus and an unknown trait locus is of interest), we find that the mean maximum value of r^2 is greatest when the known locus has a minor allele frequency of approximately 0.30131. We find that in 1/4 of the space of allowed values of minor allele frequencies and haplotype frequencies at a pair of loci, the unconstrained maximum r^2 allowing for the possibility of recombination between the loci exceeds the constrained maximum assuming that no recombination has occurred. Finally, we use r^2_max to examine the connection between r^2 and the D' measure of linkage disequilibrium, finding that r^2/r^2_max = D'^2 for approximately 72.683% of the space of allowed values of (p_a, p_b, p_ab). Our results concerning the properties of r^2 have the potential to inform the interpretation of unusual LD behavior and to assist in the design of LD-based association-mapping studies.
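The first three numerical claims in the abstract can be checked approximately with a short numerical sketch. The code below is an illustration, not the authors' derivation. It assumes the standard definitions D = p_ab - p_a p_b and r^2 = D^2 / (p_a (1 - p_a) p_b (1 - p_b)), and it bounds D by requiring all four haplotype frequencies to be non-negative. The function names, the midpoint grids, and the grid sizes are hypothetical choices made for this example.

```python
def r2_max(p_a, p_b):
    """Largest attainable r^2 for allele frequencies p_a, p_b in (0, 1)."""
    # D = p_ab - p_a*p_b is bounded because every haplotype frequency is >= 0.
    d_low = max(-p_a * p_b, -(1 - p_a) * (1 - p_b))
    d_high = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    d_sq = max(d_low ** 2, d_high ** 2)
    return d_sq / (p_a * (1 - p_a) * p_b * (1 - p_b))


def midpoints(n):
    """n evenly spaced cell midpoints in (0, 1); never exactly 0 or 1."""
    return [(i + 0.5) / n for i in range(n)]


def mean_and_fraction(n=400, threshold=0.8):
    """Midpoint-rule estimates over independent uniform p_a and p_b.

    Returns (mean of r2_max, share of the grid where r2_max > threshold).
    """
    grid = midpoints(n)
    values = [r2_max(a, b) for a in grid for b in grid]
    mean = sum(values) / len(values)
    fraction = sum(v > threshold for v in values) / len(values)
    return mean, fraction


def mean_r2_max_given(p_a, n=1000):
    """Mean of r2_max over uniform p_b when the frequency p_a is known."""
    return sum(r2_max(p_a, b) for b in midpoints(n)) / n


def best_known_maf(step=0.001, n=1000):
    """Scan minor allele frequencies in (0, 0.5] for the largest mean r2_max."""
    count = int(round(0.5 / step))
    mafs = [step * i for i in range(1, count + 1)]
    return max(mafs, key=lambda p: mean_r2_max_given(p, n))


if __name__ == "__main__":
    mean, fraction = mean_and_fraction()
    print("mean of r2_max:", mean)
    print("share with r2_max > 4/5:", fraction)
    print("known-locus MAF with largest mean r2_max:", best_known_maf())
```

With these grid sizes the three estimates should land close to the reported 0.43051, 14.232%, and 0.30131. The agreement is only approximate, because the midpoint rule and the 0.001 scan step both introduce discretization error.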
