

Sampling properties of homozygosity-based statistics for linkage disequilibrium.

Author information

Rosenberg Noah A, Blum Michael G B

Affiliation

Department of Human Genetics, University of Michigan, 1241 East Catherine Street, Ann Arbor, MI 48109-0618, USA.

Publication information

Math Biosci. 2007 Jul;208(1):33-47. doi: 10.1016/j.mbs.2006.07.001. Epub 2006 Jul 27.

Abstract

Homozygosity-based statistics such as Ohta's identity-in-state (IIS) excess offer the potential to measure linkage disequilibrium for multiallelic loci in small samples. However, previous observations have suggested that for independent loci, in small samples these statistics might produce values that more frequently lie on one side rather than on the other side of zero. Here we investigate the sampling properties of the IIS excess. We find that for any pair of independent polymorphic loci, as sample size n approaches infinity, the sampling distribution of the IIS excess approaches a normal distribution. For large samples, the IIS excess tends towards symmetry around zero, and the probabilities of positive and of negative IIS excess both approach 1/2. Surprisingly, however, we also find that for sufficiently large n, independent loci can be chosen so that the probability of a sample having positive IIS excess is arbitrarily close to either 0 or 1. The results are applied to interpretation of data from human populations, and we conclude that before employing homozygosity-based statistics to measure LD in a particular sample, especially for loci with either very small or very large homozygosities, it is useful to verify that loci with the observed homozygosity values are not likely to produce a large bias in IIS excess in samples of the given size.
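The abstract does not state the formula for the IIS excess. As a rough illustration of the sampling question it raises, the Python sketch below assumes the statistic is the sample two-locus haplotype homozygosity minus the product of the two single-locus sample homozygosities, and estimates by simulation how often that quantity is positive or negative when haplotypes are drawn from two independent loci. The function names, allele frequencies, and simulation settings are illustrative assumptions, not taken from the paper.

```python
import random
from collections import Counter
from fractions import Fraction


def iis_excess(haplotypes):
    """Sample IIS excess for a list of (allele_a, allele_b) haplotypes.

    Assumed definition (not given in the abstract): two-locus haplotype
    homozygosity minus the product of the single-locus homozygosities,
    all computed from sample counts. Exact fractions are used so that the
    sign of the result is not affected by floating-point rounding.
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("need at least one haplotype")
    joint = Counter(haplotypes)
    locus_a = Counter(a for a, _ in haplotypes)
    locus_b = Counter(b for _, b in haplotypes)
    sum_joint = sum(c * c for c in joint.values())
    sum_a = sum(c * c for c in locus_a.values())
    sum_b = sum(c * c for c in locus_b.values())
    return Fraction(sum_joint, n ** 2) - Fraction(sum_a * sum_b, n ** 4)


def sign_fractions(freqs_a, freqs_b, n, reps=2000, seed=0):
    """Estimate how often the IIS excess is positive or negative.

    Each replicate draws n haplotypes whose alleles at the two loci are
    sampled independently from the given (hypothetical) allele frequencies.
    Returns (fraction positive, fraction negative); ties at zero count
    toward neither.
    """
    rng = random.Random(seed)
    alleles_a = list(range(len(freqs_a)))
    alleles_b = list(range(len(freqs_b)))
    positive = negative = 0
    for _ in range(reps):
        a = rng.choices(alleles_a, weights=freqs_a, k=n)
        b = rng.choices(alleles_b, weights=freqs_b, k=n)
        d = iis_excess(list(zip(a, b)))
        if d > 0:
            positive += 1
        elif d < 0:
            negative += 1
    return positive / reps, negative / reps


if __name__ == "__main__":
    # Hypothetical example: one very uneven locus, one even locus, small sample.
    print(sign_fractions([0.95, 0.05], [0.5, 0.5], n=20))
```

Running `sign_fractions` with allele frequencies and a sample size resembling those of a data set gives a quick, informal check of whether the sign of the statistic looks unbalanced for independent loci of that kind, which is the sort of verification the abstract recommends.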

