Divers Jasmin, Redden David T, Carroll Raymond J, Allison David B
Stat Interface. 2011 Jul 1;4(3):327-337. doi: 10.4310/SII.2011.v4.n3.a7.
To show how the variance of the measurement error (ME) associated with individual ancestry proportion estimates can be estimated, especially when the number of ancestral populations () is greater than 2. We extend existing internal consistency measures to estimate the ME variance, and we compare these estimates with the ME variance estimated by use of the repeated measurement (RM) approach. Both approaches work by dividing the genotyped markers into subsets. We examine the effect of the number of subsets and of the allocation of markers to each subset on the performance of each approach. We used simulated data for all comparisons. Independently of the value of , the measures of internal reliability provided less biased and more precise estimates of the ME variance than did those obtained with the RM approach. Both methods tend to perform better when a large number of subsets of markers with similar sizes are considered. Our results will facilitate the use of ME correction methods to address the ME problem in individual ancestry proportion estimates. Our method will improve the ability to control for type I error inflation and loss of power in association tests and other genomic research involving ancestry estimates.
为了展示如何估计与个体祖先比例估计相关的测量误差(ME)的方差,特别是当祖先群体数量()大于2时。我们扩展了现有的内部一致性度量方法来估计ME方差,并将这些估计值与使用重复测量(RM)方法估计的ME方差进行比较。两种方法都是通过将基因分型标记划分为子集来实现的。我们研究了子集数量以及每个子集标记分配对每种方法性能的影响。我们在所有比较中都使用了模拟数据。无论的值如何,内部可靠性度量提供的ME方差估计比RM方法获得的估计偏差更小、更精确。当考虑大量大小相似的标记子集时,这两种方法往往表现得更好。我们的结果将有助于使用ME校正方法来解决个体祖先比例估计中的ME问题。我们的方法将提高在关联测试和其他涉及祖先估计的基因组研究中控制I型错误膨胀和功效损失的能力。