Sitlani Colleen M, McKnight Barbara
Department of Biostatistics, University of Washington, Seattle, USA. csitlani @ uw.edu
Hum Hered. 2011;71(4):246-55. doi: 10.1159/000328858. Epub 2011 Jul 28.
BACKGROUND/AIMS: Genetic single-nucleotide polymorphism (SNP) data are often analyzed using trend tests that rely on a specific assumption about the way that disease frequency varies across genotypes, but the validity of this assumption is not typically known. We explore the relative efficiency of trend tests in which the assumed model may or may not correspond to the true genetic model.
We derive formulae for the asymptotic relative efficiencies (AREs) comparing tests that assume different genetic models. We consider both unstratified and stratified tests, using both case-control and cohort data. We illustrate these formulae using realistic parameters and compare the calculated AREs to simulated relative efficiencies in finite samples.
The AREs are identical for unstratified tests using case-control and cohort data, but differ for stratified tests. Loss of efficiency can be substantial, given specific combinations of high-risk allele frequencies, disease frequencies, and assumed versus actual genetic models. Given reasonably large sample sizes, asymptotic calculations align well with finite sample simulations of relative efficiency.
ARE is a useful estimate of the relative efficiency of statistics using different underlying genetic models. ARE calculations reveal that additive gene doses, which are most commonly used, lead to large losses in power in some settings.
背景/目的:遗传单核苷酸多态性(SNP)数据通常使用趋势检验进行分析,这些检验依赖于关于疾病频率在不同基因型间变化方式的特定假设,但该假设的有效性通常并不明确。我们探讨了假设模型可能与真实遗传模型相符或不相符的趋势检验的相对效率。
我们推导了比较假设不同遗传模型的检验的渐近相对效率(ARE)公式。我们使用病例对照数据和队列数据,考虑了未分层检验和分层检验。我们使用实际参数说明这些公式,并将计算出的ARE与有限样本中的模拟相对效率进行比较。
使用病例对照数据和队列数据的未分层检验的ARE相同,但分层检验的ARE不同。鉴于高风险等位基因频率、疾病频率以及假设与实际遗传模型的特定组合,效率损失可能很大。在样本量足够大的情况下,渐近计算与相对效率的有限样本模拟结果吻合良好。
ARE是使用不同潜在遗传模型的统计量相对效率的有用估计。ARE计算表明,最常用的加性基因剂量在某些情况下会导致检验效能大幅损失。