Ahn Kwangmi, Haynes Chad, Kim Wonkuk, Fleur Rose St, Gordon Derek, Finch Stephen J
Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794-3600, USA.
Ann Hum Genet. 2007 Mar;71(Pt 2):249-61. doi: 10.1111/j.1469-1809.2006.00318.x. Epub 2006 Nov 10.
The questions addressed in this paper are: What single nucleotide polymorphism (SNP) genotyping errors are most costly, in terms of minimum sample size necessary (MSSN) to maintain constant asymptotic power and significance level, when performing case-control studies of genetic association applying the Cochran-Armitage trend test? And which trend test or chi2 test is more powerful under standard genetic models with genotyping errors? Our strategy is to expand the non-centrality parameter of the asymptotic distribution of the trend test to approximate the MSSN using a Taylor series linear in the genotyping error rates. We apply our strategy to example scenarios that assume recessive, dominant, additive, or over-dominant disease models. The most costly errors are recording the more common homozygote as the less common homozygote, and the more common homozygote as the heterozygote, with MSSN that become indefinitely large as the minor SNP allele frequency approaches zero. Misclassifying the heterozygote as the less common homozygote is costly when using the recessive trend test on data from a recessive model. The chi2 test has power close to, but less than, the optimal trend test and is never dominated over all genetic models studied by any specific trend test.
在应用 Cochr an - Armitage 趋势检验进行基因关联病例对照研究时,就维持恒定渐近检验效能和显著性水平所需的最小样本量(MSSN)而言,哪些单核苷酸多态性(SNP)基因分型错误成本最高?在存在基因分型错误的标准遗传模型下,哪种趋势检验或卡方检验效能更高?我们的策略是扩展趋势检验渐近分布的非中心参数,利用基因分型错误率的泰勒级数线性近似来估算 MSSN。我们将该策略应用于假设隐性、显性、加性或超显性疾病模型的示例场景。成本最高的错误是将较常见的纯合子记录为较不常见的纯合子,以及将较常见的纯合子记录为杂合子,当次要 SNP 等位基因频率接近零时,MSSN 会变得无限大。在对隐性模型的数据使用隐性趋势检验时,将杂合子误分类为较不常见的纯合子成本较高。卡方检验的效能接近但低于最优趋势检验,并且在所有研究的遗传模型中,它从未被任何特定趋势检验完全超越。