Li Z, Gail M H, Pee D, Gastwirth J L
Department of Statistics, The Biostatistics Center, George Washington University, Rockville, MD 20852, USA.
Hum Hered. 2002;53(3):114-29. doi: 10.1159/000064974.
Risch and Teng [Genome Res 1998;8:1273-1288] and Teng and Risch [Genome Res 1999;9:234-241] proposed a class of transmission/disequilibrium test-like statistical tests based on the difference between the estimated allele frequencies in the affected and control populations. They evaluated the power of a variety of family-based and nonfamily-based designs for detecting an association between a candidate allele and disease. Because they were concerned with diseases with low penetrances, their power calculations assumed that unaffected individuals can be treated as a random sample from the population. They predicted that this assumption rendered their sample size calculations slightly conservative. We generalize their partial ascertainment conditioning by including the status of the unaffected sibs in the calculations of the distribution and power of the statistic used to compare the allele frequency in affected offspring to the estimated frequency in the parents, based on sibships with genotyped affected and unaffected sibs. Sample size formulas for our full ascertainment methods are presented. The sample sizes for our procedure are compared to those of Teng and Risch. The numerical results and simulations indicate that the simplifying assumption used in Teng and Risch can produce both conservative and anticonservative results. The magnitude of the difference between the sample sizes needed by their partial ascertainment approximation and the full ascertainment is small in the circumstances they focused on but can be appreciable in others, especially when the baseline penetrances are moderate. Two other statistics, using different estimators for the variance of the basic statistic comparing the allele frequencies in the affected and unaffected sibs are introduced. One of them incorporates an estimate of the null variance obtained from an auxiliary sample and appears to noticeably decrease the sample sizes required to achieve a prespecified power.
里施和滕[《基因组研究》1998年;8:1273 - 1288]以及滕和里施[《基因组研究》1999年;9:234 - 241]基于患病和对照人群中估计的等位基因频率差异,提出了一类类似传递/不平衡检验的统计检验方法。他们评估了多种基于家系和非基于家系的设计用于检测候选等位基因与疾病之间关联的效能。由于他们关注的是低外显率疾病,其效能计算假设未患病个体可被视为来自总体的随机样本。他们预测这一假设会使他们的样本量计算略显保守。我们通过在计算用于比较患病后代中等位基因频率与父母中估计频率的统计量的分布和效能时纳入未患病同胞的状态,推广了他们的部分确定条件,该统计量基于有基因分型的患病和未患病同胞的同胞关系。给出了我们完全确定方法的样本量公式。将我们方法所需的样本量与滕和里施的样本量进行了比较。数值结果和模拟表明,滕和里施使用的简化假设可能产生保守和反保守的结果。在他们所关注的情况下,他们的部分确定近似法所需样本量与完全确定所需样本量之间的差异幅度较小,但在其他情况下可能相当可观,尤其是当基线外显率为中等时。还引入了另外两个统计量,它们对比较患病和未患病同胞中等位基因频率的基本统计量的方差使用了不同的估计量。其中一个纳入了从辅助样本获得的无效方差估计值,似乎能显著减少达到预先设定效能所需的样本量。