McPeek M S
Department of Statistics, University of Chicago, Illinois 60637, USA.
Genet Epidemiol. 1999;16(3):225-49. doi: 10.1002/(SICI)1098-2272(1999)16:3<225::AID-GEPI1>3.0.CO;2-#.
The choice of allele-sharing statistics can have a great impact on the power of robust affected relative methods. Similarly, when allele-sharing statistics from several pedigrees are combined, the weight applied to each pedigree's statistic can affect power. Here we describe the direct connection between the affected relative methods and traditional parametric linkage analysis, and we use this connection to give explicit formulae for the optimal sharing statistics and weights, applicable to all pedigree types. One surprising consequence is that under any single gene model, the value of the optimal allele-sharing statistic does not depend on whether observed sharing is between more closely or more distantly related affected relatives. This result also holds for any multigene model with loci unlinked, additivity between loci, and all loci having small effect. For specific classes of two-allele models, we give the most powerful statistics and optimal weights for arbitrary pedigrees. When the effect size is small, these also extend to multigene models with additivity between loci. We propose a useful new statistic, S(rob dom), which performs well for dominant and additive models with varying phenocopy rates and varying predisposing allele frequency. We find that the statistic S(_#alleles), performs well for recessive models with varying phenocopy rates and varying redisposing allele frequency. We also find that for models with large deviation from null sharing, the correspondence between allele-sharing statistics and the models for which they are optimal may also depend on which method is used to test for linkage.
等位基因共享统计量的选择会对稳健的患病亲属方法的效能产生重大影响。同样,当合并来自多个家系的等位基因共享统计量时,应用于每个家系统计量的权重会影响效能。在此,我们描述了患病亲属方法与传统参数连锁分析之间的直接联系,并利用这种联系给出适用于所有家系类型的最优共享统计量和权重的明确公式。一个令人惊讶的结果是,在任何单基因模型下,最优等位基因共享统计量的值并不取决于观察到的共享是在关系更密切还是更疏远的患病亲属之间。对于任何具有不连锁位点、位点间加性效应且所有位点效应较小的多基因模型,该结果同样成立。对于特定类别的双等位基因模型,我们给出了适用于任意家系的最有效统计量和最优权重。当效应大小较小时,这些结果也可扩展到具有位点间加性效应的多基因模型。我们提出了一个有用的新统计量S(rob dom),它在具有不同表型模拟率和不同易感等位基因频率的显性和加性模型中表现良好。我们发现统计量S(_#alleles)在具有不同表型模拟率和不同再分布等位基因频率的隐性模型中表现良好。我们还发现,对于与零共享有较大偏差的模型,等位基因共享统计量与其最优模型之间的对应关系可能还取决于用于检验连锁的方法。