Kaplan N L, Martin E R, Weir B S
Biostatistics Branch, NIEHS, Research Triangle Park, NC 27709, USA.
Am J Hum Genet. 1997 Mar;60(3):691-702.
Case-control studies compare marker-allele distributions in affected and unaffected individuals; a significant result suggests linkage but may simply reflect population structure. For markers with m alleles (m ≥ 2), a McNemar-like statistic, I, estimates the level of population association between the marker and disease loci. To test for linkage after a significant case-control test, within-family tests are performed. These operate on an m × m contingency table whose (i, j)th element is the number of parents that transmit marker allele Mi and do not transmit marker allele Mj to an affected offspring; the dimension of the table is thus the number of alleles at the marker locus. Three test statistics have recently been proposed in the literature: Tc compares symmetric pairs of cells (i, j) and (j, i), Tm compares row and column totals for the same marker allele, and a likelihood ratio statistic, Tl, uses all the cells in the table. In addition, we consider a new statistic, Tmhet, that uses only the heterozygous parents and is approximately χ² with (m − 1) df. We use a Monte Carlo test to guarantee valid tests and to demonstrate the inferiority of Tc and the equality of Tm and Tl in terms of power. The power of the Tmhet test is close, but not always equal, to the power of the Tm test. We also show that, under the alternative hypothesis of linkage, Tm is approximately noncentral χ² with (m − 1) df and noncentrality parameter 2NT(1 − 2θ)²I*, when data on single affecteds in NT families are used. If the disease has a low population frequency, then I* is estimated using the case-control statistic I. This offers a basis for choosing a sample size or a marker system.
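As an illustration of how tests on the transmission table might be computed, the following is a minimal Python sketch, not the authors' implementation. The abstract gives no formulas, so the code assumes the commonly used forms: Tc as a sum over symmetric cell pairs of (n_ij − n_ji)² / (n_ij + n_ji), and Tmhet as ((m − 1)/m) Σ (row_i − col_i)² / (row_i + col_i − 2n_ii). The Monte Carlo scheme (each heterozygous parent transmits either allele with probability 1/2 under the null) is one plausible reading of the test described above. All function names and the example table are hypothetical.

```python
import random


def t_c(table):
    """Symmetry statistic (assumed form): compares cells (i, j) and (j, i)."""
    m = len(table)
    stat = 0.0
    for i in range(m):
        for j in range(i + 1, m):
            n = table[i][j] + table[j][i]
            if n > 0:
                stat += (table[i][j] - table[j][i]) ** 2 / n
    return stat


def t_mhet(table):
    """Marginal statistic on heterozygous parents only (assumed form)."""
    m = len(table)
    stat = 0.0
    for i in range(m):
        row = sum(table[i])                       # allele i transmitted
        col = sum(table[r][i] for r in range(m))  # allele i not transmitted
        denom = row + col - 2 * table[i][i]       # drops homozygous parents
        if denom > 0:
            stat += (row - col) ** 2 / denom
    return (m - 1) / m * stat


def monte_carlo_p(table, statistic, n_sim=2000, seed=0):
    """Monte Carlo p-value under the null of no linkage (assumed scheme).

    For each pair of alleles (i, j), the parents counted in cells (i, j)
    and (j, i) are reassigned by fair coin flips; diagonal cells are fixed.
    """
    rng = random.Random(seed)
    m = len(table)
    observed = statistic(table)
    hits = 0
    for _ in range(n_sim):
        sim = [row[:] for row in table]
        for i in range(m):
            for j in range(i + 1, m):
                n = table[i][j] + table[j][i]
                k = sum(rng.random() < 0.5 for _ in range(n))
                sim[i][j], sim[j][i] = k, n - k
        if statistic(sim) >= observed - 1e-12:
            hits += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return (hits + 1) / (n_sim + 1)


# Hypothetical 3-allele table: rows = transmitted, columns = not transmitted.
example = [[5, 20, 12], [8, 4, 10], [6, 9, 3]]
print("Tc    =", round(t_c(example), 3), "p =", monte_carlo_p(example, t_c))
print("Tmhet =", round(t_mhet(example), 3), "p =", monte_carlo_p(example, t_mhet))
```

With m = 2 both functions reduce to the same quantity, (b − c)² / (b + c), where b and c are the two off-diagonal counts, which is a convenient sanity check on the assumed formulas.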