Leutenegger Anne-Louise, Génin Emmanuelle, Thompson Elizabeth A, Clerget-Darpoux Françoise
Unité de Recherche d'Epidémiologie Génétique, INSERM U535, Kremlin-Bicêtre, France.
Genet Epidemiol. 2002 Nov;23(4):413-25. doi: 10.1002/gepi.10190.
Many studies are done in small isolated populations and populations where marriages between relatives are encouraged. In this paper, we point out some problems with applying the maximum lod score (MLS) method (Risch, [1990] Am. J. Hum. Genet. 46:242-253) in these populations where relationships exist between the two parents of the affected sib-pairs. Characterizing the parental relationships by the kinship coefficient between the parents (f), the maternal inbreeding coefficient (alpha(m), and the paternal inbreeding coefficient (alpha(p)), we explored the relationship between the identity by descent (IBD) vector expected under the null hypothesis of no linkage and these quantities. We find that the expected IBD vector is no longer (0.25, 0.5, 0.25) when f, alpha(m), and alpha(p) differ from zero. In addition, the expected IBD vector does not always follow the triangle constraints recommended by Holmans ([1993] Am. J. Hum. Genet. 52:362-374). So the classically used MLS statistic needs to be adapted to the presence of parental relationships. We modified the software GENEHUNTER (Kruglyak et al. [1996] Am. J. Hum. Genet. 58: 1347-1363) to do so. Indeed, the current version of the software does not compute the likelihood properly under the null hypothesis. We studied the adapted statistic by simulating data on three different family structures: (1) parents are double first cousins (f=0.125, alpha(m)=alpha(p)=0), (2) each parent is the offspring of first cousins (f=0, alpha(m)=alpha(p)=0.0625), and (3) parents are related as in the pedigree from Goddard et al. ([1996] Am. J. Hum. Genet. 58:1286-1302) (f=0.109, alpha(m)=alpha(p)=0.0625). The appropriate threshold needs to be derived for each case in order to get the correct type I error. And using the classical statistic in the presence of both parental kinship and parental inbreeding almost always leads to false conclusions.
许多研究是在小型孤立群体以及鼓励近亲结婚的群体中进行的。在本文中,我们指出在受影响同胞对的双亲之间存在亲缘关系的这些群体中应用最大似然比分数(MLS)方法(里斯克,[1990]《美国人类遗传学杂志》46:242 - 253)存在的一些问题。通过双亲之间的亲缘系数(f)、母系近亲繁殖系数(αm)和父系近亲繁殖系数(αp)来描述双亲关系,我们探讨了在无连锁的零假设下预期的同源等位基因状态(IBD)向量与这些量之间的关系。我们发现当f、αm和αp不为零时,预期的IBD向量不再是(0.25, 0.5, 0.25)。此外,预期的IBD向量并不总是遵循霍尔曼斯([1993]《美国人类遗传学杂志》52:362 - 374)推荐的三角形约束。因此,经典使用的MLS统计量需要针对双亲关系的存在进行调整。我们对软件GENEHUNTER(克鲁格利亚克等人,[1996]《美国人类遗传学杂志》58: 1347 - 1363)进行了修改以实现此目的。实际上,该软件的当前版本在零假设下不能正确计算似然性。我们通过模拟三种不同家庭结构的数据来研究调整后的统计量:(1)双亲是双重一级表亲(f = 0.125,αm = αp = 0),(2)每个双亲是一级表亲的后代(f = 0,αm = αp = 0.0625),以及(3)双亲的亲缘关系如戈达德等人([1996]《美国人类遗传学杂志》58:1286 - 1302)的系谱所示(f = 0.109,αm = αp = 0.0625)。为了得到正确的I型错误,需要为每种情况推导合适的阈值。并且在双亲亲缘关系和双亲近亲繁殖都存在的情况下使用经典统计量几乎总是会导致错误结论。