Ekstrøm Claus Thorn
Department of Mathematics and Physics, Royal Veterinary and Agricultural University, Frederiksberg, Denmark.
Genet Epidemiol. 2003 Nov;25(3):214-24. doi: 10.1002/gepi.10259.
Genetic marker data play a crucial role in gene mapping, and genotyping errors may have substantial influence on the power to detect and the precision to locate disease loci. Statistical methods can identify individuals, markers, or pedigrees with a high likelihood of containing genotyping errors, and the putative erroneous genotypes can then be rechecked and either verified, removed, or corrected to reduce the loss of power introduced by errors. We present a method to identify genetic markers with a high genotyping error rate. Genotyping errors are likely to appear as double recombinations which expand the genetic map around the marker. Markers flagged as map expanders (i.e., having an excessive number of double recombinations) can then be reread or regenotyped, or a replacement marker of higher quality can be used instead. The proposed method can be applied to any type of pedigree. Simulation studies of nuclear pedigrees and sib-pairs show that the proposed method generally has high power to identify map expanders when the set of markers is reasonably dense (average intermarker distance of 5 cM), even when the nominal genotyping error rate is low (2%). Not surprisingly, the power to detect map expanders increases with marker heterozygosity and genotyping error rate, and is reduced with increasing intermarker distance. When the method was applied to a real dataset consisting of 56 nuclear pedigrees genotyped for 20 microsatellite markers on chromosome 4, the method diagnosed three markers as map expanders. Subsequent examination of these markers proved that they all had high genotyping error frequencies.
遗传标记数据在基因定位中起着至关重要的作用,而基因分型错误可能会对检测疾病位点的效能以及定位的精度产生重大影响。统计方法能够识别出极有可能包含基因分型错误的个体、标记或家系,随后可以对推定的错误基因型进行重新检查,进而予以验证、剔除或修正,以减少错误所导致的效能损失。我们提出了一种识别具有高基因分型错误率的遗传标记的方法。基因分型错误很可能表现为双交换,这会使标记周围的遗传图谱扩展。那些被标记为图谱扩展者(即具有过多双交换的标记)随后可以重新读取或重新进行基因分型,或者可以使用质量更高的替代标记。所提出的方法可应用于任何类型的家系。对核心家系和同胞对的模拟研究表明,当标记集相当密集(标记间平均距离为5厘摩)时,即便名义基因分型错误率很低(2%),所提出的方法通常也具有较高的效能来识别图谱扩展者。不出所料,检测图谱扩展者的效能会随着标记杂合性和基因分型错误率的增加而提高,并且会随着标记间距离的增加而降低。当将该方法应用于一个由56个核心家系组成的真实数据集时,这些家系对4号染色体上的20个微卫星标记进行了基因分型,该方法诊断出三个标记为图谱扩展者。随后对这些标记的检查证明它们都具有很高的基因分型错误频率。