Wellcome Trust Sanger Institute, Cambridge, UK.
Hum Mol Genet. 2010 Jul 1;19(13):2539-53. doi: 10.1093/hmg/ddq102. Epub 2010 Mar 8.
We describe a novel approach for evaluating SNP genotypes of a genome-wide association scan to identify "ethnic outlier" subjects whose ethnicity is different or admixed compared to most other subjects in the genotyped sample set. Each ethnic outlier is detected by counting a genomic excess of "rare" heterozygotes and/or homozygotes whose frequencies are low (<1%) within genotypes of the sample set being evaluated. This method also enables simple and striking visualization of non-Caucasian chromosomal DNA segments interspersed within the chromosomes of ethnically admixed individuals. We show that this visualization of the mosaic structure of admixed human chromosomes gives results similar to another visualization method (SABER) but with much less computational time and burden. We also show that other methods for detecting ethnic outliers are enhanced by evaluating only genomic regions of visualized admixture rather than diluting outlier ancestry by evaluating the entire genome considered in aggregate. We have validated our method in the Wellcome Trust Case Control Consortium (WTCCC) study of 17,000 subjects as well as in HapMap subjects and simulated outliers of known ethnicity and admixture. The method's ability to precisely delineate chromosomal segments of non-Caucasian ethnicity has enabled us to demonstrate previously unreported non-Caucasian admixture in two HapMap Caucasian parents and in a number of WTCCC subjects. Its sensitive detection of ethnic outliers and simple visual discrimination of discrete chromosomal segments of different ethnicity implies that this method of rare heterozygotes and homozygotes (RHH) is likely to have diverse and important applications in humans and other species.
我们描述了一种评估全基因组关联扫描 SNP 基因型的新方法,以识别与所分析样本集中大多数其他个体相比具有不同或混合种族背景的“种族外显子”个体。通过计算样本集中罕见杂合子和/或纯合子的基因组过多,即频率较低(<1%)的个体,从而检测到每个种族外显子。该方法还能够简单而显著地可视化非高加索人染色体 DNA 片段在混合个体染色体中的分布。我们发现,这种混合人类染色体镶嵌结构的可视化结果与另一种可视化方法(SABER)相似,但计算时间和负担要小得多。我们还发现,通过仅评估可视化混合区域,而不是通过整体评估整个基因组来稀释外显子祖先,其他检测种族外显子的方法可以得到增强。我们已经在超过 17000 个个体的 Wellcome Trust Case Control Consortium (WTCCC)研究以及 HapMap 个体和已知种族和混合的模拟外显子中验证了我们的方法。该方法能够精确描绘非高加索人种族的染色体片段,使我们能够在两个 HapMap 高加索人父母和许多 WTCCC 个体中展示以前未报告的非高加索人混合。其对种族外显子的敏感检测和不同种族离散染色体片段的简单视觉区分,意味着这种罕见杂合子和纯合子(RHH)方法很可能在人类和其他物种中有广泛而重要的应用。