Predoctoral Program in Human Genetics, Johns Hopkins School of Medicine, Baltimore, Maryland, United States of America.
PLoS One. 2012;7(11):e49575. doi: 10.1371/journal.pone.0049575. Epub 2012 Nov 19.
Correct annotation of the genetic relationships between samples is essential for population genomic studies, which could be biased by errors or omissions. To this end, we used identity-by-state (IBS) and identity-by-descent (IBD) methods to assess genetic relatedness of individuals within HapMap phase III data. We analyzed data from 1,397 individuals across 11 ethnic populations. Our results support previous studies (Pemberton et al., 2010; Kyriazopoulou-Panagiotopoulou et al., 2011) assessing unknown relatedness present within this population. Additionally, we present evidence for 1,657 novel pairwise relationships across 9 populations. Surprisingly, significant Cotterman's coefficients of relatedness K1 (IBD1) values were detected between pairs of known parents. Furthermore, significant K2 (IBD2) values were detected in 32 previously annotated parent-child relationships. Consistent with a hypothesis of inbreeding, regions of homozygosity (ROH) were identified in the offspring of related parents, of which a subset overlapped those reported in previous studies (Gibson et al. 2010; Johnson et al. 2011). In total, we inferred 28 inbred individuals with ROH that overlapped areas of relatedness between the parents and/or IBD2 sharing at a different genomic locus between a child and a parent. Finally, 8 previously annotated parent-child relationships had unexpected K0 (IBD0) values (resulting from a chromosomal abnormality or genotype error), and 10 previously annotated second-degree relationships along with 38 other novel pairwise relationships had unexpected IBD2 (indicating two separate paths of recent ancestry). These newly described types of relatedness may impact the outcome of previous studies and should inform the design of future studies relying on the HapMap Phase III resource.
正确注释样本之间的遗传关系对于群体基因组研究至关重要,这些关系可能会因错误或遗漏而产生偏差。为此,我们使用基于状态的身份(IBS)和基于血统的身份(IBD)方法来评估 HapMap 第三阶段数据中个体的遗传关系。我们分析了来自 11 个人种的 1397 个人的数据。我们的结果支持了先前的研究(Pemberton 等人,2010;Kyriazopoulou-Panagiotopoulou 等人,2011),这些研究评估了该人群中存在的未知亲缘关系。此外,我们还提供了 9 个人群中 1657 个新的成对关系的证据。令人惊讶的是,在已知父母对之间检测到了显著的 Cotterman 相关系数 K1(IBD1)值。此外,在 32 个先前注释的亲子关系中检测到了显著的 K2(IBD2)值。与近亲繁殖的假设一致,在相关父母的后代中鉴定出了纯合区域(ROH),其中一部分与先前的研究(Gibson 等人,2010;Johnson 等人,2011)报告的重叠。总的来说,我们推断出 28 个具有 ROH 的近交个体,这些个体与父母之间的相关区域或子与父之间的不同基因组位置的 IBD2 共享重叠。最后,8 个先前注释的亲子关系具有意外的 K0(IBD0)值(由染色体异常或基因型错误引起),10 个先前注释的二级关系以及 38 个其他新的成对关系具有意外的 IBD2(表明最近有两个不同的祖先路径)。这些新描述的亲缘关系类型可能会影响先前研究的结果,并应告知依赖 HapMap 第三阶段资源的未来研究的设计。