Sun Lei, Dimitromanolakis Apostolos
Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Canada; Department of Statistical Sciences, University of Toronto, Canada.
Department of Clinical Biochemistry, University Health Network, Canada.
BMC Proc. 2014 Jun 17;8(Suppl 1 Genetic Analysis Workshop 18):S23. doi: 10.1186/1753-6561-8-S1-S23. eCollection 2014.
Pedigree errors and cryptic relatedness often appear in families or population samples collected for genetic studies. If not identified, these issues can increase false negatives or false positives in both linkage and association analyses. To identify pedigree errors and cryptic relatedness among individuals from the 20 San Antonio Family Studies (SAFS) families, and cryptic relatedness among the 157 putatively unrelated individuals, we apply PREST-plus to the genome-wide single-nucleotide polymorphism (SNP) data and analyze the estimated identity-by-descent (IBD) distributions for all pairs of genotyped individuals. Based on the given pedigrees alone, PREST-plus identifies the following putative relative pairs: 1091 full-sib, 162 half-sib, 360 grandparent-grandchild, 2269 avuncular, 2717 first cousin, 402 half-avuncular, 559 half-first cousin, 2 half-sib+first cousin, 957 parent-offspring, and 440,546 unrelated. Using the genotype data, PREST-plus detects 7 misspecified relative pairs, whose IBD estimates clearly deviate from the null expectations, and it identifies 4 cryptically related pairs involving 7 individuals from 6 families.
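The comparison of estimated IBD distributions against null expectations can be illustrated with a minimal sketch. It is not the PREST-plus procedure, which uses formal statistical tests. It assumes the standard IBD-sharing probabilities (p0, p1, p2) for outbred relative pairs, and the function names, the sample estimates, and the fixed tolerance of 0.1 are hypothetical choices made only for illustration.

```python
# Null IBD-sharing probabilities (p0, p1, p2): the chance that a pair shares
# 0, 1, or 2 alleles identical by descent at a locus, assuming no inbreeding.
NULL_IBD = {
    "parent-offspring": (0.0, 1.0, 0.0),
    "full-sib": (0.25, 0.5, 0.25),
    "half-sib": (0.5, 0.5, 0.0),
    "grandparent-grandchild": (0.5, 0.5, 0.0),
    "avuncular": (0.5, 0.5, 0.0),
    "first cousin": (0.75, 0.25, 0.0),
    "half-avuncular": (0.75, 0.25, 0.0),
    "half-first cousin": (0.875, 0.125, 0.0),
    "unrelated": (1.0, 0.0, 0.0),
}


def ibd_deviation(estimate, relationship):
    """Largest absolute gap between estimated and expected (p0, p1, p2)."""
    expected = NULL_IBD[relationship]
    return max(abs(e - x) for e, x in zip(estimate, expected))


def flag_pairs(pairs, tolerance=0.1):
    """Return pairs whose estimate deviates from the null by more than tolerance.

    `tolerance` is a hypothetical cutoff, not a PREST-plus parameter.
    """
    flagged = []
    for id_a, id_b, relationship, estimate in pairs:
        dev = ibd_deviation(estimate, relationship)
        if dev > tolerance:
            flagged.append((id_a, id_b, relationship, round(dev, 3)))
    return flagged


# Made-up estimates for three pairs (illustrative only, not study data).
example_pairs = [
    ("A", "B", "full-sib", (0.26, 0.49, 0.25)),   # consistent with full sibs
    ("C", "D", "full-sib", (0.52, 0.47, 0.01)),   # looks like half sibs
    ("E", "F", "unrelated", (0.74, 0.26, 0.0)),   # looks cryptically related
]
flagged = flag_pairs(example_pairs)
```

In this sketch, a putative full-sib pair whose estimate resembles the half-sib expectation corresponds to a misspecified relationship, while a putatively unrelated pair with substantial IBD sharing corresponds to cryptic relatedness.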