King Daniel A, Fitzgerald Tomas W, Miller Ray, Canham Natalie, Clayton-Smith Jill, Johnson Diana, Mansour Sahar, Stewart Fiona, Vasudevan Pradeep, Hurles Matthew E
Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1HH, United Kingdom;
Genome Res. 2014 Apr;24(4):673-87. doi: 10.1101/gr.160465.113. Epub 2013 Dec 19.
Exome sequencing of parent-offspring trios is a popular strategy for identifying causative genetic variants in children with rare diseases. This method owes its strength to the leveraging of inheritance information, which facilitates de novo variant calling, inference of compound heterozygosity, and the identification of inheritance anomalies. Uniparental disomy describes the inheritance of a homologous chromosome pair from only one parent. This aberration is important to detect in genetic disease studies because it can result in imprinting disorders and recessive diseases. We have developed a software tool to detect uniparental disomy from child-mother-father genotype data that uses a binomial test to identify chromosomes with a significant burden of uniparentally inherited genotypes. This tool is the first to read VCF-formatted genotypes, to perform integrated copy number filtering, and to use a statistical test inherently robust for use in platforms of varying genotyping density and noise characteristics. Simulations demonstrated superior accuracy compared with previously developed approaches. We implemented the method on 1057 trios from the Deciphering Developmental Disorders project, a trio-based rare disease study, and detected six validated events, a significant enrichment compared with the population prevalence of UPD (1 in 3500), suggesting that most of these events are pathogenic. One of these events represents a known imprinting disorder, and exome analyses have identified rare homozygous candidate variants, mainly in the isodisomic regions of UPD chromosomes, which, among other variants, provide targets for further genetic and functional evaluation.
对亲子三联体进行外显子组测序是一种在患有罕见疾病的儿童中识别致病基因变异的常用策略。这种方法的优势在于利用了遗传信息,这有助于进行新生变异的检测、复合杂合性的推断以及遗传异常的识别。单亲二体描述的是仅从一个亲本遗传而来的一对同源染色体的情况。在遗传疾病研究中检测到这种畸变很重要,因为它可能导致印记障碍和隐性疾病。我们开发了一种软件工具,可从儿童-母亲-父亲的基因型数据中检测单亲二体,该工具使用二项式检验来识别具有大量单亲遗传基因型的染色体。该工具是首个读取VCF格式基因型、进行综合拷贝数过滤并使用在不同基因分型密度和噪声特征的平台上都具有固有稳健性的统计检验的工具。模拟结果表明,与先前开发的方法相比,其准确性更高。我们在“解读发育障碍”项目(一项基于三联体的罕见疾病研究)的1057个三联体上实施了该方法,检测到6个经过验证的事件,与单亲二体的人群患病率(3500分之一)相比有显著富集,这表明这些事件中的大多数是致病性的。其中一个事件代表一种已知的印记障碍,外显子组分析已鉴定出罕见的纯合候选变异,主要存在于单亲二体染色体的等臂区域,这些变异与其他变异一起为进一步的遗传和功能评估提供了靶点。