一种用于指导家系测序选择的统计框架。

A statistical framework to guide sequencing choices in pedigrees.

机构信息

Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195, USA; Department of Biostatistics, University of Washington, Seattle, WA 98195, USA.

Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195, USA.

出版信息

Am J Hum Genet. 2014 Feb 6;94(2):257-67. doi: 10.1016/j.ajhg.2014.01.005.

DOI:10.1016/j.ajhg.2014.01.005

PMID:24507777

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3928665/

Abstract

The use of large pedigrees is an effective design for identifying rare functional variants affecting heritable traits. Cost-effective studies using sequence data can be achieved via pedigree-based genotype imputation in which some subjects are sequenced and missing genotypes are inferred on the remaining subjects. Because of high cost, it is important to carefully prioritize subjects for sequencing. Here, we introduce a statistical framework that enables systematic comparison among subject-selection choices for sequencing. We introduce a metric "local coverage," which allows the use of inferred inheritance vectors to measure genotype-imputation ability specifically in a region of interest, such as one with prior evidence of linkage. In the absence of linkage information, we can instead use a "genome-wide coverage" metric computed with the pedigree structure. These metrics enable the development of a method that identifies efficient selection choices for sequencing. As implemented in GIGI-Pick, this method also flexibly allows initial manual selection of subjects and optimizes selections within the constraint that only some subjects might be available for sequencing. In the present study, we used simulations to compare GIGI-Pick with PRIMUS, ExomePicks, and common ad hoc methods of selecting subjects. In genotype imputation of both common and rare alleles, GIGI-Pick substantially outperformed all other methods considered and had the added advantage of incorporating prior linkage information. We also used a real pedigree to demonstrate the utility of our approach in identifying causal mutations. Our work enables prioritization of subjects for sequencing to facilitate dissection of the genetic basis of heritable traits.

摘要

利用大型家系是鉴定影响遗传性状的罕见功能变异的有效设计。通过基于家系的基因型推断，可以使用经济有效的序列数据进行研究，其中一些个体进行测序，其余个体的缺失基因型进行推断。由于成本高昂，因此仔细优先选择测序对象非常重要。在这里，我们介绍了一种统计框架，可实现对测序对象选择的系统比较。我们引入了一个度量标准“局部覆盖率”，该标准允许使用推断的遗传向量来专门测量目标区域（例如具有先前连锁证据的区域）中的基因型推断能力。在没有连锁信息的情况下，我们可以使用基于家系结构计算的“全基因组覆盖率”度量标准。这些指标使我们能够开发一种方法，该方法可以识别用于测序的有效选择方案。作为 GIGI-Pick 的实现方法，该方法还可以灵活地进行初始手动选择，并在仅一些个体可能进行测序的约束条件下优化选择。在本研究中，我们使用模拟来比较 GIGI-Pick 与 PRIMUS、ExomePicks 和常见的选择对象的特定方法。在常见和罕见等位基因的基因型推断中，GIGI-Pick 均明显优于所有其他考虑的方法，并且具有纳入先前连锁信息的额外优势。我们还使用真实的家系证明了我们的方法在识别因果突变中的实用性。我们的工作可以优先选择测序对象，以促进对遗传性状遗传基础的剖析。

相似文献

A statistical framework to guide sequencing choices in pedigrees.

Am J Hum Genet. 2014 Feb 6;94(2):257-67. doi: 10.1016/j.ajhg.2014.01.005.

GIGI: an approach to effective imputation of dense genotypes on large pedigrees.

Am J Hum Genet. 2013 Apr 4;92(4):504-16. doi: 10.1016/j.ajhg.2013.02.011.

Power of family-based association designs to detect rare variants in large pedigrees using imputed genotypes.

Genet Epidemiol. 2014 Jan;38(1):1-9. doi: 10.1002/gepi.21776. Epub 2013 Nov 15.

Combining family- and population-based imputation data for association analysis of rare and common variants in large pedigrees.

Genet Epidemiol. 2014 Nov;38(7):579-90. doi: 10.1002/gepi.21844. Epub 2014 Aug 1.

Whole-genome characterization in pedigreed non-human primates using genotyping-by-sequencing (GBS) and imputation.

BMC Genomics. 2016 Aug 24;17(1):676. doi: 10.1186/s12864-016-2966-x.

GIGI-Quick: a fast approach to impute missing genotypes in genome-wide association family data.

Bioinformatics. 2018 May 1;34(9):1591-1593. doi: 10.1093/bioinformatics/btx782.

Revisit Population-based and Family-based Genotype Imputation.

Sci Rep. 2019 Feb 12;9(1):1800. doi: 10.1038/s41598-018-38469-4.

Comparison and assessment of family- and population-based genotype imputation methods in large pedigrees.

Genome Res. 2019 Jan;29(1):125-134. doi: 10.1101/gr.236315.118. Epub 2018 Dec 4.

Multipoint linkage analysis with many multiallelic or dense diallelic markers: Markov chain-Monte Carlo provides practical approaches for genome scans on general pedigrees.

Am J Hum Genet. 2006 Nov;79(5):846-58. doi: 10.1086/508472. Epub 2006 Sep 20.

PedBLIMP: extending linear predictors to impute genotypes in pedigrees.

Genet Epidemiol. 2014 Sep;38(6):531-41. doi: 10.1002/gepi.21838. Epub 2014 Jul 12.

引用本文的文献

Revisit Population-based and Family-based Genotype Imputation.

Sci Rep. 2019 Feb 12;9(1):1800. doi: 10.1038/s41598-018-38469-4.

Robust Rare-Variant Association Tests For Quantitative Traits in General Pedigrees.

Stat Biosci. 2018 Dec;10(3):491-505. doi: 10.1007/s12561-017-9197-9. Epub 2017 Jun 5.

Hybrid peeling for fast and accurate calling, phasing, and imputation with sequence data of any coverage in pedigrees.

Genet Sel Evol. 2018 Dec 18;50(1):67. doi: 10.1186/s12711-018-0438-2.

Inferring Transmission Histories of Rare Alleles in Population-Scale Genealogies.

Am J Hum Genet. 2018 Dec 6;103(6):893-906. doi: 10.1016/j.ajhg.2018.10.017.

Comparison and assessment of family- and population-based genotype imputation methods in large pedigrees.

Genome Res. 2019 Jan;29(1):125-134. doi: 10.1101/gr.236315.118. Epub 2018 Dec 4.

Application of genome analysis strategies in the clinical testing for pediatric diseases.

Pediatr Investig. 2018 Jul 16;2(2):72-81. doi: 10.1002/ped4.12044.

Identity-by-descent estimation with population- and pedigree-based imputation in admixed family data.

BMC Proc. 2016 Oct 18;10(Suppl 7):295-301. doi: 10.1186/s12919-016-0046-5. eCollection 2016.

Whole-genome characterization in pedigreed non-human primates using genotyping-by-sequencing (GBS) and imputation.

BMC Genomics. 2016 Aug 24;17(1):676. doi: 10.1186/s12864-016-2966-x.

G-STRATEGY: Optimal Selection of Individuals for Sequencing in Genetic Association Studies.

Genet Epidemiol. 2016 Sep;40(6):446-60. doi: 10.1002/gepi.21982. Epub 2016 Jun 3.

Family-based approaches: design, imputation, analysis, and beyond.

BMC Genet. 2016 Feb 3;17 Suppl 2(Suppl 2):9. doi: 10.1186/s12863-015-0318-5.

本文引用的文献

Two-phase and family-based designs for next-generation sequencing studies.

Front Genet. 2013 Dec 13;4:276. doi: 10.3389/fgene.2013.00276.

Joint linkage and association analysis with exome sequence data implicates SLC25A40 in hypertriglyceridemia.

Am J Hum Genet. 2013 Dec 5;93(6):1035-45. doi: 10.1016/j.ajhg.2013.10.019. Epub 2013 Nov 21.

Power of family-based association designs to detect rare variants in large pedigrees using imputed genotypes.

Genet Epidemiol. 2014 Jan;38(1):1-9. doi: 10.1002/gepi.21776. Epub 2013 Nov 15.

Family-based exome-sequencing approach identifies rare susceptibility variants for lithium-responsive bipolar disorder.

Genome. 2013 Oct;56(10):634-40. doi: 10.1139/gen-2013-0081. Epub 2013 Sep 17.

Recurrent gain-of-function mutation in PRKG1 causes thoracic aortic aneurysms and acute aortic dissections.

Am J Hum Genet. 2013 Aug 8;93(2):398-404. doi: 10.1016/j.ajhg.2013.06.019. Epub 2013 Aug 1.

Combined sequence-based and genetic mapping analysis of complex traits in outbred rats.

Nat Genet. 2013 Jul;45(7):767-75. doi: 10.1038/ng.2644. Epub 2013 May 26.

Mutations in BICD2, which encodes a golgin and important motor adaptor, cause congenital autosomal-dominant spinal muscular atrophy.

Am J Hum Genet. 2013 Jun 6;92(6):946-54. doi: 10.1016/j.ajhg.2013.04.011. Epub 2013 May 9.

GIGI: an approach to effective imputation of dense genotypes on large pedigrees.

Am J Hum Genet. 2013 Apr 4;92(4):504-16. doi: 10.1016/j.ajhg.2013.02.011.

Exome sequencing and genome-wide linkage analysis in 17 families illustrate the complex contribution of TTN truncating variants to dilated cardiomyopathy.

Circ Cardiovasc Genet. 2013 Apr;6(2):144-53. doi: 10.1161/CIRCGENETICS.111.000062. Epub 2013 Feb 15.

Sequence kernel association test for quantitative traits in family samples.

Genet Epidemiol. 2013 Feb;37(2):196-204. doi: 10.1002/gepi.21703. Epub 2012 Dec 26.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于指导家系测序选择的统计框架。

A statistical framework to guide sequencing choices in pedigrees.

机构信息

Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195, USA; Department of Biostatistics, University of Washington, Seattle, WA 98195, USA.

Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195, USA.

出版信息

Am J Hum Genet. 2014 Feb 6;94(2):257-67. doi: 10.1016/j.ajhg.2014.01.005.

DOI:10.1016/j.ajhg.2014.01.005

PMID:24507777

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3928665/

Abstract

摘要

一种用于指导家系测序选择的统计框架。

A statistical framework to guide sequencing choices in pedigrees.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

一种用于指导家系测序选择的统计框架。

A statistical framework to guide sequencing choices in pedigrees.

机构信息

出版信息