利用序列数据在孟德尔疾病中寻找疾病变异：方法与应用。

Finding disease variants in Mendelian disorders by using sequence data: methods and applications.

机构信息

Department of Biostatistics, Columbia University, New York, NY 10032, USA.

出版信息

Am J Hum Genet. 2011 Dec 9;89(6):701-12. doi: 10.1016/j.ajhg.2011.11.003. Epub 2011 Dec 1.

DOI:10.1016/j.ajhg.2011.11.003

PMID:22137099

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3234377/

Abstract

Many sequencing studies are now underway to identify the genetic causes for both Mendelian and complex traits. Via exome-sequencing, genes harboring variants implicated in several Mendelian traits have already been identified. The underlying methodology in these studies is a multistep algorithm based on filtering variants identified in a small number of affected individuals and depends on whether they are novel (not yet seen in public resources such as dbSNP), shared among affected individuals, and other external functional information on the variants. Although intuitive, these filter-based methods are nonoptimal and do not provide any measure of statistical uncertainty. We describe here a formal statistical approach that has several distinct advantages: (1) it provides fast computation of approximate p values for individual genes, (2) it adjusts for the background variation in each gene, (3) it allows for incorporation of functional or linkage-based information, and (4) it accommodates designs based on both affected relative pairs and unrelated affected individuals. We show via simulations that the proposed approach can be used in conjunction with the existing filter-based methods to achieve a substantially better ranking of a gene relevant for disease when compared to currently used filter-based approaches, this is especially so in the presence of disease locus heterogeneity. We revisit recent studies on three Mendelian diseases and show that the proposed approach results in the implicated gene being ranked first in all studies, and approximate p values of 10(-6) for the Miller Syndrome gene, 1.0 × 10(-4) for the Freeman-Sheldon Syndrome gene, and 3.5 × 10(-5) for the Kabuki Syndrome gene.

摘要

许多测序研究现在正在进行，以确定孟德尔和复杂性状的遗传原因。通过外显子组测序，已经确定了携带几种孟德尔性状相关变异的基因。这些研究中的基本方法是一种多步骤算法，基于对少数受影响个体中识别出的变体进行过滤，并且取决于它们是否是新颖的（尚未在公共资源如 dbSNP 中看到）、在受影响个体中共享，以及变体的其他外部功能信息。虽然直观，但这些基于过滤的方法不是最优的，并且不提供任何统计不确定性的度量。我们在这里描述一种正式的统计方法，它具有几个明显的优点：(1) 它为个体基因提供了快速计算近似 p 值的方法，(2) 它调整了每个基因中的背景变异，(3) 它允许包含功能或基于连锁的信息，以及 (4) 它适应了基于受影响相对对和无关受影响个体的设计。我们通过模拟表明，所提出的方法可以与现有的基于过滤的方法结合使用，与目前使用的基于过滤的方法相比，可以更有效地对与疾病相关的基因进行排名，在存在疾病位点异质性的情况下尤其如此。我们重新研究了最近关于三种孟德尔疾病的研究，并表明所提出的方法导致所涉及的基因在所有研究中排名第一，并且 Miller 综合征基因的近似 p 值为 10(-6)，Freeman-Sheldon 综合征基因的近似 p 值为 1.0×10(-4)，Kabuki 综合征基因的近似 p 值为 3.5×10(-5)。

相似文献

Finding disease variants in Mendelian disorders by using sequence data: methods and applications.

Am J Hum Genet. 2011 Dec 9;89(6):701-12. doi: 10.1016/j.ajhg.2011.11.003. Epub 2011 Dec 1.

Revisiting Mendelian disorders through exome sequencing.

Hum Genet. 2011 Apr;129(4):351-70. doi: 10.1007/s00439-011-0964-2. Epub 2011 Feb 18.

Miller syndrome with novel dihydroorotate dehydrogenase gene mutations.

Pediatr Int. 2011 Aug;53(4):587-91. doi: 10.1111/j.1442-200X.2010.03303.x.

Under the mask of Kabuki syndrome: Elucidation of genetic-and phenotypic heterogeneity in patients with Kabuki-like phenotype.

Eur J Med Genet. 2018 Jun;61(6):315-321. doi: 10.1016/j.ejmg.2018.01.005. Epub 2018 Jan 4.

Miller (Genee-Wiedemann) syndrome represents a clinically and biochemically distinct subgroup of postaxial acrofacial dysostosis associated with partial deficiency of DHODH.

Hum Mol Genet. 2012 Sep 15;21(18):3969-83. doi: 10.1093/hmg/dds218. Epub 2012 Jun 12.

Protein instability and functional defects caused by mutations of dihydro-orotate dehydrogenase in Miller syndrome patients.

Biosci Rep. 2012 Dec;32(6):631-9. doi: 10.1042/BSR20120046.

Elevated plasma dihydroorotate in Miller syndrome: Biochemical, diagnostic and clinical implications, and treatment with uridine.

Mol Genet Metab. 2016 Sep;119(1-2):83-90. doi: 10.1016/j.ymgme.2016.06.008. Epub 2016 Jun 14.

Identification of KMT2D and KDM6A variants by targeted sequencing from patients with Kabuki syndrome and other congenital disorders.

Gene. 2020 Mar 20;731:144360. doi: 10.1016/j.gene.2020.144360. Epub 2020 Jan 11.

Mutation spectrum of MLL2 in a cohort of Kabuki syndrome patients.

Orphanet J Rare Dis. 2011 Jun 9;6:38. doi: 10.1186/1750-1172-6-38.

BioBin: a bioinformatics tool for automating the binning of rare variants using publicly available biological knowledge.

BMC Med Genomics. 2013;6 Suppl 2(Suppl 2):S6. doi: 10.1186/1755-8794-6-S2-S6. Epub 2013 May 7.

引用本文的文献

MethPhaser: methylation-based long-read haplotype phasing of human genomes.

Nat Commun. 2024 Jun 22;15(1):5327. doi: 10.1038/s41467-024-49588-0.

Cauchy combination methods for the detection of gene-environment interactions for rare variants related to quantitative phenotypes.

Heredity (Edinb). 2023 Oct;131(4):241-252. doi: 10.1038/s41437-023-00640-7. Epub 2023 Jul 22.

Personalized structural biology reveals the molecular mechanisms underlying heterogeneous epileptic phenotypes caused by KCNC2 variants.

HGG Adv. 2022 Jul 19;3(4):100131. doi: 10.1016/j.xhgg.2022.100131. eCollection 2022 Oct 13.

Identifying digenic disease genes via machine learning in the Undiagnosed Diseases Network.

Am J Hum Genet. 2021 Oct 7;108(10):1946-1963. doi: 10.1016/j.ajhg.2021.08.010. Epub 2021 Sep 15.

A unified method for rare variant analysis of gene-environment interactions.

Stat Med. 2020 Mar 15;39(6):801-813. doi: 10.1002/sim.8446. Epub 2019 Dec 4.

metaFARVAT: An Efficient Tool for Meta-Analysis of Family-Based, Case-Control, and Population-Based Rare Variant Association Studies.

Front Genet. 2019 Jun 19;10:572. doi: 10.3389/fgene.2019.00572. eCollection 2019.

Inferring disease risk genes from sequencing data in multiplex pedigrees through sharing of rare variants.

Genet Epidemiol. 2019 Feb;43(1):37-49. doi: 10.1002/gepi.22155. Epub 2018 Sep 24.

Gene-based segregation method for identifying rare variants in family-based sequencing studies.

Genet Epidemiol. 2017 May;41(4):309-319. doi: 10.1002/gepi.22037. Epub 2017 Feb 13.

FamPipe: An Automatic Analysis Pipeline for Analyzing Sequencing Data in Families for Disease Studies.

PLoS Comput Biol. 2016 Jun 6;12(6):e1004980. doi: 10.1371/journal.pcbi.1004980. eCollection 2016 Jun.

Beyond Rare-Variant Association Testing: Pinpointing Rare Causal Variants in Case-Control Sequencing Study.

Sci Rep. 2016 Feb 23;6:21824. doi: 10.1038/srep21824.

本文引用的文献

Study designs for identification of rare disease variants in complex diseases: the utility of family-based designs.

Genetics. 2011 Nov;189(3):1061-8. doi: 10.1534/genetics.111.131813. Epub 2011 Aug 11.

Rare-variant association testing for sequencing data with the sequence kernel association test.

Am J Hum Genet. 2011 Jul 15;89(1):82-93. doi: 10.1016/j.ajhg.2011.05.029. Epub 2011 Jul 7.

Genomic contributions to Mendelian disease.

Genome Res. 2011 May;21(5):643-4. doi: 10.1101/gr.123554.111.

Testing for an unusual distribution of rare variants.

PLoS Genet. 2011 Mar;7(3):e1001322. doi: 10.1371/journal.pgen.1001322. Epub 2011 Mar 3.

A new testing strategy to identify rare variants with either risk or protective effect on disease.

PLoS Genet. 2011 Feb 3;7(2):e1001289. doi: 10.1371/journal.pgen.1001289.

An evolutionary framework for association testing in resequencing studies.

PLoS Genet. 2010 Nov 11;6(11):e1001202. doi: 10.1371/journal.pgen.1001202.

A novel adaptive method for the analysis of next-generation sequencing data to detect complex trait associations with rare variants due to gene main effects and interactions.

PLoS Genet. 2010 Oct 14;6(10):e1001156. doi: 10.1371/journal.pgen.1001156.

A covering method for detecting genetic associations between rare variants and common phenotypes.

PLoS Comput Biol. 2010 Oct 14;6(10):e1000954. doi: 10.1371/journal.pcbi.1000954.

Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome.

Nat Genet. 2010 Sep;42(9):790-3. doi: 10.1038/ng.646. Epub 2010 Aug 15.

The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用序列数据在孟德尔疾病中寻找疾病变异：方法与应用。

Finding disease variants in Mendelian disorders by using sequence data: methods and applications.

机构信息

Department of Biostatistics, Columbia University, New York, NY 10032, USA.

出版信息

Am J Hum Genet. 2011 Dec 9;89(6):701-12. doi: 10.1016/j.ajhg.2011.11.003. Epub 2011 Dec 1.

DOI:10.1016/j.ajhg.2011.11.003

PMID:22137099

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3234377/

Abstract

摘要

利用序列数据在孟德尔疾病中寻找疾病变异：方法与应用。

Finding disease variants in Mendelian disorders by using sequence data: methods and applications.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

利用序列数据在孟德尔疾病中寻找疾病变异：方法与应用。

Finding disease variants in Mendelian disorders by using sequence data: methods and applications.

机构信息

出版信息