基于带有基因特征的进化模型对基因约束进行贝叶斯估计。

Bayesian estimation of gene constraint from an evolutionary model with gene features.

作者信息

Zeng Tony, Spence Jeffrey P, Mostafavi Hakhamanesh, Pritchard Jonathan K

机构信息

Department of Genetics, Stanford University, Stanford CA.

Department of Biology, Stanford University, Stanford CA.

出版信息

bioRxiv. 2024 Apr 10:2023.05.19.541520. doi: 10.1101/2023.05.19.541520.

DOI:10.1101/2023.05.19.541520

PMID:37292653

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10245655/

Abstract

Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely-used metrics are severely underpowered at detecting constraint for the shortest ∼25% of genes, potentially causing important pathogenic mutations to be overlooked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, . Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease, and other phenotypes, especially for short genes. Our new estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve estimation of many gene-level properties, such as rare variant burden or gene expression differences.

摘要

对基因的选择性约束测量已被用于许多应用，包括罕见编码变异的临床解读、疾病基因发现以及基因组进化研究。然而，广泛使用的指标在检测最短约25%的基因的约束方面能力严重不足，可能导致重要的致病突变被忽视。我们开发了一个框架，将群体遗传学模型与基于基因特征的机器学习相结合，以实现对一个可解释的约束指标的准确推断。我们的估计在对细胞必需性、人类疾病和其他表型重要的基因进行优先级排序方面优于现有指标，特别是对于短基因。我们对选择性约束的新估计在表征与人类疾病相关的基因方面应具有广泛的用途。最后，我们的推断框架GeneBayes提供了一个灵活的平台，可以改进对许多基因水平特性的估计，如罕见变异负担或基因表达差异。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40e4/11017861/2181a4329650/nihpp-2023.05.19.541520v2-f0001.jpg

相似文献

Bayesian estimation of gene constraint from an evolutionary model with gene features.

bioRxiv. 2024 Apr 10:2023.05.19.541520. doi: 10.1101/2023.05.19.541520.

Bayesian estimation of gene constraint from an evolutionary model with gene features.

Res Sq. 2023 Jun 13:rs.3.rs-3012879. doi: 10.21203/rs.3.rs-3012879/v1.

Bayesian estimation of gene constraint from an evolutionary model with gene features.

Nat Genet. 2024 Aug;56(8):1632-1643. doi: 10.1038/s41588-024-01820-9. Epub 2024 Jul 8.

Genetic constraint at single amino acid resolution in protein domains improves missense variant prioritisation and gene discovery.

Genome Med. 2024 Jul 11;16(1):88. doi: 10.1186/s13073-024-01358-9.

Combining genetic constraint with predictions of alternative splicing to prioritize deleterious splicing in rare disease studies.

BMC Bioinformatics. 2022 Nov 14;23(1):482. doi: 10.1186/s12859-022-05041-x.

Genic constraint against nonsynonymous variation across the mouse genome.

BMC Genomics. 2023 Sep 22;24(1):562. doi: 10.1186/s12864-023-09637-2.

Quantifying constraint in the human mitochondrial genome.

Nature. 2024 Nov;635(8038):390-397. doi: 10.1038/s41586-024-08048-x. Epub 2024 Oct 16.

and human gene essentiality estimations capture contrasting functional constraints.

NAR Genom Bioinform. 2021 Jul 13;3(3):lqab063. doi: 10.1093/nargab/lqab063. eCollection 2021 Sep.

Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity.

Genome Med. 2015 Jan 28;7(1):5. doi: 10.1186/s13073-014-0120-4. eCollection 2015.

Cardiovascular Disease Pathogenicity Predictor (CVD-PP): A Tissue-Specific In Silico Tool for Discriminating Pathogenicity of Variants of Unknown Significance in Cardiovascular Disease Genes.

Circ Genom Precis Med. 2024 Dec;17(6):e004464. doi: 10.1161/CIRCGEN.123.004464. Epub 2024 Oct 29.

本文引用的文献

Differences in 5'untranslated regions highlight the importance of translational regulation of dosage sensitive genes.

Genome Biol. 2024 Apr 29;25(1):111. doi: 10.1186/s13059-024-03248-0.

A genomic mutational constraint map using variation in 76,156 human genomes.

Nature. 2024 Jan;625(7993):92-100. doi: 10.1038/s41586-023-06045-0. Epub 2023 Dec 6.

Systematic differences in discovery of genetic effects on gene expression and complex traits.

Nat Genet. 2023 Nov;55(11):1866-1875. doi: 10.1038/s41588-023-01529-1. Epub 2023 Oct 19.

Scaling the discrete-time Wright-Fisher model to biobank-scale datasets.

Genetics. 2023 Nov 1;225(3). doi: 10.1093/genetics/iyad168.

An unsupervised deep learning framework for predicting human essential genes from population and functional genomic data.

BMC Bioinformatics. 2023 Sep 18;24(1):347. doi: 10.1186/s12859-023-05481-z.

Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases.

Nat Genet. 2023 Aug;55(8):1267-1276. doi: 10.1038/s41588-023-01443-6. Epub 2023 Jul 13.

An empirical Bayes method for differential expression analysis of single cells with deep generative models.

Proc Natl Acad Sci U S A. 2023 May 23;120(21):e2209124120. doi: 10.1073/pnas.2209124120. Epub 2023 May 16.

Leveraging base-pair mammalian constraint to understand genetic variation and human disease.

Science. 2023 Apr 28;380(6643):eabn2937. doi: 10.1126/science.abn2937.

Genomic Diagnosis of Rare Pediatric Disease in the United Kingdom and Ireland.

N Engl J Med. 2023 Apr 27;388(17):1559-1571. doi: 10.1056/NEJMoa2209046. Epub 2023 Apr 12.

Network expansion of genetic associations defines a pleiotropy map of human cell biology.

Nat Genet. 2023 Mar;55(3):389-398. doi: 10.1038/s41588-023-01327-9. Epub 2023 Feb 23.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于带有基因特征的进化模型对基因约束进行贝叶斯估计。

Bayesian estimation of gene constraint from an evolutionary model with gene features.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献