预测遗传变异导致的基因结构变化：基于外显子定义特征。

Predicting gene structure changes resulting from genetic variants via exon definition features.

机构信息

Program in Computational Biology and Bioinformatics, Duke University, Durham, NC, USA.

Center for Genomic and Computational Biology, Duke University Medical School, Durham, NC, USA.

出版信息

Bioinformatics. 2018 Nov 1;34(21):3616-3623. doi: 10.1093/bioinformatics/bty324.

DOI:10.1093/bioinformatics/bty324

PMID:29701825

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6198862/

Abstract

MOTIVATION

Genetic variation that disrupts gene function by altering gene splicing between individuals can substantially influence traits and disease. In those cases, accurately predicting the effects of genetic variation on splicing can be highly valuable for investigating the mechanisms underlying those traits and diseases. While methods have been developed to generate high quality computational predictions of gene structures in reference genomes, the same methods perform poorly when used to predict the potentially deleterious effects of genetic changes that alter gene splicing between individuals. Underlying that discrepancy in predictive ability are the common assumptions by reference gene finding algorithms that genes are conserved, well-formed and produce functional proteins.

RESULTS

We describe a probabilistic approach for predicting recent changes to gene structure that may or may not conserve function. The model is applicable to both coding and non-coding genes, and can be trained on existing gene annotations without requiring curated examples of aberrant splicing. We apply this model to the problem of predicting altered splicing patterns in the genomes of individual humans, and we demonstrate that performing gene-structure prediction without relying on conserved coding features is feasible. The model predicts an unexpected abundance of variants that create de novo splice sites, an observation supported by both simulations and empirical data from RNA-seq experiments. While these de novo splice variants are commonly misinterpreted by other tools as coding or non-coding variants of little or no effect, we find that in some cases they can have large effects on splicing activity and protein products and we propose that they may commonly act as cryptic factors in disease.

AVAILABILITY AND IMPLEMENTATION

The software is available from geneprediction.org/SGRF.

SUPPLEMENTARY INFORMATION

Supplementary information is available at Bioinformatics online.

摘要

动机

通过改变个体间基因剪接来破坏基因功能的遗传变异，可以显著影响性状和疾病。在这种情况下，准确预测遗传变异对剪接的影响对于研究这些性状和疾病的潜在机制非常有价值。虽然已经开发出了用于生成参考基因组中基因结构的高质量计算预测的方法，但当用于预测改变个体间基因剪接的遗传变化的潜在有害影响时，这些方法的性能就很差。导致预测能力差异的原因是参考基因发现算法的常见假设，即基因是保守的、结构良好的，并产生功能性蛋白质。

结果

我们描述了一种预测基因结构最近变化的概率方法，这些变化可能保留功能，也可能不保留功能。该模型适用于编码和非编码基因，并且可以在不依赖异常剪接的 curated 示例的情况下，在现有基因注释上进行训练。我们将该模型应用于个体人类基因组中改变剪接模式的预测问题，并证明不依赖保守编码特征进行基因结构预测是可行的。该模型预测了大量创建新剪接位点的变体，这一观察结果得到了模拟和来自 RNA-seq 实验的经验数据的支持。虽然这些新剪接变体通常被其他工具错误地解释为编码或非编码变体，对功能影响很小或没有，但我们发现，在某些情况下，它们对剪接活性和蛋白质产物有很大的影响，我们提出它们可能通常作为疾病中的隐匿因子。

可用性和实现

软件可从 geneprediction.org/SGRF 获取。

补充信息

补充信息可在 Bioinformatics 在线获取。

相似文献

Predicting gene structure changes resulting from genetic variants via exon definition features.

Bioinformatics. 2018 Nov 1;34(21):3616-3623. doi: 10.1093/bioinformatics/bty324.

Computational discovery of human coding and non-coding transcripts with conserved splice sites.

Bioinformatics. 2011 Jul 15;27(14):1894-900. doi: 10.1093/bioinformatics/btr314. Epub 2011 May 26.

Combining genetic constraint with predictions of alternative splicing to prioritize deleterious splicing in rare disease studies.

BMC Bioinformatics. 2022 Nov 14;23(1):482. doi: 10.1186/s12859-022-05041-x.

High-throughput interpretation of gene structure changes in human and nonhuman resequencing data, using ACE.

Bioinformatics. 2017 May 15;33(10):1437-1446. doi: 10.1093/bioinformatics/btw799.

SNPlice: variants that modulate Intron retention from RNA-sequencing data.

Bioinformatics. 2015 Apr 15;31(8):1191-8. doi: 10.1093/bioinformatics/btu804. Epub 2014 Dec 6.

Spliceogen: an integrative, scalable tool for the discovery of splice-altering variants.

Bioinformatics. 2019 Nov 1;35(21):4405-4407. doi: 10.1093/bioinformatics/btz263.

ChopStitch: exon annotation and splice graph construction using transcriptome assembly and whole genome sequencing data.

Bioinformatics. 2018 May 15;34(10):1697-1704. doi: 10.1093/bioinformatics/btx839.

Functional analysis of a large set of BRCA2 exon 7 variants highlights the predictive value of hexamer scores in detecting alterations of exonic splicing regulatory elements.

Hum Mutat. 2013 Nov;34(11):1547-57. doi: 10.1002/humu.22428. Epub 2013 Sep 18.

Prediction of mutant mRNA splice isoforms by information theory-based exon definition.

Hum Mutat. 2013 Apr;34(4):557-65. doi: 10.1002/humu.22277. Epub 2013 Feb 21.

Interpretable prioritization of splice variants in diagnostic next-generation sequencing.

Am J Hum Genet. 2021 Sep 2;108(9):1564-1577. doi: 10.1016/j.ajhg.2021.06.014. Epub 2021 Jul 21.

引用本文的文献

Polymorphism rs259983 of the gene is associated with the risk of anemia in pregnant women with gestational diabetes.

Egypt J Med Hum Genet. 2025;26(1):94. doi: 10.1186/s43042-025-00723-6. Epub 2025 Jun 3.

Genome-Wide Identification of the Gene Family in Kiwifruit and Regulatory Role of for Chlorophyll a Content.

Int J Mol Sci. 2022 Jun 10;23(12):6528. doi: 10.3390/ijms23126528.

Bayesian estimation of genetic regulatory effects in high-throughput reporter assays.

Bioinformatics. 2020 Jan 15;36(2):331-338. doi: 10.1093/bioinformatics/btz545.

Assessing cell-specific effects of genetic variations using tRNA microarrays.

BMC Genomics. 2019 Jul 16;20(Suppl 8):549. doi: 10.1186/s12864-019-5864-1.

本文引用的文献

High-throughput interpretation of gene structure changes in human and nonhuman resequencing data, using ACE.

Bioinformatics. 2017 May 15;33(10):1437-1446. doi: 10.1093/bioinformatics/btw799.

Araport11: a complete reannotation of the Arabidopsis thaliana reference genome.

Plant J. 2017 Feb;89(4):789-804. doi: 10.1111/tpj.13415. Epub 2017 Feb 10.

The Ensembl Variant Effect Predictor.

Genome Biol. 2016 Jun 6;17(1):122. doi: 10.1186/s13059-016-0974-4.

Learning the sequence determinants of alternative splicing from millions of random sequences.

Cell. 2015 Oct 22;163(3):698-711. doi: 10.1016/j.cell.2015.09.054.

A global reference for human genetic variation.

Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.

SRSF1 and hnRNP H antagonistically regulate splicing of COLQ exon 16 in a congenital myasthenic syndrome.

Sci Rep. 2015 Aug 18;5:13208. doi: 10.1038/srep13208.

Widespread alternative and aberrant splicing revealed by lariat sequencing.

Nucleic Acids Res. 2015 Sep 30;43(17):8488-501. doi: 10.1093/nar/gkv763. Epub 2015 Aug 10.

Human genomics. The human transcriptome across tissues and individuals.

Science. 2015 May 8;348(6235):660-5. doi: 10.1126/science.aaa0355.

RNA. Prescribing splicing.

Science. 2015 Jan 9;347(6218):124-5. doi: 10.1126/science.aaa4864.

RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.

Science. 2015 Jan 9;347(6218):1254806. doi: 10.1126/science.1254806. Epub 2014 Dec 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

预测遗传变异导致的基因结构变化：基于外显子定义特征。

Predicting gene structure changes resulting from genetic variants via exon definition features.

机构信息

Program in Computational Biology and Bioinformatics, Duke University, Durham, NC, USA.

Center for Genomic and Computational Biology, Duke University Medical School, Durham, NC, USA.

出版信息

Bioinformatics. 2018 Nov 1;34(21):3616-3623. doi: 10.1093/bioinformatics/bty324.

DOI:10.1093/bioinformatics/bty324

PMID:29701825

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6198862/

Abstract

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

The software is available from geneprediction.org/SGRF.

SUPPLEMENTARY INFORMATION

Supplementary information is available at Bioinformatics online.

摘要

动机

结果

可用性和实现

软件可从 geneprediction.org/SGRF 获取。

补充信息

补充信息可在 Bioinformatics 在线获取。

预测遗传变异导致的基因结构变化：基于外显子定义特征。

Predicting gene structure changes resulting from genetic variants via exon definition features.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现

补充信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

预测遗传变异导致的基因结构变化：基于外显子定义特征。

Predicting gene structure changes resulting from genetic variants via exon definition features.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现

补充信息