人类基因组中剪接改变单核苷酸变异的计算机模拟预测

In silico prediction of splice-altering single nucleotide variants in the human genome.

作者信息

Jian Xueqiu, Boerwinkle Eric, Liu Xiaoming

出版信息

Nucleic Acids Res. 2014 Dec 16;42(22):13534-44. doi: 10.1093/nar/gku1206.

DOI:10.1093/nar/gku1206

PMID:25416802

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4267638/

Abstract

In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies.

摘要

已开发出计算机工具来预测可能影响前体mRNA剪接的变异。将这些工具应用于基础研究和临床实践的主要限制在于难以解读其输出结果。大多数工具仅根据DNA序列预测潜在的剪接位点，而不测量变异导致的剪接信号变化。另一个限制是缺乏对这些工具的大规模评估研究。我们使用受试者工作特征分析，在剪接共有区域内的2959个单核苷酸变异（scSNV）上比较了八种计算机工具。位置权重矩阵模型和最大熵扫描的表现优于其他方法。两种集成学习方法，即自适应增强和随机森林，被用于构建利用个体方法优势的模型。这两种模型都进一步提高了预测能力，其输出为可直接解读的预测分数。我们将我们的集成分数应用于来自癌症体细胞突变目录数据库的scSNV。分析表明，预测的剪接改变scSNV在复发性scSNV和已知癌症基因中富集。我们预先计算了全人类基因组中所有潜在scSNV的集成分数，为识别从大规模测序研究中发现的剪接改变scSNV提供了一个全基因组水平的资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/675e/4267638/cbeb45ded261/gku1206fig1.jpg

相似文献

In silico prediction of splice-altering single nucleotide variants in the human genome.

Nucleic Acids Res. 2014 Dec 16;42(22):13534-44. doi: 10.1093/nar/gku1206.

Combining genetic constraint with predictions of alternative splicing to prioritize deleterious splicing in rare disease studies.

BMC Bioinformatics. 2022 Nov 14;23(1):482. doi: 10.1186/s12859-022-05041-x.

Thorough in silico and in vitro cDNA analysis of 21 putative BRCA1 and BRCA2 splice variants and a complex tandem duplication in BRCA2 allowing the identification of activated cryptic splice donor sites in BRCA2 exon 11.

Hum Mutat. 2018 Apr;39(4):515-526. doi: 10.1002/humu.23390. Epub 2018 Jan 22.

Comparison of Tools for Splice-Altering Variant Prediction Using Established Spliceogenic Variants: An End-User's Point of View.

Int J Genomics. 2022 Oct 13;2022:5265686. doi: 10.1155/2022/5265686. eCollection 2022.

Analysis of 30 putative BRCA1 splicing mutations in hereditary breast and ovarian cancer families identifies exonic splice site mutations that escape in silico prediction.

PLoS One. 2012;7(12):e50800. doi: 10.1371/journal.pone.0050800. Epub 2012 Dec 11.

SpliceVarDB: A comprehensive database of experimentally validated human splicing variants.

Am J Hum Genet. 2024 Oct 3;111(10):2164-2175. doi: 10.1016/j.ajhg.2024.08.002. Epub 2024 Sep 2.

Reference-informed prediction of alternative splicing and splicing-altering mutations from sequences.

Genome Res. 2024 Aug 20;34(7):1052-1065. doi: 10.1101/gr.279044.124.

Read-Split-Run: an improved bioinformatics pipeline for identification of genome-wide non-canonical spliced regions using RNA-Seq data.

BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):503. doi: 10.1186/s12864-016-2896-7.

Exon first nucleotide mutations in splicing: evaluation of in silico prediction tools.

PLoS One. 2014 Feb 21;9(2):e89570. doi: 10.1371/journal.pone.0089570. eCollection 2014.

Identification of alternative 5'/3' splice sites based on the mechanism of splice site competition.

Nucleic Acids Res. 2006;34(21):6305-13. doi: 10.1093/nar/gkl900. Epub 2006 Nov 10.

引用本文的文献

Hidden in the Genome: The First Italian Family with North Carolina Macular Dystrophy Carrying a Novel and Duplication.

Biomedicines. 2025 Aug 5;13(8):1904. doi: 10.3390/biomedicines13081904.

Comprehensive genotype-phenotype analysis in POLR3-related disorders.

HGG Adv. 2025 Jul 18;6(4):100481. doi: 10.1016/j.xhgg.2025.100481.

Exome sequencing of patients with syndromic tall stature reveals four novel candidate genes.

Endocr Connect. 2025 Jul 15;14(7). doi: 10.1530/EC-25-0137. Print 2025 Jul 1.

Whole genome sequencing and single-cell transcriptomics identify KMT2D inactivation as a potential new driver for pituitary tumors: a case report.

BJC Rep. 2025 Jun 16;3(1):43. doi: 10.1038/s44276-025-00155-0.

Cracking rare disorders: a new minimally invasive RNA-seq protocol.

NPJ Genom Med. 2025 May 28;10(1):45. doi: 10.1038/s41525-025-00502-7.

Uncovering a Novel Pathogenic Mechanism of in Mitochondrial Disorders: Insights from Functional Studies on the c.38A>G Variant.

Int J Mol Sci. 2025 Apr 12;26(8):3670. doi: 10.3390/ijms26083670.

Identification of new families and variants in autosomal dominant macular dystrophy associated with THRB.

Sci Rep. 2025 Apr 28;15(1):14904. doi: 10.1038/s41598-025-97768-9.

UDP-glucose dehydrogenase variants cause dystroglycanopathy.

Ann Clin Transl Neurol. 2025 Jun;12(6):1302-1308. doi: 10.1002/acn3.70002. Epub 2025 Apr 17.

Deleterious variants in intolerant genes reveal new candidates for self-limited delayed puberty.

Eur J Endocrinol. 2025 Mar 27;192(4):481-490. doi: 10.1093/ejendo/lvaf061.

Deciphering the Genetic Basis of Degenerative and Developmental Eye Disorders in 50 Pakistani Consanguineous Families Using Whole-Exome Sequencing.

Int J Mol Sci. 2025 Mar 18;26(6):2715. doi: 10.3390/ijms26062715.

本文引用的文献

Validation of predicted mRNA splicing mutations using high-throughput transcriptome data.

F1000Res. 2014 Jan 13;3:8. doi: 10.12688/f1000research.3-8.v2. eCollection 2014.

LaSSO, a strategy for genome-wide mapping of intronic lariats and branch points using RNA-seq.

Genome Res. 2014 Jul;24(7):1169-79. doi: 10.1101/gr.166819.113. Epub 2014 Apr 7.

Synonymous mutations frequently act as driver mutations in human cancers.

Cell. 2014 Mar 13;156(6):1324-1335. doi: 10.1016/j.cell.2014.01.051.

Identification of novel point mutations in splicing sites integrating whole-exome and RNA-seq data in myeloproliferative diseases.

Mol Genet Genomic Med. 2013 Nov;1(4):246-59. doi: 10.1002/mgg3.23. Epub 2013 Jul 7.

A general framework for estimating the relative pathogenicity of human genetic variants.

Nat Genet. 2014 Mar;46(3):310-5. doi: 10.1038/ng.2892. Epub 2014 Feb 2.

MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing.

Genome Biol. 2014 Jan 13;15(1):R19. doi: 10.1186/gb-2014-15-1-r19.

Ensembl 2014.

Nucleic Acids Res. 2014 Jan;42(Database issue):D749-55. doi: 10.1093/nar/gkt1196. Epub 2013 Dec 6.

In silico tools for splicing defect prediction: a survey from the viewpoint of end users.

Genet Med. 2014 Jul;16(7):497-503. doi: 10.1038/gim.2013.176. Epub 2013 Nov 21.

RefSeq: an update on mammalian reference sequences.

Nucleic Acids Res. 2014 Jan;42(Database issue):D756-63. doi: 10.1093/nar/gkt1114. Epub 2013 Nov 19.

dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations.

Hum Mutat. 2013 Sep;34(9):E2393-402. doi: 10.1002/humu.22376. Epub 2013 Jul 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

人类基因组中剪接改变单核苷酸变异的计算机模拟预测

In silico prediction of splice-altering single nucleotide variants in the human genome.

作者信息

Jian Xueqiu, Boerwinkle Eric, Liu Xiaoming

出版信息

Nucleic Acids Res. 2014 Dec 16;42(22):13534-44. doi: 10.1093/nar/gku1206.

DOI:10.1093/nar/gku1206

PMID:25416802

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4267638/

Abstract

摘要

人类基因组中剪接改变单核苷酸变异的计算机模拟预测

In silico prediction of splice-altering single nucleotide variants in the human genome.

作者信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

人类基因组中剪接改变单核苷酸变异的计算机模拟预测

In silico prediction of splice-altering single nucleotide variants in the human genome.

作者信息

出版信息

相似文献

引用本文的文献

本文引用的文献