• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类基因组中剪接改变单核苷酸变异的计算机模拟预测

In silico prediction of splice-altering single nucleotide variants in the human genome.

作者信息

Jian Xueqiu, Boerwinkle Eric, Liu Xiaoming

出版信息

Nucleic Acids Res. 2014 Dec 16;42(22):13534-44. doi: 10.1093/nar/gku1206.

DOI:10.1093/nar/gku1206
PMID:25416802
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4267638/
Abstract

In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies.

摘要

已开发出计算机工具来预测可能影响前体mRNA剪接的变异。将这些工具应用于基础研究和临床实践的主要限制在于难以解读其输出结果。大多数工具仅根据DNA序列预测潜在的剪接位点,而不测量变异导致的剪接信号变化。另一个限制是缺乏对这些工具的大规模评估研究。我们使用受试者工作特征分析,在剪接共有区域内的2959个单核苷酸变异(scSNV)上比较了八种计算机工具。位置权重矩阵模型和最大熵扫描的表现优于其他方法。两种集成学习方法,即自适应增强和随机森林,被用于构建利用个体方法优势的模型。这两种模型都进一步提高了预测能力,其输出为可直接解读的预测分数。我们将我们的集成分数应用于来自癌症体细胞突变目录数据库的scSNV。分析表明,预测的剪接改变scSNV在复发性scSNV和已知癌症基因中富集。我们预先计算了全人类基因组中所有潜在scSNV的集成分数,为识别从大规模测序研究中发现的剪接改变scSNV提供了一个全基因组水平的资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/675e/4267638/e0ca0d8a265e/gku1206fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/675e/4267638/cbeb45ded261/gku1206fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/675e/4267638/82102ec93ce9/gku1206fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/675e/4267638/e0ca0d8a265e/gku1206fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/675e/4267638/cbeb45ded261/gku1206fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/675e/4267638/82102ec93ce9/gku1206fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/675e/4267638/e0ca0d8a265e/gku1206fig3.jpg

相似文献

1
In silico prediction of splice-altering single nucleotide variants in the human genome.人类基因组中剪接改变单核苷酸变异的计算机模拟预测
Nucleic Acids Res. 2014 Dec 16;42(22):13534-44. doi: 10.1093/nar/gku1206.
2
Combining genetic constraint with predictions of alternative splicing to prioritize deleterious splicing in rare disease studies.结合遗传约束和选择性剪接预测,优先考虑罕见病研究中的有害剪接。
BMC Bioinformatics. 2022 Nov 14;23(1):482. doi: 10.1186/s12859-022-05041-x.
3
Thorough in silico and in vitro cDNA analysis of 21 putative BRCA1 and BRCA2 splice variants and a complex tandem duplication in BRCA2 allowing the identification of activated cryptic splice donor sites in BRCA2 exon 11.对 21 个推定的 BRCA1 和 BRCA2 剪接变体和 BRCA2 中的复杂串联重复进行了深入的计算机分析和体外 cDNA 分析,从而鉴定出 BRCA2 外显子 11 中激活的隐性剪接供体位点。
Hum Mutat. 2018 Apr;39(4):515-526. doi: 10.1002/humu.23390. Epub 2018 Jan 22.
4
Comparison of Tools for Splice-Altering Variant Prediction Using Established Spliceogenic Variants: An End-User's Point of View.使用已确定的剪接变异体进行剪接改变变异预测工具的比较:终端用户视角
Int J Genomics. 2022 Oct 13;2022:5265686. doi: 10.1155/2022/5265686. eCollection 2022.
5
Analysis of 30 putative BRCA1 splicing mutations in hereditary breast and ovarian cancer families identifies exonic splice site mutations that escape in silico prediction.对 30 个遗传性乳腺癌和卵巢癌家族的假定 BRCA1 剪接突变进行分析,确定了逃避计算机预测的外显子剪接突变。
PLoS One. 2012;7(12):e50800. doi: 10.1371/journal.pone.0050800. Epub 2012 Dec 11.
6
SpliceVarDB: A comprehensive database of experimentally validated human splicing variants.SpliceVarDB:一个全面的人类剪接变异实验验证数据库。
Am J Hum Genet. 2024 Oct 3;111(10):2164-2175. doi: 10.1016/j.ajhg.2024.08.002. Epub 2024 Sep 2.
7
Reference-informed prediction of alternative splicing and splicing-altering mutations from sequences.基于参考的序列预测剪接和剪接改变突变。
Genome Res. 2024 Aug 20;34(7):1052-1065. doi: 10.1101/gr.279044.124.
8
Read-Split-Run: an improved bioinformatics pipeline for identification of genome-wide non-canonical spliced regions using RNA-Seq data.读取-分割-运行:一种利用RNA测序数据识别全基因组非经典剪接区域的改进型生物信息学流程。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):503. doi: 10.1186/s12864-016-2896-7.
9
Exon first nucleotide mutations in splicing: evaluation of in silico prediction tools.剪接中外显子首个核苷酸突变:计算机预测工具的评估
PLoS One. 2014 Feb 21;9(2):e89570. doi: 10.1371/journal.pone.0089570. eCollection 2014.
10
Identification of alternative 5'/3' splice sites based on the mechanism of splice site competition.基于剪接位点竞争机制鉴定可变5'/3'剪接位点
Nucleic Acids Res. 2006;34(21):6305-13. doi: 10.1093/nar/gkl900. Epub 2006 Nov 10.

引用本文的文献

1
Hidden in the Genome: The First Italian Family with North Carolina Macular Dystrophy Carrying a Novel and Duplication.隐藏于基因组中:首个携带新型重复突变的患北卡罗来纳黄斑营养不良的意大利家族
Biomedicines. 2025 Aug 5;13(8):1904. doi: 10.3390/biomedicines13081904.
2
Comprehensive genotype-phenotype analysis in POLR3-related disorders.POLR3相关疾病的综合基因型-表型分析
HGG Adv. 2025 Jul 18;6(4):100481. doi: 10.1016/j.xhgg.2025.100481.
3
Exome sequencing of patients with syndromic tall stature reveals four novel candidate genes.

本文引用的文献

1
Validation of predicted mRNA splicing mutations using high-throughput transcriptome data.利用高通量转录组数据验证预测的mRNA剪接突变
F1000Res. 2014 Jan 13;3:8. doi: 10.12688/f1000research.3-8.v2. eCollection 2014.
2
LaSSO, a strategy for genome-wide mapping of intronic lariats and branch points using RNA-seq.LaSSO,一种利用RNA测序对内含子套索和分支点进行全基因组图谱绘制的策略。
Genome Res. 2014 Jul;24(7):1169-79. doi: 10.1101/gr.166819.113. Epub 2014 Apr 7.
3
Synonymous mutations frequently act as driver mutations in human cancers.
对综合征性身材高大患者进行外显子组测序发现了四个新的候选基因。
Endocr Connect. 2025 Jul 15;14(7). doi: 10.1530/EC-25-0137. Print 2025 Jul 1.
4
Whole genome sequencing and single-cell transcriptomics identify KMT2D inactivation as a potential new driver for pituitary tumors: a case report.全基因组测序和单细胞转录组学鉴定KMT2D失活为垂体瘤的潜在新驱动因素:病例报告
BJC Rep. 2025 Jun 16;3(1):43. doi: 10.1038/s44276-025-00155-0.
5
Cracking rare disorders: a new minimally invasive RNA-seq protocol.攻克罕见疾病:一种新的微创RNA测序方案
NPJ Genom Med. 2025 May 28;10(1):45. doi: 10.1038/s41525-025-00502-7.
6
Uncovering a Novel Pathogenic Mechanism of in Mitochondrial Disorders: Insights from Functional Studies on the c.38A>G Variant.揭示线粒体疾病中的一种新致病机制:对c.38A>G变异体功能研究的见解
Int J Mol Sci. 2025 Apr 12;26(8):3670. doi: 10.3390/ijms26083670.
7
Identification of new families and variants in autosomal dominant macular dystrophy associated with THRB.与甲状腺激素受体β(THRB)相关的常染色体显性黄斑营养不良新家族及变异体的鉴定
Sci Rep. 2025 Apr 28;15(1):14904. doi: 10.1038/s41598-025-97768-9.
8
UDP-glucose dehydrogenase variants cause dystroglycanopathy.尿苷二磷酸葡萄糖脱氢酶变体导致糖基化肌营养不良症。
Ann Clin Transl Neurol. 2025 Jun;12(6):1302-1308. doi: 10.1002/acn3.70002. Epub 2025 Apr 17.
9
Deleterious variants in intolerant genes reveal new candidates for self-limited delayed puberty.不耐受基因中的有害变异揭示了自限性青春期延迟的新候选因素。
Eur J Endocrinol. 2025 Mar 27;192(4):481-490. doi: 10.1093/ejendo/lvaf061.
10
Deciphering the Genetic Basis of Degenerative and Developmental Eye Disorders in 50 Pakistani Consanguineous Families Using Whole-Exome Sequencing.利用全外显子组测序解析50个巴基斯坦近亲家庭中退行性和发育性眼部疾病的遗传基础。
Int J Mol Sci. 2025 Mar 18;26(6):2715. doi: 10.3390/ijms26062715.
同义突变经常在人类癌症中充当驱动突变。
Cell. 2014 Mar 13;156(6):1324-1335. doi: 10.1016/j.cell.2014.01.051.
4
Identification of novel point mutations in splicing sites integrating whole-exome and RNA-seq data in myeloproliferative diseases.在骨髓增殖性疾病中整合全外显子组和 RNA-seq 数据鉴定剪接位点的新型点突变。
Mol Genet Genomic Med. 2013 Nov;1(4):246-59. doi: 10.1002/mgg3.23. Epub 2013 Jul 7.
5
A general framework for estimating the relative pathogenicity of human genetic variants.一种用于估计人类遗传变异相对致病性的通用框架。
Nat Genet. 2014 Mar;46(3):310-5. doi: 10.1038/ng.2892. Epub 2014 Feb 2.
6
MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing.MutPred剪接:基于机器学习预测破坏剪接的外显子变体。
Genome Biol. 2014 Jan 13;15(1):R19. doi: 10.1186/gb-2014-15-1-r19.
7
Ensembl 2014.Ensembl 2014.
Nucleic Acids Res. 2014 Jan;42(Database issue):D749-55. doi: 10.1093/nar/gkt1196. Epub 2013 Dec 6.
8
In silico tools for splicing defect prediction: a survey from the viewpoint of end users.用于剪接缺陷预测的计算机工具:从终端用户角度的综述
Genet Med. 2014 Jul;16(7):497-503. doi: 10.1038/gim.2013.176. Epub 2013 Nov 21.
9
RefSeq: an update on mammalian reference sequences.RefSeq:哺乳动物参考序列的更新。
Nucleic Acids Res. 2014 Jan;42(Database issue):D756-63. doi: 10.1093/nar/gkt1114. Epub 2013 Nov 19.
10
dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations.dbNSFP v2.0:一个人类非同义 SNP 及其功能预测和注释数据库。
Hum Mutat. 2013 Sep;34(9):E2393-402. doi: 10.1002/humu.22376. Epub 2013 Jul 10.