• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于大规模并行拼接分析的剪接变异预测算法的基准测试。

Benchmarking splice variant prediction algorithms using massively parallel splicing assays.

机构信息

Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.

Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.

出版信息

Genome Biol. 2023 Dec 21;24(1):294. doi: 10.1186/s13059-023-03144-z.

DOI:10.1186/s13059-023-03144-z
PMID:38129864
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10734170/
Abstract

BACKGROUND

Variants that disrupt mRNA splicing account for a sizable fraction of the pathogenic burden in many genetic disorders, but identifying splice-disruptive variants (SDVs) beyond the essential splice site dinucleotides remains difficult. Computational predictors are often discordant, compounding the challenge of variant interpretation. Because they are primarily validated using clinical variant sets heavily biased to known canonical splice site mutations, it remains unclear how well their performance generalizes.

RESULTS

We benchmark eight widely used splicing effect prediction algorithms, leveraging massively parallel splicing assays (MPSAs) as a source of experimentally determined ground-truth. MPSAs simultaneously assay many variants to nominate candidate SDVs. We compare experimentally measured splicing outcomes with bioinformatic predictions for 3,616 variants in five genes. Algorithms' concordance with MPSA measurements, and with each other, is lower for exonic than intronic variants, underscoring the difficulty of identifying missense or synonymous SDVs. Deep learning-based predictors trained on gene model annotations achieve the best overall performance at distinguishing disruptive and neutral variants, and controlling for overall call rate genome-wide, SpliceAI and Pangolin have superior sensitivity. Finally, our results highlight two practical considerations when scoring variants genome-wide: finding an optimal score cutoff, and the substantial variability introduced by differences in gene model annotation, and we suggest strategies for optimal splice effect prediction in the face of these issues.

CONCLUSION

SpliceAI and Pangolin show the best overall performance among predictors tested, however, improvements in splice effect prediction are still needed especially within exons.

摘要

背景

在许多遗传疾病中,破坏 mRNA 剪接的变异体占致病负担的相当大一部分,但识别超出必需剪接位点二核苷酸的剪接破坏变异体(SDV)仍然很困难。计算预测器通常存在差异,这增加了变异解释的挑战。由于它们主要使用偏向已知经典剪接位点突变的临床变异集进行验证,因此尚不清楚它们的性能如何普遍适用。

结果

我们利用大规模并行剪接分析(MPSA)作为实验确定的真实值来源,对 8 种广泛使用的剪接效应预测算法进行基准测试。MPSA 同时检测许多变体,以提名候选 SDV。我们将 5 个基因中 3616 个变体的实验测量的剪接结果与生物信息学预测进行比较。对于外显子变体,算法与 MPSA 测量值的一致性以及彼此之间的一致性低于内含子变体,这突出了识别错义或同义 SDV 的困难。基于深度学习的基于基因模型注释的预测器在区分破坏和中性变体方面表现出最佳的整体性能,并且在全基因组范围内控制总体调用率方面,SpliceAI 和 Pangolin 具有更高的灵敏度。最后,我们的结果突出了在全基因组范围内评分变体时需要考虑的两个实际问题:找到最佳的评分截止值,以及基因模型注释差异引入的大量可变性,我们为在这些问题面前进行最佳剪接效应预测提供了策略。

结论

在测试的预测器中,SpliceAI 和 Pangolin 显示出最佳的整体性能,然而,特别是在内含子中,仍然需要改进剪接效应预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/8b98ae31f691/13059_2023_3144_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/571933b63c71/13059_2023_3144_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/cd63eec90430/13059_2023_3144_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/9569441af1c6/13059_2023_3144_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/f26224f7b392/13059_2023_3144_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/f9337b861dc8/13059_2023_3144_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/8b98ae31f691/13059_2023_3144_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/571933b63c71/13059_2023_3144_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/cd63eec90430/13059_2023_3144_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/9569441af1c6/13059_2023_3144_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/f26224f7b392/13059_2023_3144_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/f9337b861dc8/13059_2023_3144_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aad/10734170/8b98ae31f691/13059_2023_3144_Fig6_HTML.jpg

相似文献

1
Benchmarking splice variant prediction algorithms using massively parallel splicing assays.基于大规模并行拼接分析的剪接变异预测算法的基准测试。
Genome Biol. 2023 Dec 21;24(1):294. doi: 10.1186/s13059-023-03144-z.
2
Benchmarking splice variant prediction algorithms using massively parallel splicing assays.使用大规模平行剪接分析对剪接变体预测算法进行基准测试。
bioRxiv. 2023 May 7:2023.05.04.539398. doi: 10.1101/2023.05.04.539398.
3
Benchmarking deep learning splice prediction tools using functional splice assays.使用功能剪接测定法对深度学习剪接预测工具进行基准测试。
Hum Mutat. 2021 Jul;42(7):799-810. doi: 10.1002/humu.24212. Epub 2021 May 20.
4
CADD-Splice-improving genome-wide variant effect prediction using deep learning-derived splice scores.使用深度学习衍生的剪接分数提高 CADD-Splice 全基因组变异效应预测。
Genome Med. 2021 Feb 22;13(1):31. doi: 10.1186/s13073-021-00835-9.
5
Combining genetic constraint with predictions of alternative splicing to prioritize deleterious splicing in rare disease studies.结合遗传约束和选择性剪接预测,优先考虑罕见病研究中的有害剪接。
BMC Bioinformatics. 2022 Nov 14;23(1):482. doi: 10.1186/s12859-022-05041-x.
6
Mechanism and modeling of human disease-associated near-exon intronic variants that perturb RNA splicing.人类疾病相关的近外显子内含子变异体干扰 RNA 剪接的机制和建模。
Nat Struct Mol Biol. 2022 Nov;29(11):1043-1055. doi: 10.1038/s41594-022-00844-1. Epub 2022 Oct 27.
7
Systematic Computational Identification of Variants That Activate Exonic and Intronic Cryptic Splice Sites.激活外显子和内含子隐蔽剪接位点的变异体的系统计算鉴定
Am J Hum Genet. 2017 May 4;100(5):751-765. doi: 10.1016/j.ajhg.2017.04.001.
8
Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications.Introme 能够准确预测编码和非编码变异对基因剪接的影响,具有临床应用价值。
Genome Biol. 2023 May 17;24(1):118. doi: 10.1186/s13059-023-02936-7.
9
Investigation of cryptic JAG1 splice variants as a cause of Alagille syndrome and performance evaluation of splice predictor tools.隐匿性 JAG1 剪接变异作为 Alagille 综合征病因的研究及剪接预测工具的性能评估。
HGG Adv. 2024 Oct 10;5(4):100351. doi: 10.1016/j.xhgg.2024.100351. Epub 2024 Sep 6.
10
Performance evaluation of computational methods for splice-disrupting variants and improving the performance using the machine learning-based framework.基于机器学习框架的剪接破坏变异计算方法性能评估及改进
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac334.

引用本文的文献

1
Tandem splice acceptor sites: Profiling their relevance to human disease.串联剪接受体位点:剖析它们与人类疾病的相关性。
Genet Med. 2025 Jul 2;27(9):101520. doi: 10.1016/j.gim.2025.101520.
2
Unraveling the complex genetic landscape of OTOF-related hearing loss: a deep dive into cryptic variants and haplotype phasing.解析OTOF相关听力损失的复杂遗传图谱:深入探究隐秘变异和单倍型分型
Mol Med. 2025 May 9;31(1):181. doi: 10.1186/s10020-025-01225-2.
3
TP53 minigene analysis of 161 sequence changes provides evidence for role of spatial constraint and regulatory elements on variant-induced splicing impact.

本文引用的文献

1
High-Throughput Splicing Assays Identify Known and Novel Exon 9 Variants in Nephrotic Syndrome.高通量剪接分析鉴定出肾病综合征中已知和新型的外显子9变体。
Kidney Int Rep. 2023 Aug 5;8(10):2117-2125. doi: 10.1016/j.ekir.2023.07.033. eCollection 2023 Oct.
2
Aberrant splicing prediction across human tissues.跨人类组织的异常剪接预测
Nat Genet. 2023 May;55(5):861-870. doi: 10.1038/s41588-023-01373-3. Epub 2023 May 4.
3
SpliceVault predicts the precise nature of variant-associated mis-splicing.SpliceVault 预测了变体相关的错误剪接的确切性质。
对161个序列变化进行的TP53小基因分析为空间限制和调控元件对变异诱导剪接影响的作用提供了证据。
NPJ Genom Med. 2025 May 8;10(1):37. doi: 10.1038/s41525-025-00498-0.
4
Functional impact of splicing variants in the elaboration of complex traits in cattle.剪接变异体对牛复杂性状形成的功能影响
Nat Commun. 2025 Apr 24;16(1):3893. doi: 10.1038/s41467-025-58970-5.
5
Saturation mapping of variant effects using DNA repair reporters.使用DNA修复报告基因进行变异效应的饱和度作图。
bioRxiv. 2025 Mar 6:2025.03.01.640912. doi: 10.1101/2025.03.01.640912.
6
Toward a comprehensive profiling of alternative splicing proteoform structures, interactions and functions.迈向对可变剪接蛋白质异构体的结构、相互作用和功能进行全面分析。
Curr Opin Struct Biol. 2025 Feb;90:102979. doi: 10.1016/j.sbi.2024.102979. Epub 2025 Jan 7.
7
Multifaceted functions of the Wilms tumor 1 protein: From its expression in various malignancies to targeted therapy.肾母细胞瘤1蛋白的多方面功能:从其在各种恶性肿瘤中的表达到靶向治疗
Transl Oncol. 2025 Feb;52:102237. doi: 10.1016/j.tranon.2024.102237. Epub 2024 Dec 12.
8
Decoding biology with massively parallel reporter assays and machine learning.利用大规模平行报告基因检测和机器学习解码生物学。
Genes Dev. 2024 Oct 16;38(17-20):843-865. doi: 10.1101/gad.351800.124.
9
From computational models of the splicing code to regulatory mechanisms and therapeutic implications.从剪接密码的计算模型到调控机制及治疗意义
Nat Rev Genet. 2025 Mar;26(3):171-190. doi: 10.1038/s41576-024-00774-2. Epub 2024 Oct 2.
10
Uncovering recessive alleles in rare Mendelian disorders by genome sequencing of 174 individuals with monoallelic pathogenic variants.通过对174名单等位基因致病变异个体进行基因组测序,发现罕见孟德尔疾病中的隐性等位基因。
Eur J Hum Genet. 2025 Jan;33(1):56-64. doi: 10.1038/s41431-024-01694-9. Epub 2024 Sep 27.
Nat Genet. 2023 Feb;55(2):324-332. doi: 10.1038/s41588-022-01293-8. Epub 2023 Feb 6.
4
POU1F1/Pou1f1 c.143-83A > G Variant Disrupts the Branch Site in Pre-mRNA and Leads to Dwarfism.POU1F1/Pou1f1 c.143-83A > G 变异破坏了前体 mRNA 的分支位点,导致侏儒症。
Endocrinology. 2022 Dec 19;164(2). doi: 10.1210/endocr/bqac198.
5
A recurrent de novo splice site variant involving DNM1 exon 10a causes developmental and epileptic encephalopathy through a dominant-negative mechanism.一种涉及 DNM1 外显子 10a 的反复出现的新生剪接位点变异通过显性负性机制导致发育性和癫痫性脑病。
Am J Hum Genet. 2022 Dec 1;109(12):2253-2269. doi: 10.1016/j.ajhg.2022.11.002. Epub 2022 Nov 21.
6
Combining genetic constraint with predictions of alternative splicing to prioritize deleterious splicing in rare disease studies.结合遗传约束和选择性剪接预测,优先考虑罕见病研究中的有害剪接。
BMC Bioinformatics. 2022 Nov 14;23(1):482. doi: 10.1186/s12859-022-05041-x.
7
SPiP: Splicing Prediction Pipeline, a machine learning tool for massive detection of exonic and intronic variant effects on mRNA splicing.SPiP:剪接预测管道,一种用于大规模检测外显子和内含子变异对 mRNA 剪接影响的机器学习工具。
Hum Mutat. 2022 Dec;43(12):2308-2323. doi: 10.1002/humu.24491. Epub 2022 Nov 20.
8
Mutational and splicing landscape in a cohort of 43,000 patients tested for hereditary cancer.对43000名遗传性癌症检测患者队列的突变和剪接图谱分析。
NPJ Genom Med. 2022 Aug 25;7(1):49. doi: 10.1038/s41525-022-00323-y.
9
Splicing analyses for variants in MMR genes: best practice recommendations from the European Mismatch Repair Working Group.MMR 基因变异的剪接分析:欧洲错配修复工作组的最佳实践建议。
Eur J Hum Genet. 2022 Sep;30(9):1051-1059. doi: 10.1038/s41431-022-01106-w. Epub 2022 Jun 9.
10
CI-SpliceAI-Improving machine learning predictions of disease causing splicing variants using curated alternative splice sites.CI-SpliceAI-利用已注释的可变剪接位点来改进疾病相关剪接变异体的机器学习预测。
PLoS One. 2022 Jun 3;17(6):e0269159. doi: 10.1371/journal.pone.0269159. eCollection 2022.