• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

长读测序数据中 mRNA 异构体检测方法的综合评估。

Comprehensive assessment of mRNA isoform detection methods for long-read sequencing data.

机构信息

Department of Orthopedic Surgery of the Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310009, Zhejiang, China.

Centre of Biomedical Systems and Informatics of Zhejiang University-University of Edinburgh Institute (ZJU-UoE Institute), International Campus, Zhejiang University, Haining, 314400, Zhejiang, China.

出版信息

Nat Commun. 2024 May 10;15(1):3972. doi: 10.1038/s41467-024-48117-3.

DOI:10.1038/s41467-024-48117-3
PMID:38730241
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11087464/
Abstract

The advancement of Long-Read Sequencing (LRS) techniques has significantly increased the length of sequencing to several kilobases, thereby facilitating the identification of alternative splicing events and isoform expressions. Recently, numerous computational tools for isoform detection using long-read sequencing data have been developed. Nevertheless, there remains a deficiency in comparative studies that systemically evaluate the performance of these tools, which are implemented with different algorithms, under various simulations that encompass potential influencing factors. In this study, we conducted a benchmark analysis of thirteen methods implemented in nine tools capable of identifying isoform structures from long-read RNA-seq data. We evaluated their performances using simulated data, which represented diverse sequencing platforms generated by an in-house simulator, RNA sequins (sequencing spike-ins) data, as well as experimental data. Our findings demonstrate IsoQuant as a highly effective tool for isoform detection with LRS, with Bambu and StringTie2 also exhibiting strong performance. These results offer valuable guidance for future research on alternative splicing analysis and the ongoing improvement of tools for isoform detection using LRS data.

摘要

长读测序(Long-Read Sequencing,LRS)技术的进步显著增加了测序的长度至几千个碱基,从而促进了可变剪接事件和异构体表达的鉴定。最近,已经开发出了许多使用长读测序数据进行异构体检测的计算工具。然而,仍然缺乏对这些工具的性能进行系统评估的比较研究,这些工具使用不同的算法,在各种模拟中涵盖了潜在的影响因素。在这项研究中,我们对九个工具中的十三种方法进行了基准分析,这些方法能够从长读 RNA-seq 数据中识别异构体结构。我们使用模拟数据、RNA sequins(测序插入物)数据以及实验数据来评估它们的性能。我们的研究结果表明 IsoQuant 是一种非常有效的 LRS 异构体检测工具,Bambu 和 StringTie2 也表现出很强的性能。这些结果为未来的可变剪接分析研究以及使用 LRS 数据进行异构体检测工具的不断改进提供了有价值的指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/87d629c77a53/41467_2024_48117_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/a75514e5b6b7/41467_2024_48117_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/56cb9ecb4064/41467_2024_48117_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/98c85ae30b75/41467_2024_48117_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/b336c6744b6e/41467_2024_48117_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/820972a403f1/41467_2024_48117_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/87d629c77a53/41467_2024_48117_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/a75514e5b6b7/41467_2024_48117_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/56cb9ecb4064/41467_2024_48117_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/98c85ae30b75/41467_2024_48117_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/b336c6744b6e/41467_2024_48117_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/820972a403f1/41467_2024_48117_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35d1/11087464/87d629c77a53/41467_2024_48117_Fig6_HTML.jpg

相似文献

1
Comprehensive assessment of mRNA isoform detection methods for long-read sequencing data.长读测序数据中 mRNA 异构体检测方法的综合评估。
Nat Commun. 2024 May 10;15(1):3972. doi: 10.1038/s41467-024-48117-3.
2
Transcript Identification Through Long-Read Sequencing.通过长读测序进行转录本鉴定。
Methods Mol Biol. 2021;2284:531-541. doi: 10.1007/978-1-0716-1307-8_29.
3
Freddie: annotation-independent detection and discovery of transcriptomic alternative splicing isoforms using long-read sequencing.弗雷迪:使用长读测序进行注释独立的转录组可变剪接异构体的检测和发现。
Nucleic Acids Res. 2023 Jan 25;51(2):e11. doi: 10.1093/nar/gkac1112.
4
Joint estimation of isoform expression and isoform-specific read distribution using multisample RNA-Seq data.利用多样本 RNA-Seq 数据联合估计异构体表达和异构体特异性读取分布。
Bioinformatics. 2014 Feb 15;30(4):506-13. doi: 10.1093/bioinformatics/btt704. Epub 2013 Dec 3.
5
Enhancing novel isoform discovery: leveraging nanopore long-read sequencing and machine learning approaches.增强新型异构体发现:利用纳米孔长读长测序和机器学习方法。
Brief Funct Genomics. 2024 Dec 6;23(6):683-694. doi: 10.1093/bfgp/elae031.
6
Fine mapping of RNA isoform diversity using an innovative targeted long-read RNA sequencing protocol with novel dedicated bioinformatics pipeline.利用创新的靶向长读 RNA 测序方案和新型专用生物信息学管道精细映射 RNA 异构体多样性。
BMC Genomics. 2024 Sep 30;25(1):909. doi: 10.1186/s12864-024-10741-0.
7
Transcript Profiling Using Long-Read Sequencing Technologies.使用长读长测序技术进行转录本分析
Methods Mol Biol. 2018;1783:121-147. doi: 10.1007/978-1-4939-7834-2_6.
8
Benchmarking long-read RNA-sequencing analysis tools using in silico mixtures.基于计算机模拟混合物对长读 RNA 测序分析工具进行基准测试。
Nat Methods. 2023 Nov;20(11):1810-1821. doi: 10.1038/s41592-023-02026-3. Epub 2023 Oct 2.
9
LIQA: long-read isoform quantification and analysis.LIQA:长读 isoform 定量分析。
Genome Biol. 2021 Jun 17;22(1):182. doi: 10.1186/s13059-021-02399-8.
10
NMFP: a non-negative matrix factorization based preselection method to increase accuracy of identifying mRNA isoforms from RNA-seq data.NMFP:一种基于非负矩阵分解的预选方法,用于提高从RNA测序数据中识别mRNA异构体的准确性。
BMC Genomics. 2016 Jan 11;17 Suppl 1(Suppl 1):11. doi: 10.1186/s12864-015-2304-8.

引用本文的文献

1
A spatial long-read approach at near-single-cell resolution reveals developmental regulation of splicing and polyadenylation sites in distinct cortical layers and cell types.一种接近单细胞分辨率的空间长读长方法揭示了不同皮质层和细胞类型中剪接和聚腺苷酸化位点的发育调控。
Nat Commun. 2025 Aug 29;16(1):8093. doi: 10.1038/s41467-025-63301-9.
2
Spatial isoform sequencing at sub-micrometer single-cell resolution reveals novel patterns of spatial isoform variability in brain cell types.亚微米级单细胞分辨率的空间异构体测序揭示了脑细胞类型中空间异构体变异的新模式。
bioRxiv. 2025 Jun 25:2025.06.25.661563. doi: 10.1101/2025.06.25.661563.
3

本文引用的文献

1
Systematic assessment of long-read RNA-seq methods for transcript identification and quantification.系统评估长读 RNA-seq 方法在转录本鉴定和定量中的应用。
Nat Methods. 2024 Jul;21(7):1349-1363. doi: 10.1038/s41592-024-02298-3. Epub 2024 Jun 7.
2
SQANTI3: curation of long-read transcriptomes for accurate identification of known and novel isoforms.SQANTI3:长读转录组的编目,用于准确识别已知和新的异构体。
Nat Methods. 2024 May;21(5):793-797. doi: 10.1038/s41592-024-02229-2. Epub 2024 Mar 20.
3
Benchmarking long-read RNA-sequencing analysis tools using in silico mixtures.
A spatial long-read approach at near-single-cell resolution reveals developmental regulation of splicing and polyadenylation sites in distinct cortical layers and cell types.
一种接近单细胞分辨率的空间长读长方法揭示了不同皮质层和细胞类型中剪接和聚腺苷酸化位点的发育调控。
bioRxiv. 2025 Jun 10:2025.06.10.658877. doi: 10.1101/2025.06.10.658877.
4
StringTie3 Improves Total RNA-seq Assembly by Resolving Nascent and Mature Transcripts.StringTie3通过解析新生转录本和成熟转录本来改进总RNA测序组装。
bioRxiv. 2025 May 26:2025.05.21.655404. doi: 10.1101/2025.05.21.655404.
5
Quantitative isoform profiling using deep coverage long-read RNA sequencing across early endothelial differentiation.使用深度覆盖长读长RNA测序对早期内皮细胞分化进行定量异构体分析。
bioRxiv. 2025 Jun 2:2025.05.30.656561. doi: 10.1101/2025.05.30.656561.
6
Improving gene isoform quantification with miniQuant.使用miniQuant改进基因异构体定量分析。
Nat Biotechnol. 2025 Jun 3. doi: 10.1038/s41587-025-02633-9.
7
Transcriptomics in the era of long-read sequencing.长读长测序时代的转录组学
Nat Rev Genet. 2025 Mar 28. doi: 10.1038/s41576-025-00828-z.
8
Detection of mRNA Transcript Variants.mRNA转录变体的检测
Genes (Basel). 2025 Mar 16;16(3):343. doi: 10.3390/genes16030343.
9
Clinical validation of RNA sequencing for Mendelian disorder diagnostics.用于孟德尔疾病诊断的RNA测序的临床验证
Am J Hum Genet. 2025 Apr 3;112(4):779-792. doi: 10.1016/j.ajhg.2025.02.006. Epub 2025 Mar 4.
10
Notable challenges posed by long-read sequencing for the study of transcriptional diversity and genome annotation.长读长测序在转录多样性研究和基因组注释方面带来的显著挑战。
Genome Res. 2025 Apr 14;35(4):583-592. doi: 10.1101/gr.279865.124.
基于计算机模拟混合物对长读 RNA 测序分析工具进行基准测试。
Nat Methods. 2023 Nov;20(11):1810-1821. doi: 10.1038/s41592-023-02026-3. Epub 2023 Oct 2.
4
TAGET: a toolkit for analyzing full-length transcripts from long-read sequencing.TAGET:用于分析长读测序全长转录本的工具包。
Nat Commun. 2023 Sep 23;14(1):5935. doi: 10.1038/s41467-023-41649-0.
5
isONform: reference-free transcriptome reconstruction from Oxford Nanopore data.isONform:一种无需参考基因组的转录组重构方法,来自牛津纳米孔技术。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i222-i231. doi: 10.1093/bioinformatics/btad264.
6
Context-aware transcript quantification from long-read RNA-seq data with Bambu.使用 Bambu 从长读 RNA-seq 数据中进行上下文感知的转录本定量。
Nat Methods. 2023 Aug;20(8):1187-1195. doi: 10.1038/s41592-023-01908-w. Epub 2023 Jun 12.
7
IsoTools: a flexible workflow for long-read transcriptome sequencing analysis.IsoTools:一种用于长读转录组测序分析的灵活工作流程。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad364.
8
ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data.ESPRESSO:从易错的长读 RNA-seq 数据中稳健地发现和定量转录本异构体。
Sci Adv. 2023 Jan 20;9(3):eabq5072. doi: 10.1126/sciadv.abq5072.
9
Accurate isoform discovery with IsoQuant using long reads.利用长读长 IsoQuant 进行准确的异构体发现。
Nat Biotechnol. 2023 Jul;41(7):915-918. doi: 10.1038/s41587-022-01565-y. Epub 2023 Jan 2.
10
A male germ-cell-specific ribosome controls male fertility.一种雄性生殖细胞特异性核糖体控制着雄性生育能力。
Nature. 2022 Dec;612(7941):725-731. doi: 10.1038/s41586-022-05508-0. Epub 2022 Dec 14.