• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

短读长读 RNA 测序捕获的转录组复杂性的对比和组合。

Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads.

机构信息

Department of Computer and Information Sciences, School of Engineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

出版信息

Genome Res. 2024 Oct 29;34(10):1624-1635. doi: 10.1101/gr.278659.123.

DOI:10.1101/gr.278659.123
PMID:39322279
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11529863/
Abstract

Mapping transcriptomic variations using either short- or long-read RNA sequencing is a staple of genomic research. Long reads are able to capture entire isoforms and overcome repetitive regions, whereas short reads still provide improved coverage and error rates. Yet, open questions remain, such as how to quantitatively compare the technologies, can we combine them, and what is the benefit of such a combined view? We tackle these questions by first creating a pipeline to assess matched long- and short-read data using a variety of transcriptome statistics. We find that across data sets, algorithms, and technologies, matched short-read data detects ∼30% more splice junctions, such that ∼10%-30% of the splice junctions included at ≥20% by short reads are missed by long reads. In contrast, long reads detect many more intron-retention events and can detect full isoforms, pointing to the benefit of combining the technologies. We introduce MAJIQ-L, an extension of the MAJIQ software, to enable a unified view of transcriptome variations from both technologies and demonstrate its benefits. Our software can be used to assess any future long-read technology or algorithm and can be combined with short-read data for improved transcriptome analysis.

摘要

使用短读或长读 RNA 测序来绘制转录组变异是基因组研究的基础。长读能够捕获整个异构体并克服重复区域,而短读仍能提供更好的覆盖度和更低的错误率。然而,仍有一些悬而未决的问题,例如如何对这些技术进行定量比较,我们能否将它们结合起来,以及这种组合视图有什么好处?我们通过首先创建一个使用各种转录组统计数据来评估匹配的长读和短读数据的管道来解决这些问题。我们发现,在数据集、算法和技术方面,匹配的短读数据检测到的剪接位点大约多 30%,以至于短读数据检测到的剪接位点中有 10%-30%的被长读数据遗漏。相比之下,长读数据检测到更多的内含子保留事件,并且能够检测到完整的异构体,这表明结合使用这两种技术具有优势。我们引入了 MAJIQ-L,这是 MAJIQ 软件的扩展,用于实现两种技术的转录组变异的统一视图,并展示了其优势。我们的软件可用于评估任何未来的长读技术或算法,并可与短读数据结合使用,以提高转录组分析的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/d0715913e271/1624f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/dbf74376d3fb/1624f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/dc19b89feaf8/1624f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/a562c569ae62/1624f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/0cdd25a204b1/1624f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/60e3f9f87628/1624f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/d0715913e271/1624f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/dbf74376d3fb/1624f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/dc19b89feaf8/1624f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/a562c569ae62/1624f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/0cdd25a204b1/1624f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/60e3f9f87628/1624f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e4/11529863/d0715913e271/1624f06.jpg

相似文献

1
Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads.短读长读 RNA 测序捕获的转录组复杂性的对比和组合。
Genome Res. 2024 Oct 29;34(10):1624-1635. doi: 10.1101/gr.278659.123.
2
Contrasting and Combining Transcriptome Complexity Captured by Short and Long RNA Sequencing Reads.短读长和长读长RNA测序读取所捕获的转录组复杂性的对比与整合
bioRxiv. 2023 Nov 21:2023.11.21.568046. doi: 10.1101/2023.11.21.568046.
3
Freddie: annotation-independent detection and discovery of transcriptomic alternative splicing isoforms using long-read sequencing.弗雷迪:使用长读测序进行注释独立的转录组可变剪接异构体的检测和发现。
Nucleic Acids Res. 2023 Jan 25;51(2):e11. doi: 10.1093/nar/gkac1112.
4
Improved transcriptome assembly using a hybrid of long and short reads with StringTie.使用长读长和短读长混合的方法进行转录组组装,可提高组装质量。
PLoS Comput Biol. 2022 Jun 1;18(6):e1009730. doi: 10.1371/journal.pcbi.1009730. eCollection 2022 Jun.
5
A Full-Length mRNA Transcriptome Generated From Hybrid-Corrected PacBio Long-Reads Improves the Transcript Annotation and Identifies Thousands of Novel Splice Variants in Atlantic Salmon.通过混合校正的PacBio长读长生成的全长mRNA转录组改善了转录本注释并鉴定了大西洋鲑鱼中数千种新的剪接变体。
Front Genet. 2021 Apr 27;12:656334. doi: 10.3389/fgene.2021.656334. eCollection 2021.
6
SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification.SQANTI:用于全长转录组鉴定和定量的长读转录序列的广泛特征化,以进行质量控制。
Genome Res. 2018 Mar 1;28(3):396-411. doi: 10.1101/gr.222976.117.
7
CLASS: constrained transcript assembly of RNA-seq reads.类:RNA-seq 读段的约束转录本组装。
BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S14. doi: 10.1186/1471-2105-14-S5-S14. Epub 2013 Apr 10.
8
Accurate identification and analysis of human mRNA isoforms using deep long read sequencing.利用深度长读测序准确识别和分析人类 mRNA 异构体。
G3 (Bethesda). 2013 Mar;3(3):387-97. doi: 10.1534/g3.112.004812. Epub 2013 Mar 1.
9
Transcriptome profiling analysis of tea plant (Camellia sinensis) using Oxford Nanopore long-read RNA-Seq technology.利用牛津纳米孔长读 RNA-Seq 技术对茶树(Camellia sinensis)进行转录组谱分析。
Gene. 2021 Feb 15;769:145247. doi: 10.1016/j.gene.2020.145247. Epub 2020 Oct 20.
10
Transcriptome sequencing of the Microarray Quality Control (MAQC) RNA reference samples using next generation sequencing.使用下一代测序技术对微阵列质量控制(MAQC)RNA参考样本进行转录组测序。
BMC Genomics. 2009 Jun 12;10:264. doi: 10.1186/1471-2164-10-264.

引用本文的文献

1
HNRNPH1-mediated splicing events regulate transcript variant composition and the organization of the 5'UTR.HNRNPH1介导的剪接事件调控转录本变体组成和5'非翻译区的结构。
bioRxiv. 2025 Jul 29:2025.07.28.667222. doi: 10.1101/2025.07.28.667222.
2
Long-read transcriptome assembly reveals vast isoform diversity in the placenta associated with metabolic and endocrine function.长读长转录组组装揭示了胎盘中与代谢和内分泌功能相关的大量异构体多样性。
bioRxiv. 2025 Jun 27:2025.06.26.661362. doi: 10.1101/2025.06.26.661362.
3
Understanding isoform expression by pairing long-read sequencing with single-cell and spatial transcriptomics.

本文引用的文献

1
Systematic assessment of long-read RNA-seq methods for transcript identification and quantification.系统评估长读 RNA-seq 方法在转录本鉴定和定量中的应用。
Nat Methods. 2024 Jul;21(7):1349-1363. doi: 10.1038/s41592-024-02298-3. Epub 2024 Jun 7.
2
Context-aware transcript quantification from long-read RNA-seq data with Bambu.使用 Bambu 从长读 RNA-seq 数据中进行上下文感知的转录本定量。
Nat Methods. 2023 Aug;20(8):1187-1195. doi: 10.1038/s41592-023-01908-w. Epub 2023 Jun 12.
3
RNA splicing analysis using heterogeneous and large RNA-seq datasets.
通过将长读测序与单细胞和空间转录组学相结合来理解异构体表达。
Genome Res. 2024 Nov 20;34(11):1735-1746. doi: 10.1101/gr.279640.124.
4
Steering research on mRNA splicing in cancer towards clinical translation.推动癌症中 mRNA 剪接的研究向临床转化。
Nat Rev Cancer. 2024 Dec;24(12):887-905. doi: 10.1038/s41568-024-00750-2. Epub 2024 Oct 9.
使用异质和大型 RNA-seq 数据集进行 RNA 剪接分析。
Nat Commun. 2023 Mar 3;14(1):1230. doi: 10.1038/s41467-023-36585-y.
4
ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data.ESPRESSO:从易错的长读 RNA-seq 数据中稳健地发现和定量转录本异构体。
Sci Adv. 2023 Jan 20;9(3):eabq5072. doi: 10.1126/sciadv.abq5072.
5
Method of the year: long-read sequencing.年度方法:长读长测序。
Nat Methods. 2023 Jan;20(1):6-11. doi: 10.1038/s41592-022-01730-w.
6
Long-read sequencing in the era of epigenomics and epitranscriptomics.表观基因组学和表观转录组学时代的长读长测序
Nat Methods. 2023 Jan;20(1):25-29. doi: 10.1038/s41592-022-01724-8.
7
Approaching complete genomes, transcriptomes and epi-omes with accurate long-read sequencing.采用准确的长读测序技术获取完整的基因组、转录组和表观基因组。
Nat Methods. 2023 Jan;20(1):12-16. doi: 10.1038/s41592-022-01716-8.
8
The variables on RNA molecules: concert or cacophony? Answers in long-read sequencing.RNA分子上的变量:和谐还是杂音?长读长测序给出答案。
Nat Methods. 2023 Jan;20(1):20-24. doi: 10.1038/s41592-022-01715-9.
9
Accurate isoform discovery with IsoQuant using long reads.利用长读长 IsoQuant 进行准确的异构体发现。
Nat Biotechnol. 2023 Jul;41(7):915-918. doi: 10.1038/s41587-022-01565-y. Epub 2023 Jan 2.
10
Transcriptome variation in human tissues revealed by long-read sequencing.长读测序揭示人类组织中的转录组变异。
Nature. 2022 Aug;608(7922):353-359. doi: 10.1038/s41586-022-05035-y. Epub 2022 Aug 3.