• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无参考组装长读转录组测序数据的 RNA-Bloom2 方法。

Reference-free assembly of long-read transcriptome sequencing data with RNA-Bloom2.

机构信息

Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada.

Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, V5Z 4S6, Canada.

出版信息

Nat Commun. 2023 May 22;14(1):2940. doi: 10.1038/s41467-023-38553-y.

DOI:10.1038/s41467-023-38553-y
PMID:37217540
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10202958/
Abstract

Long-read sequencing technologies have improved significantly since their emergence. Their read lengths, potentially spanning entire transcripts, is advantageous for reconstructing transcriptomes. Existing long-read transcriptome assembly methods are primarily reference-based and to date, there is little focus on reference-free transcriptome assembly. We introduce "RNA-Bloom2 [ https://github.com/bcgsc/RNA-Bloom ]", a reference-free assembly method for long-read transcriptome sequencing data. Using simulated datasets and spike-in control data, we show that the transcriptome assembly quality of RNA-Bloom2 is competitive to those of reference-based methods. Furthermore, we find that RNA-Bloom2 requires 27.0 to 80.6% of the peak memory and 3.6 to 10.8% of the total wall-clock runtime of a competing reference-free method. Finally, we showcase RNA-Bloom2 in assembling a transcriptome sample of Picea sitchensis (Sitka spruce). Since our method does not rely on a reference, it further sets the groundwork for large-scale comparative transcriptomics where high-quality draft genome assemblies are not readily available.

摘要

长读测序技术自出现以来有了显著的改进。它们的读长,可能跨越整个转录本,有利于重建转录组。现有的长读转录组组装方法主要是基于参考的,迄今为止,很少有研究关注无参考转录组组装。我们引入了“RNA-Bloom2[ https://github.com/bcgsc/RNA-Bloom ]”,这是一种用于长读转录组测序数据的无参考组装方法。使用模拟数据集和 Spike-in 对照数据,我们表明 RNA-Bloom2 的转录组组装质量可与基于参考的方法相媲美。此外,我们发现 RNA-Bloom2 需要竞争方法的峰值内存的 27.0%到 80.6%和总运行时间的 3.6%到 10.8%。最后,我们在组装 Sitka spruce(白皮松)的转录组样本中展示了 RNA-Bloom2。由于我们的方法不依赖于参考基因组,因此它为大规模比较转录组学奠定了基础,而在大规模比较转录组学中,高质量的基因组草图组装不易获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/179a89b1c7ef/41467_2023_38553_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/8f8e96c4b00d/41467_2023_38553_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/7be9f2402232/41467_2023_38553_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/14f69b105559/41467_2023_38553_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/2908a3188600/41467_2023_38553_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/b8458e88a44b/41467_2023_38553_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/179a89b1c7ef/41467_2023_38553_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/8f8e96c4b00d/41467_2023_38553_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/7be9f2402232/41467_2023_38553_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/14f69b105559/41467_2023_38553_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/2908a3188600/41467_2023_38553_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/b8458e88a44b/41467_2023_38553_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a11/10202958/179a89b1c7ef/41467_2023_38553_Fig6_HTML.jpg

相似文献

1
Reference-free assembly of long-read transcriptome sequencing data with RNA-Bloom2.无参考组装长读转录组测序数据的 RNA-Bloom2 方法。
Nat Commun. 2023 May 22;14(1):2940. doi: 10.1038/s41467-023-38553-y.
2
Transcript Profiling Using Long-Read Sequencing Technologies.使用长读长测序技术进行转录本分析
Methods Mol Biol. 2018;1783:121-147. doi: 10.1007/978-1-4939-7834-2_6.
3
PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.PARRoT——一种基于同源性的策略,用于量化和比较非模式生物的RNA测序。
BMC Bioinformatics. 2016 Dec 22;17(Suppl 19):513. doi: 10.1186/s12859-016-1366-1.
4
Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study.优化从头转录组组装从短读 RNA-Seq 数据:一项比较研究。
BMC Bioinformatics. 2011 Dec 14;12 Suppl 14(Suppl 14):S2. doi: 10.1186/1471-2105-12-S14-S2.
5
Partitioning RNAs by length improves transcriptome reconstruction from short-read RNA-seq data.通过长度对 RNA 进行分区可提高基于短读长 RNA-seq 数据的转录组重构。
Nat Biotechnol. 2022 May;40(5):741-750. doi: 10.1038/s41587-021-01136-7. Epub 2022 Jan 10.
6
Challenges and advances for transcriptome assembly in non-model species.非模式物种转录组组装面临的挑战与进展
PLoS One. 2017 Sep 20;12(9):e0185020. doi: 10.1371/journal.pone.0185020. eCollection 2017.
7
TransRate: reference-free quality assessment of de novo transcriptome assemblies.TransRate:从头转录组组装的无参考质量评估
Genome Res. 2016 Aug;26(8):1134-44. doi: 10.1101/gr.196469.115. Epub 2016 Jun 1.
8
Current and Future Methods for mRNA Analysis: A Drive Toward Single Molecule Sequencing.mRNA分析的当前及未来方法:迈向单分子测序的动力
Methods Mol Biol. 2018;1783:209-241. doi: 10.1007/978-1-4939-7834-2_11.
9
Full-length isoform concatenation sequencing to resolve cancer transcriptome complexity.全长异构体拼接测序解析癌症转录组复杂性。
BMC Genomics. 2024 Jan 29;25(1):122. doi: 10.1186/s12864-024-10021-x.
10
Sequencing of the needle transcriptome from Norway spruce (Picea abies Karst L.) reveals lower substitution rates, but similar selective constraints in gymnosperms and angiosperms.对挪威云杉(Picea abies Karst L.)针转录组进行测序,揭示了裸子植物和被子植物较低的替换率,但相似的选择压力。
BMC Genomics. 2012 Nov 2;13:589. doi: 10.1186/1471-2164-13-589.

引用本文的文献

1
Transposable element expression and sub-cellular dynamics during hPSC differentiation to endoderm, mesoderm, and ectoderm lineages.人多能干细胞分化为内胚层、中胚层和外胚层谱系过程中的转座元件表达及亚细胞动力学
Nat Commun. 2025 Aug 18;16(1):7670. doi: 10.1038/s41467-025-63080-3.
2
cONcat: Computational reconstruction of concatenated fragments from long Oxford Nanopore reads.cONcat:从长牛津纳米孔测序读段中进行串联片段的计算重建。
PLoS One. 2025 Jul 24;20(7):e0321246. doi: 10.1371/journal.pone.0321246. eCollection 2025.
3
Long-read RNA sequencing unveils a novel cryptic exon in MNAT1 along with its full-length transcript structure in TDP-43 proteinopathy.

本文引用的文献

1
Systematic assessment of long-read RNA-seq methods for transcript identification and quantification.系统评估长读 RNA-seq 方法在转录本鉴定和定量中的应用。
Nat Methods. 2024 Jul;21(7):1349-1363. doi: 10.1038/s41592-024-02298-3. Epub 2024 Jun 7.
2
Constitutive and insect-induced transcriptomes of weevil-resistant and susceptible Sitka spruce.抗象鼻虫和易受象鼻虫侵害的西加云杉的组成型和昆虫诱导转录组
Plant Environ Interact. 2021 Jun 9;2(3):137-147. doi: 10.1002/pei3.10053. eCollection 2021 Jun.
3
An atlas of plant full-length RNA reveals tissue-specific and monocots-dicots conserved regulation of poly(A) tail length.
长读长RNA测序揭示了MNAT1中一个新的隐蔽外显子及其在TDP-43蛋白病中的全长转录本结构。
Commun Biol. 2025 Jul 16;8(1):1056. doi: 10.1038/s42003-025-08463-4.
4
De novo transcriptome assembly of Acartia tonsa adults using Nanopore long-read sequencing.使用纳米孔长读长测序对中华哲水蚤成虫进行从头转录组组装。
Sci Data. 2025 Jul 1;12(1):1112. doi: 10.1038/s41597-025-05399-6.
5
De novo clustering of large long-read transcriptome datasets with isONclust3.使用isONclust3对大型长读长转录组数据集进行从头聚类。
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf207.
6
Transcriptomics in the era of long-read sequencing.长读长测序时代的转录组学
Nat Rev Genet. 2025 Mar 28. doi: 10.1038/s41576-025-00828-z.
7
Notable challenges posed by long-read sequencing for the study of transcriptional diversity and genome annotation.长读长测序在转录多样性研究和基因组注释方面带来的显著挑战。
Genome Res. 2025 Apr 14;35(4):583-592. doi: 10.1101/gr.279865.124.
8
Chromosome-scale telomere to telomere genome assembly of common crystalwort (Riccia sorocarpa Bisch.).普通晶藓(Riccia sorocarpa Bisch.)的染色体级端粒到端粒基因组组装
Sci Data. 2025 Jan 15;12(1):77. doi: 10.1038/s41597-025-04373-6.
9
Evaluation of strategies for evidence-driven genome annotation using long-read RNA-seq.使用长读长RNA测序对证据驱动的基因组注释策略进行评估。
Genome Res. 2025 Apr 14;35(4):1053-1064. doi: 10.1101/gr.279864.124.
10
Evaluation of commercial RNA extraction kits for long-read metatranscriptomics in soil.评价商用 RNA 提取试剂盒在土壤长读转录组学中的应用。
Microb Genom. 2024 Sep;10(9). doi: 10.1099/mgen.0.001298.
植物全长 RNA 图谱揭示了组织特异性和单子叶植物-双子叶植物中聚(A)尾长度的保守调控。
Nat Plants. 2022 Sep;8(9):1118-1126. doi: 10.1038/s41477-022-01224-9. Epub 2022 Aug 18.
4
RATTLE: reference-free reconstruction and quantification of transcriptomes from Nanopore sequencing.RATTLE:基于纳米孔测序的无参转录组重构和定量分析。
Genome Biol. 2022 Jul 8;23(1):153. doi: 10.1186/s13059-022-02715-w.
5
Spruce giga-genomes: structurally similar yet distinctive with differentially expanding gene families and rapidly evolving genes.云杉超大基因组:结构相似但具有明显差异,基因家族差异扩张,基因快速进化。
Plant J. 2022 Sep;111(5):1469-1485. doi: 10.1111/tpj.15889. Epub 2022 Jul 16.
6
The Chinese pine genome and methylome unveil key features of conifer evolution.中国松基因组和甲基组揭示了针叶树进化的关键特征。
Cell. 2022 Jan 6;185(1):204-217.e14. doi: 10.1016/j.cell.2021.12.006. Epub 2021 Dec 28.
7
Effective sequence similarity detection with strobemers.利用频闪体进行有效的序列相似性检测。
Genome Res. 2021 Nov;31(11):2080-2094. doi: 10.1101/gr.275648.121. Epub 2021 Oct 19.
8
BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes.BUSCO 更新:用于真核生物、原核生物和病毒基因组评分的新颖且简化的工作流程以及更广泛和更深的系统发育覆盖范围。
Mol Biol Evol. 2021 Sep 27;38(10):4647-4654. doi: 10.1093/molbev/msab199.
9
Error correction enables use of Oxford Nanopore technology for reference-free transcriptome analysis.纠错功能使牛津纳米孔技术能够用于无参考转录组分析。
Nat Commun. 2021 Jan 4;12(1):2. doi: 10.1038/s41467-020-20340-8.
10
UniProt: the universal protein knowledgebase in 2021.UniProt:2021 年的通用蛋白质知识库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D480-D489. doi: 10.1093/nar/gkaa1100.