• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

扩展 rnaSPAdes 功能以进行混合转录组组装。

Extending rnaSPAdes functionality for hybrid transcriptome assembly.

机构信息

Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia.

Consiglio Nazionale delle Ricerche, Istituto per i Sistemi Agricoli e Forestali del Mediterraneo, Catania, Italy.

出版信息

BMC Bioinformatics. 2020 Jul 24;21(Suppl 12):302. doi: 10.1186/s12859-020-03614-2.

DOI:10.1186/s12859-020-03614-2
PMID:32703149
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7379828/
Abstract

BACKGROUND

De novo RNA-Seq assembly is a powerful method for analysing transcriptomes when the reference genome is not available or poorly annotated. However, due to the short length of Illumina reads it is usually impossible to reconstruct complete sequences of complex genes and alternative isoforms. Recently emerged possibility to generate long RNA reads, such as PacBio and Oxford Nanopores, may dramatically improve the assembly quality, and thus the consecutive analysis. While reference-based tools for analysing long RNA reads were recently developed, there is no established pipeline for de novo assembly of such data.

RESULTS

In this work we present a novel method that allows to perform high-quality de novo transcriptome assemblies by combining accuracy and reliability of short reads with exon structure information carried out from long error-prone reads. The algorithm is designed by incorporating existing hybridSPAdes approach into rnaSPAdes pipeline and adapting it for transcriptomic data.

CONCLUSION

To evaluate the benefit of using long RNA reads we selected several datasets containing both Illumina and Iso-seq or Oxford Nanopore Technologies (ONT) reads. Using an existing quality assessment software, we show that hybrid assemblies performed with rnaSPAdes contain more full-length genes and alternative isoforms comparing to the case when only short-read data is used.

摘要

背景

当参考基因组不可用或注释较差时,从头 RNA-Seq 组装是分析转录组的一种强大方法。然而,由于 Illumina 读取的长度较短,通常不可能重建复杂基因和替代异构体的完整序列。最近出现的生成长 RNA 读取的可能性,例如 PacBio 和 Oxford Nanopores,可以极大地提高组装质量,从而提高后续分析的质量。虽然最近开发了用于分析长 RNA 读取的基于参考的工具,但尚无用于从头组装此类数据的既定流水线。

结果

在这项工作中,我们提出了一种新方法,该方法通过将短读的准确性和可靠性与从长易错读中提取的外显子结构信息结合起来,从而实现高质量的从头转录组组装。该算法的设计是通过将现有的混合 SPAdes 方法纳入 rnaSPAdes 流水线并对其进行调整以适应转录组数据来实现的。

结论

为了评估使用长 RNA 读取的好处,我们选择了包含 Illumina 和 Iso-seq 或 Oxford Nanopore Technologies (ONT) 读取的几个数据集。使用现有的质量评估软件,我们表明,与仅使用短读数据的情况相比,使用 rnaSPAdes 进行的混合组装包含更多的全长基因和替代异构体。

相似文献

1
Extending rnaSPAdes functionality for hybrid transcriptome assembly.扩展 rnaSPAdes 功能以进行混合转录组组装。
BMC Bioinformatics. 2020 Jul 24;21(Suppl 12):302. doi: 10.1186/s12859-020-03614-2.
2
rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data.rnaSPAdes:一种从头转录组组装程序及其在 RNA-Seq 数据中的应用。
Gigascience. 2019 Sep 1;8(9). doi: 10.1093/gigascience/giz100.
3
A Full-Length mRNA Transcriptome Generated From Hybrid-Corrected PacBio Long-Reads Improves the Transcript Annotation and Identifies Thousands of Novel Splice Variants in Atlantic Salmon.通过混合校正的PacBio长读长生成的全长mRNA转录组改善了转录本注释并鉴定了大西洋鲑鱼中数千种新的剪接变体。
Front Genet. 2021 Apr 27;12:656334. doi: 10.3389/fgene.2021.656334. eCollection 2021.
4
A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing.利用全长异构体测序和短读长测序的从头组装对高度多倍体甘蔗基因组的复杂转录组进行的一项调查。
BMC Genomics. 2017 May 22;18(1):395. doi: 10.1186/s12864-017-3757-8.
5
Can we use it? On the utility of de novo and reference-based assembly of Nanopore data for plant plastome sequencing.我们可以使用它吗?从头组装和基于参考的纳米孔数据在植物质体测序中的应用。
PLoS One. 2020 Mar 24;15(3):e0226234. doi: 10.1371/journal.pone.0226234. eCollection 2020.
6
Improving the quality of barley transcriptome de novo assembling by using a hybrid approach for lines with varying spike and stem coloration.通过使用混合方法提高不同穗部和茎部颜色品系大麦转录组从头组装的质量。
Vavilovskii Zhurnal Genet Selektsii. 2021 Feb;25(1):30-38. doi: 10.18699/VJ21.004.
7
A comparison of next generation sequencing technologies for transcriptome assembly and utility for RNA-Seq in a non-model bird.非模式鸟类中转录组组装的新一代测序技术比较及RNA测序的实用性
PLoS One. 2014 Oct 3;9(10):e108550. doi: 10.1371/journal.pone.0108550. eCollection 2014.
8
Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses.用 Illumina 短读序列对牛津纳米孔长读序列组装的细菌病原体进行打磨,以改进基因组分析。
Genomics. 2021 May;113(3):1366-1377. doi: 10.1016/j.ygeno.2021.03.018. Epub 2021 Mar 11.
9
Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.用于纳米孔数据的从头组装算法基准测试揭示了重叠布局一致(OLC)方法的最佳性能。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.
10
The long and short of it: benchmarking viromics using Illumina, Nanopore and PacBio sequencing technologies.简而言之:使用Illumina、Nanopore和PacBio测序技术对病毒组进行基准测试。
Microb Genom. 2024 Feb;10(2). doi: 10.1099/mgen.0.001198.

引用本文的文献

1
How drought and ploidy level shape gene expression and DNA methylation in Phragmites australis.干旱和倍性水平如何塑造芦苇的基因表达和DNA甲基化。
Plant Cell Rep. 2025 Aug 12;44(9):197. doi: 10.1007/s00299-025-03585-9.
2
Draft whole genome sequence of strain P02PL2 an endophytic fungal species isolated from .从……分离出的内生真菌物种P02PL2菌株的全基因组序列草图
Microbiol Resour Announc. 2025 Mar 11;14(3):e0086524. doi: 10.1128/mra.00865-24. Epub 2025 Feb 18.
3
Trans2express - de novo transcriptome assembly pipeline optimized for gene expression analysis.

本文引用的文献

1
RNA-Bloom enables reference-free and reference-guided sequence assembly for single-cell transcriptomes.RNA-Bloom 能够实现无参考和有参考的单细胞转录组序列组装。
Genome Res. 2020 Aug;30(8):1191-1200. doi: 10.1101/gr.260174.119. Epub 2020 Aug 17.
2
rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data.rnaSPAdes:一种从头转录组组装程序及其在 RNA-Seq 数据中的应用。
Gigascience. 2019 Sep 1;8(9). doi: 10.1093/gigascience/giz100.
3
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers.
Trans2express - 针对基因表达分析优化的从头转录组组装流程。
Plant Methods. 2024 Aug 17;20(1):128. doi: 10.1186/s13007-024-01255-7.
4
Identification of novel cattle (Bos taurus) genes and biological insights of their function in pre-implantation embryo development.鉴定新型牛(Bos taurus)基因及其在胚胎植入前发育中的功能的生物学见解。
BMC Genomics. 2024 Aug 9;25(1):775. doi: 10.1186/s12864-024-10685-5.
5
Integrated transcriptomic and WGCNA analyses reveal candidate genes regulating mainly flavonoid biosynthesis in Litsea coreana var. sinensis.整合转录组学和 WGCNA 分析揭示候选基因调控红淡比Litsea coreana var. sinensis 中主要黄酮类生物合成。
BMC Plant Biol. 2024 Apr 1;24(1):231. doi: 10.1186/s12870-024-04949-1.
6
Enhancing transcriptome analysis in medicinal plants: multiple unigene sets in .增强药用植物的转录组分析:. 中的多个单基因集
Front Plant Sci. 2024 Feb 7;15:1301526. doi: 10.3389/fpls.2024.1301526. eCollection 2024.
7
Genomic and morphological characterization of a new Thiothrix species from a sulfide hot spring of the Zmeinaya bay (Northern Baikal, Russia).从俄罗斯北贝加尔湖的 Zmeinaya 湾(俄罗斯)的一处硫化温泉中分离到一种新的硫丝菌属物种的基因组和形态特征。
Antonie Van Leeuwenhoek. 2024 Jan 13;117(1):23. doi: 10.1007/s10482-023-01918-w.
8
Evaluation of Genetic Diversity and Virulence Potential of Isolated from Water Supply Systems of Residential Buildings in Latvia.拉脱维亚住宅建筑供水系统中分离出的[具体对象未给出]的遗传多样性和毒力潜力评估。
Pathogens. 2023 Jun 28;12(7):884. doi: 10.3390/pathogens12070884.
9
Prevalence and Genetic Diversity of spp. in Hotel Water-Supply Systems in Latvia.拉脱维亚酒店供水系统中[具体物种名称缺失]的流行情况和遗传多样性 。
Microorganisms. 2023 Feb 27;11(3):596. doi: 10.3390/microorganisms11030596.
10
annotate_my_genomes: an easy-to-use pipeline to improve genome annotation and uncover neglected genes by hybrid RNA sequencing.annotate_my_genomes:一种易于使用的通过混合 RNA 测序来改善基因组注释和发现被忽视基因的管道。
Gigascience. 2022 Dec 6;11. doi: 10.1093/gigascience/giac099.
从头转录组组装:短读 RNA-Seq 组装器的全面跨物种比较。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz039.
4
Long-Read RNA Sequencing Identifies Alternative Splice Variants in Hepatocellular Carcinoma and Tumor-Specific Isoforms.长读 RNA 测序鉴定肝癌中的可变剪接变体和肿瘤特异性异构体。
Hepatology. 2019 Sep;70(3):1011-1025. doi: 10.1002/hep.30500. Epub 2019 Mar 22.
5
PacBio full-length cDNA sequencing integrated with RNA-seq reads drastically improves the discovery of splicing transcripts in rice.PacBio 全长 cDNA 测序与 RNA-seq reads 相结合极大地提高了水稻剪接转录本的发现。
Plant J. 2019 Jan;97(2):296-305. doi: 10.1111/tpj.14120. Epub 2018 Dec 3.
6
IDP-denovo: de novo transcriptome assembly and isoform annotation by hybrid sequencing.IDP-denovo:通过混合测序进行从头转录组组装和异构体注释。
Bioinformatics. 2018 Jul 1;34(13):2168-2176. doi: 10.1093/bioinformatics/bty098.
7
Long-read sequencing of nascent RNA reveals coupling among RNA processing events.长读测序技术对新生 RNA 进行测序,揭示了 RNA 加工事件之间的偶联。
Genome Res. 2018 Jul;28(7):1008-1019. doi: 10.1101/gr.232025.117. Epub 2018 Jun 14.
8
Highly parallel direct RNA sequencing on an array of nanopores.基于纳米孔阵列的高通量直接 RNA 测序。
Nat Methods. 2018 Mar;15(3):201-206. doi: 10.1038/nmeth.4577. Epub 2018 Jan 15.
9
Long-read sequencing of the coffee bean transcriptome reveals the diversity of full-length transcripts.长读测序技术揭示了咖啡豆转录组全长转录本的多样性。
Gigascience. 2017 Nov 1;6(11):1-13. doi: 10.1093/gigascience/gix086.
10
Nanopore long-read RNAseq reveals widespread transcriptional variation among the surface receptors of individual B cells.纳米孔长读 RNA 测序揭示了个体 B 细胞表面受体之间广泛的转录变异性。
Nat Commun. 2017 Jul 19;8:16027. doi: 10.1038/ncomms16027.