• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

非模式生物转录组组装方法的比较性能

Comparative performance of transcriptome assembly methods for non-model organisms.

作者信息

Huang Xin, Chen Xiao-Guang, Armbruster Peter A

机构信息

Department of Biology, Georgetown University, 37th and O Streets NW, Washington, DC, 20057, USA.

Key Laboratory of Prevention and Control for Emerging Infectious Diseases of Guangdong Higher Institutes, Department of Pathogen Biology, School of Public Health and Tropical Medicine, Southern Medical University, Guangzhou, China.

出版信息

BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.

DOI:10.1186/s12864-016-2923-8
PMID:27464550
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4964045/
Abstract

BACKGROUND

The technological revolution in next-generation sequencing has brought unprecedented opportunities to study any organism of interest at the genomic or transcriptomic level. Transcriptome assembly is a crucial first step for studying the molecular basis of phenotypes of interest using RNA-Sequencing (RNA-Seq). However, the optimal strategy for assembling vast amounts of short RNA-Seq reads remains unresolved, especially for organisms without a sequenced genome. This study compared four transcriptome assembly methods, including a widely used de novo assembler (Trinity), two transcriptome re-assembly strategies utilizing proteomic and genomic resources from closely related species (reference-based re-assembly and TransPS) and a genome-guided assembler (Cufflinks).

RESULTS

These four assembly strategies were compared using a comprehensive transcriptomic database of Aedes albopictus, for which a genome sequence has recently been completed. The quality of the various assemblies was assessed by the number of contigs generated, contig length distribution, percent paired-end read mapping, and gene model representation via BLASTX. Our results reveal that de novo assembly generates a similar number of gene models relative to genome-guided assembly with a fragmented reference, but produces the highest level of redundancy and requires the most computational power. Using a closely related reference genome to guide transcriptome assembly can generate biased contig sequences. Increasing the number of reads used in the transcriptome assembly tends to increase the redundancy within the assembly and decrease both median contig length and percent identity between contigs and reference protein sequences.

CONCLUSIONS

This study provides general guidance for transcriptome assembly of RNA-Seq data from organisms with or without a sequenced genome. The optimal transcriptome assembly strategy will depend upon the subsequent downstream analyses. However, our results emphasize the efficacy of de novo assembly, which can be as effective as genome-guided assembly when the reference genome assembly is fragmented. If a genome assembly and sufficient computational resources are available, it can be beneficial to combine de novo and genome-guided assemblies. Caution should be taken when using a closely related reference genome to guide transcriptome assembly. The quantity of read pairs used in the transcriptome assembly does not necessarily correlate with the quality of the assembly.

摘要

背景

新一代测序技术革命为在基因组或转录组水平研究任何感兴趣的生物体带来了前所未有的机遇。转录组组装是使用RNA测序(RNA-Seq)研究感兴趣表型的分子基础的关键第一步。然而,组装大量短RNA-Seq读段的最佳策略仍未解决,尤其是对于没有测序基因组的生物体。本研究比较了四种转录组组装方法,包括一种广泛使用的从头组装器(Trinity)、两种利用来自密切相关物种的蛋白质组和基因组资源的转录组重新组装策略(基于参考的重新组装和TransPS)以及一种基因组引导的组装器(Cufflinks)。

结果

使用白纹伊蚊的综合转录组数据库对这四种组装策略进行了比较,其基因组序列最近已完成。通过生成的重叠群数量、重叠群长度分布、双末端读段映射百分比以及通过BLASTX的基因模型表示来评估各种组装的质量。我们的结果表明,相对于使用片段化参考的基因组引导组装,从头组装产生的基因模型数量相似,但产生的冗余度最高,并且需要最多的计算能力。使用密切相关的参考基因组来指导转录组组装会产生有偏差的重叠群序列。增加转录组组装中使用的读段数量往往会增加组装内的冗余度,并降低重叠群的中位数长度以及重叠群与参考蛋白质序列之间的同一性百分比。

结论

本研究为有或没有测序基因组的生物体的RNA-Seq数据的转录组组装提供了一般性指导。最佳的转录组组装策略将取决于后续的下游分析。然而,我们的结果强调了从头组装的有效性,当参考基因组组装是片段化时,它可以与基因组引导组装一样有效。如果有基因组组装和足够的计算资源,将从头组装和基因组引导组装相结合可能会有好处。在使用密切相关的参考基因组来指导转录组组装时应谨慎。转录组组装中使用的读段对数量不一定与组装质量相关。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d57/4964045/ac4678d14652/12864_2016_2923_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d57/4964045/86b20c25d775/12864_2016_2923_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d57/4964045/d8b4ba930068/12864_2016_2923_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d57/4964045/1d7c8cf49aba/12864_2016_2923_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d57/4964045/ac4678d14652/12864_2016_2923_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d57/4964045/86b20c25d775/12864_2016_2923_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d57/4964045/d8b4ba930068/12864_2016_2923_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d57/4964045/1d7c8cf49aba/12864_2016_2923_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d57/4964045/ac4678d14652/12864_2016_2923_Fig4_HTML.jpg

相似文献

1
Comparative performance of transcriptome assembly methods for non-model organisms.非模式生物转录组组装方法的比较性能
BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.
2
Challenges and advances for transcriptome assembly in non-model species.非模式物种转录组组装面临的挑战与进展
PLoS One. 2017 Sep 20;12(9):e0185020. doi: 10.1371/journal.pone.0185020. eCollection 2017.
3
454 pyrosequencing-based analysis of gene expression profiles in the amphipod Melita plumulosa: transcriptome assembly and toxicant induced changes.基于 454 焦磷酸测序的蔓足类动物蔓足密鳞水虱基因表达谱分析:转录组组装和毒物诱导变化。
Aquat Toxicol. 2014 Aug;153:73-88. doi: 10.1016/j.aquatox.2013.11.022. Epub 2013 Dec 12.
4
Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance.Illumina 短读测序数据用于从头组装非模式蜗牛物种转录组(Radix balthica,Basommatophora,Pulmonata),并比较组装器性能。
BMC Genomics. 2011 Jun 16;12:317. doi: 10.1186/1471-2164-12-317.
5
PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.PARRoT——一种基于同源性的策略,用于量化和比较非模式生物的RNA测序。
BMC Bioinformatics. 2016 Dec 22;17(Suppl 19):513. doi: 10.1186/s12859-016-1366-1.
6
De novo transcriptome assembly for a non-model species, the blood-sucking bug Triatoma brasiliensis, a vector of Chagas disease.对非模式物种巴西吸血蝽(恰加斯病的传播媒介)进行从头转录组组装。
Genetica. 2015 Apr;143(2):225-39. doi: 10.1007/s10709-014-9790-5. Epub 2014 Sep 19.
7
Combined de novo and genome guided assembly and annotation of the Pinus patula juvenile shoot transcriptome.辐射松幼嫩枝梢转录组的从头组装与基因组引导组装及注释
BMC Genomics. 2015 Dec 12;16:1057. doi: 10.1186/s12864-015-2277-7.
8
Transcriptome assembly, gene annotation and tissue gene expression atlas of the rainbow trout.虹鳟鱼的转录组组装、基因注释及组织基因表达图谱
PLoS One. 2015 Mar 20;10(3):e0121778. doi: 10.1371/journal.pone.0121778. eCollection 2015.
9
A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing.利用全长异构体测序和短读长测序的从头组装对高度多倍体甘蔗基因组的复杂转录组进行的一项调查。
BMC Genomics. 2017 May 22;18(1):395. doi: 10.1186/s12864-017-3757-8.
10
Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: a comparison of de novo assemblers.一种非模式腹足动物(黑唇蜒螺)转录组的组装与注释:从头组装器的比较
BMC Res Notes. 2014 Aug 1;7:488. doi: 10.1186/1756-0500-7-488.

引用本文的文献

1
De novo transcriptome assembly and analysis during agarwood induction in Gyrinops versteegii Gilg. seedling.白木香幼苗沉香诱导过程中的从头转录组组装与分析
Sci Rep. 2025 Jan 23;15(1):2977. doi: 10.1038/s41598-025-87486-7.
2
Roast: a tool for reference-free optimization of supertranscriptome assemblies.Roast:一种用于无参考超级转录组组装优化的工具。
BMC Bioinformatics. 2024 Jan 2;25(1):2. doi: 10.1186/s12859-023-05614-4.
3
Comparative Root Transcriptome Profiling and Gene Regulatory Network Analysis between Eastern and Western Carrot ( L.) Cultivars Reveals Candidate Genes for Vascular Tissue Patterning.

本文引用的文献

1
TransPS: A Transcriptome Post Scaffolding Method for Assembling High Quality Contigs.TransPS:一种用于组装高质量重叠群的转录组支架后方法。
Comput Biol J. 2014;2014. doi: 10.1155/2014/961823. Epub 2014 May 28.
2
Genome sequence of the Asian Tiger mosquito, Aedes albopictus, reveals insights into its biology, genetics, and evolution.白纹伊蚊(亚洲虎蚊)的基因组序列揭示了其生物学、遗传学和进化方面的见解。
Proc Natl Acad Sci U S A. 2015 Nov 3;112(44):E5907-15. doi: 10.1073/pnas.1516410112. Epub 2015 Oct 19.
3
Comparing de novo and reference-based transcriptome assembly strategies by applying them to the blood-sucking bug Rhodnius prolixus.
东西方胡萝卜(L.)品种间的根系转录组比较分析及基因调控网络分析揭示了维管组织模式形成的候选基因。
Plants (Basel). 2023 Sep 30;12(19):3449. doi: 10.3390/plants12193449.
4
Welcome to the big leaves: Best practices for improving genome annotation in non-model plant genomes.欢迎来到大叶植物:改善非模式植物基因组注释的最佳实践。
Appl Plant Sci. 2023 Aug 8;11(4):e11533. doi: 10.1002/aps3.11533. eCollection 2023 Jul-Aug.
5
Differential gene expression in response to fungal pathogen exposure in the aquatic invertebrate, .水生无脊椎动物对真菌病原体暴露的差异基因表达 。 (你提供的原文似乎不完整,句末缺少具体内容)
Ecol Evol. 2023 Jul 28;13(8):e10354. doi: 10.1002/ece3.10354. eCollection 2023 Aug.
6
Optimizing an efficient ensemble approach for high-quality de novo transcriptome assembly of Thymus daenensis.优化高效集成方法,实现高质量的甘肃黄花补血草从头转录组组装。
Sci Rep. 2023 Jul 31;13(1):12415. doi: 10.1038/s41598-023-39620-6.
7
Transcriptomic thermal plasticity underlying gonadal development in a turtle with ZZ/ZW sex chromosomes despite canalized genotypic sex determination.尽管基因型性别决定具有稳定性,但具有ZZ/ZW性染色体的乌龟性腺发育过程中的转录组热可塑性。
Ecol Evol. 2023 Feb 24;13(2):e9854. doi: 10.1002/ece3.9854. eCollection 2023 Feb.
8
Alkaloid production and response to natural adverse conditions in : transcriptome analyses.生物碱的产生及对自然不利条件的响应:转录组分析
BioTechnologia (Pozn). 2022 Dec 24;103(4):355-384. doi: 10.5114/bta.2022.120706. eCollection 2022.
9
Transcriptome Profiling and Expression Localization of Key Sex-Related Genes in a Socially-Controlled Hermaphroditic Clownfish, .转录组谱分析和关键性别相关基因在社会性控制雌雄同体小丑鱼中的表达定位。
Int J Mol Sci. 2022 Aug 13;23(16):9085. doi: 10.3390/ijms23169085.
10
Transcriptome Analysis to Understand Salt Stress Regulation Mechanism of ANJ207.转录组分析以了解ANJ207的盐胁迫调控机制
Front Microbiol. 2022 Jun 30;13:909276. doi: 10.3389/fmicb.2022.909276. eCollection 2022.
通过将从头转录组组装策略和基于参考的转录组组装策略应用于吸血蝽Rhodnius prolixus来进行比较。
Insect Biochem Mol Biol. 2016 Feb;69:25-33. doi: 10.1016/j.ibmb.2015.05.009. Epub 2015 May 22.
4
Global transcriptional dynamics of diapause induction in non-blood-fed and blood-fed Aedes albopictus.未吸血和已吸血白纹伊蚊滞育诱导的全球转录动力学
PLoS Negl Trop Dis. 2015 Apr 21;9(4):e0003724. doi: 10.1371/journal.pntd.0003724. eCollection 2015 Apr.
5
A comparative study of RNA-seq analysis strategies.RNA测序分析策略的比较研究
Brief Bioinform. 2015 Nov;16(6):932-40. doi: 10.1093/bib/bbv007. Epub 2015 Mar 18.
6
Vision from next generation sequencing: multi-dimensional genome-wide analysis for producing gene regulatory networks underlying retinal development, aging and disease.新一代测序技术带来的视野:用于构建视网膜发育、衰老和疾病相关基因调控网络的全基因组多维分析
Prog Retin Eye Res. 2015 May;46:1-30. doi: 10.1016/j.preteyeres.2015.01.005. Epub 2015 Feb 7.
7
Evaluation of de novo transcriptome assemblies from RNA-Seq data.基于RNA测序数据的从头转录组组装评估。
Genome Biol. 2014 Dec 21;15(12):553. doi: 10.1186/s13059-014-0553-5.
8
An experimental and bioinformatics protocol for RNA-Seq analyses of photoperiodic diapause in the Asian tiger mosquito, Aedes albopictus.一种用于白纹伊蚊光周期滞育RNA测序分析的实验与生物信息学方案。
J Vis Exp. 2014 Nov 30(93):e51961. doi: 10.3791/51961.
9
Phylogenomics resolves the timing and pattern of insect evolution.系统基因组学解决了昆虫进化的时间和模式问题。
Science. 2014 Nov 7;346(6210):763-7. doi: 10.1126/science.1257570. Epub 2014 Nov 6.
10
Understanding complex transcriptome dynamics in schizophrenia and other neurological diseases using RNA sequencing.利用RNA测序了解精神分裂症和其他神经疾病中的复杂转录组动态变化。
Int Rev Neurobiol. 2014;116:127-52. doi: 10.1016/B978-0-12-801105-8.00006-0.