Lau Maggie C Y, Harris Rachel L, Oh Youmi, Yi Min Joo, Behmard Aida, Onstott Tullis C
Department of Geosciences, Princeton University, Princeton, NJ, United States.
Program in Atmospheric and Oceanic Sciences, Princeton University, Princeton, NJ, United States.
Front Microbiol. 2018 Jun 20;9:1235. doi: 10.3389/fmicb.2018.01235. eCollection 2018.
Metatranscriptomics has recently been applied to investigate active biogeochemical processes and elemental cycles, as well as the responses of microbiomes to environmental stimuli and stress factors. Assembly of RNA-Sequencing (RNA-Seq) data can provide a more detailed description of the metabolic interactions among the active members of microbial communities. However, the quality of the assemblies and the depiction of the metabolic network provided by different assemblers have not yet been thoroughly assessed. In this study, we compared 15 metatranscriptomic assemblies for a fracture fluid sample collected from a borehole located 1.34 km below land surface in a South African gold mine. These assemblies were constructed from total, non-coding, and coding reads using five transcriptomic assemblers (Trans-ABySS, Trinity, Oases, IDBA-tran, and Rockhopper). They were evaluated based on the number of transcripts, transcript length, range of transcript coverage, continuity, percentage of transcripts with confident annotation assignments, and taxonomic and functional diversity patterns. These parameters varied considerably among the assemblies. Trans-ABySS and Trinity generated the best assemblies for non-coding and coding RNA reads, respectively, because the large number of transcripts they assembled covered a wide expression range and extensively captured the taxonomic and metabolic gene diversity, respectively. We conclude that the choice of transcriptomic assembler substantially affects the inferred taxonomic and functional compositions. Care should be taken to obtain high-quality assemblies when they are used to describe the metabolic landscape.
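Several of the evaluation criteria listed above (number of transcripts, transcript length, continuity) can be computed directly from an assembler's FASTA output. The following is a minimal sketch, not the authors' pipeline: it assumes N50 as the continuity measure, which the abstract does not specify, and the function names and the three-record example are hypothetical.

```python
from statistics import mean


def read_fasta_lengths(lines):
    """Yield the sequence length of each record in FASTA-formatted lines."""
    length = None  # None until the first header line is seen
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if length is not None:
                yield length
            length = 0
        elif length is not None:
            length += len(line)
    if length is not None:
        yield length


def n50(lengths):
    """Return the N50: the length L such that transcripts of length >= L
    together contain at least half of all assembled bases.
    (Assumed here as the continuity measure; returns 0 for no input.)"""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    return 0


def assembly_summary(lines):
    """Summarize one assembly: transcript count, mean and max length, N50."""
    lengths = list(read_fasta_lengths(lines))
    return {
        "n_transcripts": len(lengths),
        "mean_length": mean(lengths) if lengths else 0,
        "max_length": max(lengths, default=0),
        "n50": n50(lengths),
    }


# Hypothetical toy assembly with three transcripts of length 10, 6, and 4.
example = """>t1
ACGTACGTAC
>t2
ACGTAC
>t3
ACGT
""".splitlines()

summary = assembly_summary(example)
print(summary)
```

Running the same summary over each assembler's output would allow a side-by-side comparison of these three criteria. Coverage range, annotation rate, and diversity patterns need read mapping and annotation steps that this sketch does not attempt.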