Michaelsen Thomas Yssing, Brandt Jakob, Singleton Caitlin Margaret, Kirkegaard Rasmus Hansen, Wiesinger Johanna, Segata Nicola, Albertsen Mads
Center for Microbial Communities, Aalborg University, Aalborg, Denmark
Center for Microbial Communities, Aalborg University, Aalborg, Denmark.
mSystems. 2020 Feb 11;5(1):e00587-19. doi: 10.1128/mSystems.00587-19.
High-throughput sequencing has allowed unprecedented insight into the composition and function of complex microbial communities. With metatranscriptomics, it is possible to interrogate the transcriptomes of multiple organisms simultaneously to get an overview of the gene expression of the entire community. Studies have successfully used metatranscriptomics to identify and describe relationships between gene expression levels and community characteristics. However, metatranscriptomic data sets contain a rich suite of additional information that is just beginning to be explored. Here, we focus on antisense expression in metatranscriptomics, discuss the different computational strategies for handling it, and highlight the strengths but also potentially detrimental effects on downstream analysis and interpretation. We also analyzed the antisense transcriptomes of multiple genomes and metagenome-assembled genomes (MAGs) from five different data sets and found high variability in the levels of antisense transcription for individual species, which were consistent across samples. Importantly, we challenged the conceptual framework that antisense transcription is primarily the product of transcriptional noise and found mixed support, suggesting that the total observed antisense RNA in complex communities arises from the combined effect of unknown biological and technical factors. Antisense transcription can be highly informative, including technical details about data quality and novel insight into the biology of complex microbial communities. This study systematically evaluated the global patterns of microbial antisense expression across various environments and provides a bird's-eye view of general patterns observed across data sets, which can provide guidelines in our understanding of antisense expression as well as interpretation of metatranscriptomic data in general. This analysis highlights that in some environments, antisense expression from microbial communities can dominate over regular gene expression. We explored some potential drivers of antisense transcription, but more importantly, this study serves as a starting point, highlighting topics for future research and providing guidelines to include antisense expression in generic bioinformatic pipelines for metatranscriptomic data.
高通量测序使人们对复杂微生物群落的组成和功能有了前所未有的深入了解。通过宏转录组学,可以同时研究多种生物体的转录组,从而全面了解整个群落的基因表达情况。研究已成功利用宏转录组学来识别和描述基因表达水平与群落特征之间的关系。然而,宏转录组数据集包含一系列丰富的额外信息,这些信息才刚刚开始被探索。在这里,我们聚焦于宏转录组学中的反义表达,讨论处理它的不同计算策略,并强调其优势以及对下游分析和解释可能产生的不利影响。我们还分析了来自五个不同数据集的多个基因组和宏基因组组装基因组(MAG)的反义转录组,发现单个物种的反义转录水平存在高度变异性,且在不同样本中保持一致。重要的是,我们对反义转录主要是转录噪声产物这一概念框架提出了挑战,结果发现既有支持的证据,也有反对的证据,这表明在复杂群落中观察到的总反义RNA是未知生物学和技术因素共同作用的结果。反义转录可能具有很高的信息量,包括有关数据质量的技术细节以及对复杂微生物群落生物学的新见解。本研究系统地评估了不同环境中微生物反义表达的全局模式,并提供了对各数据集所观察到的一般模式的鸟瞰图,这可为我们理解反义表达以及一般宏转录组数据的解释提供指导。该分析突出表明,在某些环境中,微生物群落的反义表达可能超过正常基因表达。我们探索了反义转录的一些潜在驱动因素,但更重要的是,本研究作为一个起点,突出了未来研究的主题,并为在宏转录组数据通用生物信息学流程中纳入反义表达提供了指导方针。