Cohen Ofir, Doron Shany, Wurtzel Omri, Dar Daniel, Edelheit Sarit, Karunker Iris, Mick Eran, Sorek Rotem
Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA.
Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel.
Nucleic Acids Res. 2016 Jul 8;44(W1):W46-53. doi: 10.1093/nar/gkw394. Epub 2016 May 6.
Whole-transcriptome sequencing studies from recent years revealed an unexpected complexity in transcriptomes of bacteria and archaea, including abundant non-coding RNAs, cis-antisense transcription and regulatory untranslated regions (UTRs). Understanding the functional relevance of the plethora of non-coding RNAs in a given organism is challenging, especially since some of these RNAs were attributed to 'transcriptional noise'. To allow the search for conserved transcriptomic elements we produced comparative transcriptome maps for multiple species across the microbial tree of life. These transcriptome maps are detailed in annotations, comparable by gene families, and BLAST-searchable by user provided sequences. Our transcriptome collection includes 18 model organisms spanning 10 phyla/subphyla of bacteria and archaea that were sequenced using standardized RNA-seq methods. The utility of the comparative approach, as implemented in our web server, is demonstrated by highlighting genes with exceptionally long 5'UTRs across species, which correspond to many known riboswitches and further suggest novel putative regulatory elements. Our study provides a standardized reference transcriptome to major clinically and environmentally important microbial phyla. The viewer is available at http://exploration.weizmann.ac.il/TCOL, setting a framework for comparative studies of the microbial non-coding genome.
近年来的全转录组测序研究揭示了细菌和古菌转录组中意想不到的复杂性,包括丰富的非编码RNA、顺式反义转录和调控非翻译区(UTR)。了解特定生物体中大量非编码RNA的功能相关性具有挑战性,特别是因为其中一些RNA被认为是“转录噪声”。为了寻找保守的转录组元件,我们绘制了生命之树中多个微生物物种的比较转录组图谱。这些转录组图谱注释详细,可按基因家族进行比较,还可通过用户提供的序列进行BLAST搜索。我们的转录组集合包括18种模式生物,涵盖细菌和古菌的10个门/亚门,这些生物均使用标准化RNA测序方法进行了测序。通过突出显示跨物种具有超长5'UTR的基因,证明了我们网络服务器中实施的比较方法的实用性,这些基因对应于许多已知的核糖开关,并进一步提示了新的假定调控元件。我们的研究为主要的临床和环境重要微生物门提供了标准化的参考转录组。可通过http://exploration.weizmann.ac.il/TCOL访问该查看器,它为微生物非编码基因组的比较研究奠定了框架。