Mostajo Nelly F, Lataretu Marie, Krautwurst Sebastian, Mock Florian, Desirò Daniel, Lamkiewicz Kevin, Collatz Maximilian, Schoen Andreas, Weber Friedemann, Marz Manja, Hölzer Martin
RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, Leutragraben 1, 07743 Jena, Germany.
Institute of Virology, Philipps-University Marburg, Hans-Meerwein-Straße 2, 35043 Marburg, Germany.
NAR Genom Bioinform. 2020 Mar;2(1):lqz006. doi: 10.1093/nargab/lqz006. Epub 2019 Sep 30.
Although bats are increasingly becoming the focus of scientific studies due to their unique properties, these exceptional animals are still among the least studied mammals. Assembly quality and completeness of bat genomes vary a lot and especially non-coding RNA (ncRNA) annotations are incomplete or simply missing. Accordingly, standard bioinformatics pipelines for gene expression analysis often ignore ncRNAs such as microRNAs or long antisense RNAs. The main cause of this problem is the use of incomplete genome annotations. We present a complete screening for ncRNAs within 16 bat genomes. NcRNAs affect a remarkable variety of vital biological functions, including gene expression regulation, RNA processing, RNA interference and, as recently described, regulatory processes in viral infections. Within all investigated bat assemblies, we annotated 667 ncRNA families including 162 snoRNAs and 193 miRNAs as well as rRNAs, tRNAs, several snRNAs and lncRNAs, and other structural ncRNA elements. We validated our ncRNA candidates by six RNA-Seq data sets and show significant expression patterns that have never been described before in a bat species on such a large scale. Our annotations will be usable as a resource (rna.uni-jena.de/supplements/bats) for deeper studying of bat evolution, ncRNAs repertoire, gene expression and regulation, ecology and important host-virus interactions.
尽管蝙蝠因其独特特性日益成为科学研究的焦点,但这些特殊动物仍是研究最少的哺乳动物之一。蝙蝠基因组的组装质量和完整性差异很大,尤其是非编码RNA(ncRNA)注释不完整或根本缺失。因此,用于基因表达分析的标准生物信息学流程通常会忽略诸如微小RNA或长链反义RNA等ncRNA。这个问题的主要原因是使用了不完整的基因组注释。我们对16个蝙蝠基因组中的ncRNA进行了全面筛选。ncRNA影响着各种各样至关重要的生物学功能,包括基因表达调控、RNA加工、RNA干扰,以及最近描述的病毒感染中的调控过程。在所有研究的蝙蝠基因组组装中,我们注释了667个ncRNA家族,包括162个核仁小RNA和193个微小RNA,以及核糖体RNA、转运RNA、几种小核RNA和长链非编码RNA,还有其他结构性ncRNA元件。我们通过六个RNA测序数据集验证了我们的ncRNA候选物,并展示了此前从未在蝙蝠物种中大规模描述过的显著表达模式。我们的注释将作为一种资源(rna.uni-jena.de/supplements/bats),用于深入研究蝙蝠进化、ncRNA库、基因表达与调控、生态学以及重要的宿主 - 病毒相互作用。