Murashko Oleg N, Morgan-Lang Connor, Yu Chen-Hsin Albert, Lin Hsin-Nan, Chao Kaberdina Anna, Kung Shin-Yu, Kaberdin Vladimir R, Lin-Chao Sue
Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan.
Koonkie Cloud Services Inc., Menlo Park, California, USA.
mSphere. 2025 Jul 29;10(7):e0012425. doi: 10.1128/msphere.00124-25. Epub 2025 Jul 7.
Repetitive extragenic palindromic (REP) sequences were first discovered in the intergenic regions of and later found in phylogenetically distinct bacterial species. However, their biological roles and sequence conservation remain unclear. Considering the growing number of microbial genome sequencing and annotation projects, along with the lack of online tools for REP identification, we developed a new web-based platform, RepRanger, to rapidly identify and annotate putative palindromic elements, including REPs. Using RepRanger, we identified >4,000 REPs in the MG1655 genome. Moreover, 81 (~52%) of its 157 small noncoding RNAs (sRNAs; known to regulate gene expression) contain REPs. Further analysis using RepRanger, alongside other bioinformatics tools, yielded 10 REP consensus motifs. Notably, REPs potentially involved in translational control (i.e., located within 15 nucleotides downstream of an open reading frame) primarily host motif 4, whereas REPs present in sRNAs predominantly encoded motif 9. Putative targets of REP-containing sRNAs include genes that rewire metabolic pathways in response to environmental changes, indicating that REPs likely contribute to bacterial adaptation. We also reveal that REP consensus motifs are similar across pathogenic and environmental strains but not commensal and laboratory ones. We employed RepRanger to search for REPs and confirm that they are present across bacteria and archaea. Furthermore, we uncovered similarities in REP sequences between bacterial and archaeal genomes. Thus, we demonstrate that RepRanger represents a versatile tool for discovering REPs in bacteria and archaea, providing new insights into their functions and sequence diversity.IMPORTANCERepetitive extragenic palindromic (REP) sequences were first discovered in , but their biological roles, diversity, and sequence conservation remain unclear. We have developed a web-based tool, RepRanger, to identify and annotate putative palindromic elements, including REPs. Using RepRanger, we identified approximately 4,000 REPs in the MG1655 genome. We show that >50% of small noncoding RNAs (sRNAs) contain REPs. The predicted functions of REP-containing sRNAs indicate that REPs likely contribute to bacterial environmental adaptability. In addition, we have discovered REPs in pathogenic, environmental, and commensal allowing us to assess their sequence similarity. We show that REPs are widely present in bacterial and archaeal genomes and share some sequence similarities. Our comparison of REPs in annotated genomes broadens the current understanding of REP sequence diversity, conservation, and function.
重复基因外回文序列(REP)最初是在[具体物种]的基因间区域发现的,后来在系统发育上不同的细菌物种中也被发现。然而,它们的生物学作用和序列保守性仍不清楚。考虑到微生物基因组测序和注释项目的数量不断增加,以及缺乏用于REP识别的在线工具,我们开发了一个新的基于网络的平台RepRanger,以快速识别和注释假定的回文元件,包括REP。使用RepRanger,我们在[具体细菌]MG1655基因组中鉴定出>4000个REP。此外,其157个小非编码RNA(sRNA,已知可调节基因表达)中有81个(约52%)含有REP。使用RepRanger以及其他生物信息学工具进行的进一步分析产生了10个REP共有基序。值得注意的是,可能参与翻译控制的REP(即位于开放阅读框下游15个核苷酸内)主要包含基序4,而sRNA中存在的REP主要编码基序9。含REP的sRNA的假定靶标包括响应环境变化而重新连接代谢途径的基因,这表明REP可能有助于细菌适应环境。我们还发现,致病菌株和环境菌株中的REP共有基序相似,但共生菌株和实验室菌株中的则不同。我们使用RepRanger搜索REP,并确认它们存在于细菌和古菌中。此外,我们发现了细菌和古菌基因组中REP序列的相似性。因此,我们证明RepRanger是一种用于在细菌和古菌中发现REP的通用工具,为它们的功能和序列多样性提供了新的见解。
重要性
重复基因外回文序列(REP)最初是在[具体物种]中发现的,但其生物学作用、多样性和序列保守性仍不清楚。我们开发了一个基于网络的工具RepRanger,用于识别和注释假定的回文元件,包括REP。使用RepRanger,我们在[具体细菌]MG1655基因组中鉴定出约4000个REP。我们表明,超过50%的小非编码RNA(sRNA)含有REP。含REP的sRNA的预测功能表明,REP可能有助于细菌的环境适应性。此外,我们在致病、环境和共生[具体细菌]中发现了REP,并能够评估它们的序列相似性。我们表明,REP广泛存在于细菌和古菌基因组中,并具有一些序列相似性。我们对注释基因组中REP的比较拓宽了目前对REP序列多样性、保守性和功能的理解。