Department of Biology, University of Fribourg & Swiss Institute of Bioinformatics, Fribourg, Switzerland.
Laboratoire de Microbiologie et de Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, UPS, Toulouse, France.
PLoS Comput Biol. 2019 Apr 25;15(4):e1006946. doi: 10.1371/journal.pcbi.1006946. eCollection 2019 Apr.
Bacterial Toxin-Antitoxin systems (TAS) are involved in key biological functions including plasmid maintenance, defense against phages, persistence and virulence. They are found in nearly all phyla and classified into 6 different types based on the mode of inactivation of the toxin, with the type II TAS being the best characterized so far. We have herein developed a new in silico discovery pipeline named TASmania, which mines the >41K assemblies of the EnsemblBacteria database for known and uncharacterized protein components of type I to IV TAS loci. Our pipeline annotates the proteins based on a list of curated HMMs, which leads to >2.106 loci candidates, including orphan toxins and antitoxins, and organises the candidates in pseudo-operon structures in order to identify new TAS candidates based on a guilt-by-association strategy. In addition, we classify the two-component TAS with an unsupervised method on top of the pseudo-operon (pop) gene structures, leading to 1567 "popTA" models offering a more robust classification of the TAs families. These results give valuable clues in understanding the toxin/antitoxin modular structures and the TAS phylum specificities. Preliminary in vivo work confirmed six putative new hits in Mycobacterium tuberculosis as promising candidates. The TASmania database is available on the following server https://shiny.bioinformatics.unibe.ch/apps/tasmania/.
细菌毒素-抗毒素系统(TAS)参与关键的生物学功能,包括质粒维持、防御噬菌体、持久和毒力。它们几乎存在于所有的门中,并根据毒素失活的方式分为 6 种不同的类型,其中 II 型 TAS 是迄今为止研究最充分的。我们在此开发了一种名为 TASmania 的新的计算发现管道,该管道从 EnsemblBacteria 数据库中 >41K 的组装中挖掘已知和未表征的 I 型至 IV 型 TAS 基因座的蛋白成分。我们的管道基于一组经过精心整理的 HMM 对蛋白质进行注释,从而产生了 >2.106 个候选基因座,包括孤儿毒素和抗毒素,并将候选基因座组织成伪操纵子结构,以便根据关联定罪策略识别新的 TAS 候选基因座。此外,我们在伪操纵子(pop)基因结构上使用无监督方法对双组分 TAS 进行分类,得到 1567 个“popTA”模型,为 TAs 家族提供了更稳健的分类。这些结果为理解毒素/抗毒素模块结构和 TAS 门特异性提供了有价值的线索。初步的体内工作证实了结核分枝杆菌中的六个假定新靶点是有希望的候选者。TASmania 数据库可在以下服务器上获得:https://shiny.bioinformatics.unibe.ch/apps/tasmania/。