Grupo de Inmunología Celular e Inmunogenética (GICIG), Facultad de Medicina, Universidad de Antioquia, Medellín, Colombia.
Departamento de Microbiología y Parasitología, Facultad de Medicina, Universidad de Antioquia, Medellín, Colombia.
PLoS One. 2023 May 19;18(5):e0285417. doi: 10.1371/journal.pone.0285417. eCollection 2023.
Mycobacterium tuberculosis (Mtb) is still one of the primary pathogens of humans causing tuberculosis (TB) disease. Mtb embraces nine well-defined phylogenetic lineages with biological and geographical disparities. The lineage L4 is the most globally widespread of all lineages and was introduced to America with European colonization. Taking advantage of many genome projects available in public repositories, we undertake an evolutionary and comparative genomic analysis of 522 L4 Latin American Mtb genomes. Initially, we performed careful quality control of public read datasets and applied several thresholds to filter out low-quality data. Using a genome de novo assembly strategy and phylogenomic methods, we spotted novel south American clades that have not been revealed yet. Additionally, we describe genomic deletion profiles of these strains from an evolutionary perspective and report Mycobacterium tuberculosis L4 sublineages signature-like gene deletions, some of the novel. One is a specific deletion of 6.5 kbp that is only present in sublineage 4.1.2.1. This deletion affects a complex group of 10 genes with putative products annotated, among others, as a lipoprotein, transmembrane protein, and toxin/antitoxin system proteins. The second novel deletion spans for 4.9 kbp and specific of a particular clade of the 4.8 sublineage and affects 7 genes. The last novel deletion affects 4 genes, extends for 4.8 kbp., and is specific to some strains within the 4.1.2.1 sublineage that are present in Colombia, Peru and Brasil.
结核分枝杆菌(Mtb)仍然是导致结核病(TB)的主要人类病原体之一。Mtb 包含九个具有生物学和地理差异的明确的系统发育谱系。谱系 L4 是所有谱系中分布最广泛的,是随着欧洲殖民化引入美洲的。利用公共存储库中提供的许多基因组项目,我们对 522 个拉丁美洲 L4 分枝杆菌基因组进行了进化和比较基因组分析。最初,我们对公共读取数据集进行了仔细的质量控制,并应用了几个阈值来过滤低质量数据。我们使用基因组从头组装策略和系统发育方法,发现了尚未揭示的新的南美分支。此外,我们从进化的角度描述了这些菌株的基因组缺失谱,并报告了分枝杆菌 L4 亚谱系特征样基因缺失,其中一些是新的。一个是仅存在于亚谱系 4.1.2.1 中的 6.5 kbp 的特定缺失。该缺失影响了一组复杂的 10 个基因,其推定产物除其他外被注释为脂蛋白、跨膜蛋白和毒素/抗毒素系统蛋白。第二个新的缺失跨度为 4.9 kbp,是 4.8 亚谱系特定分支的特定缺失,影响 7 个基因。最后一个新的缺失影响 4 个基因,长度为 4.8 kbp,仅存在于哥伦比亚、秘鲁和巴西的一些 4.1.2.1 亚谱系的某些菌株中。