Volpiano Camila Gazolla, Sant'Anna Fernando Hayashi, Ambrosini Adriana, de São José Jackson Freitas Brilhante, Beneduzi Anelise, Whitman William B, de Souza Emanuel Maltempi, Lisboa Bruno Brito, Vargas Luciano Kayser, Passaglia Luciane Maria Pereira
Departamento de Genética, Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil.
Departamento de Diagnóstico e Pesquisa Agropecuária, Secretaria Estadual da Agricultura, Pecuária e Desenvolvimento Rural, Porto Alegre, Brazil.
Front Microbiol. 2021 Mar 25;12:614957. doi: 10.3389/fmicb.2021.614957. eCollection 2021.
Taxonomic decisions within the order have relied heavily on the interpretations of highly conserved 16S rRNA sequences and DNA-DNA hybridizations (DDH). Currently, bacterial species are defined as including strains that present 95-96% of average nucleotide identity (ANI) and 70% of digital DDH (dDDH). Thus, ANI values from 520 genome sequences of type strains from species of order were computed. From the resulting 270,400 comparisons, a ≥95% cut-off was used to extract high identity genome clusters through enumerating maximal cliques. Coupling this graph-based approach with dDDH from clusters of interest, it was found that: (i) there are synonymy between and , and , "," and , , and , , and , , and , and and ; (ii) is not a synonym of ; (iii) "" subsp. and "" subsp. represent members of different species; (iv) the genome accessions GCF_003024615.1 (" LMG 6,125"), GCF_003024595.1 (" LMG 11,892"), GCF_003096615.1 (" DSM 760"), and GCF_000373025.1 (" R-602 sp") are not from the genuine type strains used for the respective species descriptions; and v) "" Py2 and "" KCTC 2,477 represent cases of misuse of the term "type strain". comb. nov. and the reclassification of as is also proposed. To facilitate the downstream analysis of large ANI matrices, we introduce here ProKlust ("Prokaryotic Clusters"), an R package that uses a graph-based approach to obtain, filter, and visualize clusters on identity/similarity matrices, with settable cut-off points and the possibility of multiple matrices entries.
该目内的分类学判定在很大程度上依赖于对高度保守的16S rRNA序列的解读以及DNA-DNA杂交(DDH)。目前,细菌物种被定义为包括呈现95 - 96%平均核苷酸同一性(ANI)和70%数字DDH(dDDH)的菌株。因此,计算了该目物种模式菌株的520个基因组序列的ANI值。从得到的270,400次比较中,通过枚举最大团,使用≥95%的截止值来提取高同一性基因组簇。将这种基于图的方法与来自感兴趣簇的dDDH相结合,发现:(i)[具体物种1]与[具体物种2]、[具体物种3]与[具体物种4]、[具体物种5]与[具体物种6]、[具体物种7]与[具体物种8]、[具体物种9]与[具体物种10]、[具体物种11]与[具体物种12]、[具体物种13]与[具体物种14]、[具体物种15]与[具体物种16]、[具体物种17]与[具体物种18]之间存在同义关系;(ii)[具体物种19]不是[具体物种20]的同义词;(iii)“[具体物种21]”亚种[具体亚种1]和“[具体物种22]”亚种[具体亚种2]代表不同物种的成员;(iv)基因组登录号GCF_003024615.1(“[具体物种23] LMG 6,125”)、GCF_003024595.1(“[具体物种24] LMG 11,892”)、GCF_003096615.1(“[具体物种25] DSM 760”)和GCF_000373025.1(“[具体物种26] R - 602 sp”)并非来自用于各自物种描述的真正模式菌株;以及(v)“[具体物种27]”Py2和“[具体物种28]”KCTC 2,477代表“模式菌株”一词的误用情况[具体物种29]。还提出了新组合[具体物种29]以及将[具体物种30]重新分类为[具体物种31]。为便于对大型ANI矩阵进行下游分析,我们在此引入ProKlust(“原核生物簇”),这是一个R包它使用基于图的方法在同一性/相似性矩阵上获取、过滤和可视化簇,具有可设置的截止点以及多个矩阵输入的可能性。