Department of Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, Göteborg, SE-412 96, Sweden.
BMC Genomics. 2012 Dec 11;13:695. doi: 10.1186/1471-2164-13-695.
Broad-spectrum fluoroquinolone antibiotics are central in modern health care and are used to treat and prevent a wide range of bacterial infections. The recently discovered qnr genes provide a mechanism of resistance with the potential to rapidly spread between bacteria using horizontal gene transfer. As for many antibiotic resistance genes present in pathogens today, qnr genes are hypothesized to originate from environmental bacteria. The vast amount of data generated by shotgun metagenomics can therefore be used to explore the diversity of qnr genes in more detail.
In this paper we describe a new method to identify qnr genes in nucleotide sequence data. We show, using cross-validation, that the method has a high statistical power of correctly classifying sequences from novel classes of qnr genes, even for fragments as short as 100 nucleotides. Based on sequences from public repositories, the method was able to identify all previously reported plasmid-mediated qnr genes. In addition, several fragments from novel putative qnr genes were identified in metagenomes. The method was also able to annotate 39 chromosomal variants of which 11 have previously not been reported in literature.
The method described in this paper significantly improves the sensitivity and specificity of identification and annotation of qnr genes in nucleotide sequence data. The predicted novel putative qnr genes in the metagenomic data support the hypothesis of a large and uncharacterized diversity within this family of resistance genes in environmental bacterial communities. An implementation of the method is freely available at http://bioinformatics.math.chalmers.se/qnr/.
广谱氟喹诺酮类抗生素在现代医疗保健中占据核心地位,被广泛用于治疗和预防各种细菌感染。最近发现的 qnr 基因提供了一种耐药机制,具有通过水平基因转移在细菌间快速传播的潜力。与当今病原体中存在的许多抗生素耐药基因一样,qnr 基因被假设源自环境细菌。因此,宏基因组学产生的大量数据可用于更详细地探索 qnr 基因的多样性。
在本文中,我们描述了一种在核苷酸序列数据中识别 qnr 基因的新方法。我们通过交叉验证表明,该方法具有很高的统计学能力,能够正确分类来自新型 qnr 基因类别的序列,即使对于短至 100 个核苷酸的片段也是如此。基于来自公共存储库的序列,该方法能够识别所有先前报道的质粒介导的 qnr 基因。此外,在宏基因组中还鉴定出了几个来自新型推定 qnr 基因的片段。该方法还能够注释 39 个染色体变体,其中 11 个在文献中尚未报道过。
本文中描述的方法显著提高了在核苷酸序列数据中识别和注释 qnr 基因的灵敏度和特异性。在宏基因组数据中预测的新型推定 qnr 基因支持了在环境细菌群落中该耐药基因家族具有大量未被表征的多样性的假设。该方法的实现可在 http://bioinformatics.math.chalmers.se/qnr/ 免费获取。