Department of Biological Science, Boise State University, 1910 University Drive, Boise, Idaho 83725, USA.
BMC Bioinformatics. 2010 Oct 12;11:508. doi: 10.1186/1471-2105-11-508.
Microevolution is the study of short-term changes of alleles within a population and their effects on the phenotype of organisms. The result of the below-species-level evolution is heterogeneity, where populations consist of subpopulations with a large number of structural variations. Heterogeneity analysis is thus essential to our understanding of how selective and neutral forces shape bacterial populations over a short period of time. The Solexa Genome Analyzer, a next-generation sequencing platform, allows millions of short sequencing reads to be obtained with great accuracy, allowing for the ability to study the dynamics of the bacterial population at the whole genome level. The tool referred to as GenHtr was developed for genome-wide heterogeneity analysis.
For particular bacterial strains, GenHtr relies on a set of Solexa short reads on given bacteria pathogens and their isogenic reference genome to identify heterogeneity sites, the chromosomal positions with multiple variants of genes in the bacterial population, and variations that occur in large gene families. GenHtr accomplishes this by building and comparatively analyzing genome-wide heterogeneity genotypes for both the newly sequenced genomes (using massive short-read sequencing) and their isogenic reference (using simulated data). As proof of the concept, this approach was applied to SRX007711, the Solexa sequencing data for a newly sequenced Staphylococcus aureus subsp. USA300 cell line, and demonstrated that it could predict such multiple variants. They include multiple variants of genes critical in pathogenesis, e.g. genes encoding a LysR family transcriptional regulator, 23 S ribosomal RNA, and DNA mismatch repair protein MutS. The heterogeneity results in non-synonymous and nonsense mutations, leading to truncated proteins for both LysR and MutS.
GenHtr was developed for genome-wide heterogeneity analysis. Although it is much more time-consuming when compared to Maq, a popular tool for SNP analysis, GenHtr is able to predict potential multiple variants that pre-exist in the bacterial population as well as SNPs that occur in the highly duplicated gene families. It is expected that, with the proper experimental design, this analysis can improve our understanding of the molecular mechanism underlying the dynamics and the evolution of drug-resistant bacterial pathogens.
微观进化是研究种群内等位基因的短期变化及其对生物体表型的影响。低于种水平的进化的结果是异质性,其中种群由具有大量结构变异的亚群组成。因此,异质性分析对于我们理解在短时间内选择和中性力量如何塑造细菌种群至关重要。Solexa 基因组分析仪是一种下一代测序平台,它可以非常准确地获得数百万个短测序reads,从而能够在全基因组水平上研究细菌种群的动态。称为 GenHtr 的工具是为全基因组异质性分析而开发的。
对于特定的细菌菌株,GenHtr 依赖于一组给定细菌病原体的 Solexa 短读序列及其同基因参考基因组,以识别异质性位点,即细菌种群中多个基因变体的染色体位置,以及发生在大基因家族中的变异。GenHtr 通过构建和比较新测序基因组(使用大量短读测序)及其同基因参考基因组(使用模拟数据)的全基因组异质性基因型来实现这一点。作为概念验证,该方法应用于 SRX007711,即新测序的金黄色葡萄球菌亚种 USA300 细胞系的 Solexa 测序数据,并证明它可以预测这种多态性。它们包括在发病机制中至关重要的基因的多个变体,例如编码 LysR 家族转录调节剂、23S 核糖体 RNA 和 DNA 错配修复蛋白 MutS 的基因。异质性导致非同义突变和无义突变,导致 LysR 和 MutS 的截短蛋白。
GenHtr 是为全基因组异质性分析而开发的。虽然与用于 SNP 分析的流行工具 Maq 相比,它的时间消耗要多得多,但 GenHtr 能够预测细菌种群中预先存在的潜在多态性以及高度重复基因家族中发生的 SNPs。预计,通过适当的实验设计,这种分析可以提高我们对耐药性细菌病原体动态和进化背后的分子机制的理解。