Cummings Craig A, Bormann Chung Christina A, Fang Rixun, Barker Melissa, Brzoska Pius, Williamson Phillip C, Beaudry Jodi, Matthews Molly, Schupp James, Wagner David M, Birdsell Dawn, Vogler Amy J, Furtado Manohar R, Keim Paul, Budowle Bruce
Northern Arizona University, Flagstaff, Arizona, USA.
Investig Genet. 2010 Sep 1;1(1):5. doi: 10.1186/2041-2223-1-5.
In the event of biocrimes or infectious disease outbreaks, high-resolution genetic characterization for identifying the agent and attributing it to a specific source can be crucial for an effective response. Until recently, in-depth genetic characterization required expensive and time-consuming Sanger sequencing of a few strains, followed by genotyping of a small number of marker loci in a panel of isolates at or by gel-based approaches such as pulsed field gel electrophoresis, which by necessity ignores most of the genome. Next-generation, massively parallel sequencing (MPS) technology (specifically the Applied Biosystems sequencing by oligonucleotide ligation and detection (SOLiD™) system) is a powerful investigative tool for rapid, cost-effective and parallel microbial whole-genome characterization.
To demonstrate the utility of MPS for whole-genome typing of monomorphic pathogens, four Bacillus anthracis and four Yersinia pestis strains were sequenced in parallel. Reads were aligned to complete reference genomes, and genomic variations were identified. Resequencing of the B. anthracis Ames ancestor strain detected no false-positive single-nucleotide polymorphisms (SNPs), and mapping of reads to the Sterne strain correctly identified 98% of the 133 SNPs that are not clustered or associated with repeats. Three geographically distinct B. anthracis strains from the A branch lineage were found to have between 352 and 471 SNPs each, relative to the Ames genome, and one strain harbored a genomic amplification. Sequencing of four Y. pestis strains from the Orientalis lineage identified between 20 and 54 SNPs per strain relative to the CO92 genome, with the single Bolivian isolate having approximately twice as many SNPs as the three more closely related North American strains. Coverage plotting also revealed a common deletion in two strains and an amplification in the Bolivian strain that appear to be due to insertion element-mediated recombination events. Most private SNPs (that is, a, variant found in only one strain in this set) selected for validation by Sanger sequencing were confirmed, although rare false-positive SNPs were associated with variable nucleotide tandem repeats.
The high-throughput, multiplexing capability, and accuracy of this system make it suitable for rapid whole-genome typing of microbial pathogens during a forensic or epidemiological investigation. By interrogating nearly every base of the genome, rare polymorphisms can be reliably discovered, thus facilitating high-resolution strain tracking and strengthening forensic attribution.
在发生生物犯罪或传染病爆发时,进行高分辨率基因特征分析以识别病原体并将其溯源至特定来源,对于有效应对至关重要。直到最近,深入的基因特征分析需要对少数菌株进行昂贵且耗时的桑格测序,随后通过基于凝胶的方法(如脉冲场凝胶电泳)对一组分离株中的少数标记位点进行基因分型,而这必然会忽略基因组的大部分信息。新一代大规模平行测序(MPS)技术(特别是应用生物系统公司的基于寡核苷酸连接和检测的测序(SOLiD™)系统)是一种强大的调查工具,可用于快速、经济高效且并行地进行微生物全基因组特征分析。
为证明MPS在单态病原体全基因组分型中的实用性,对4株炭疽芽孢杆菌和4株鼠疫耶尔森菌进行了平行测序。将读取序列与完整的参考基因组进行比对,并识别出基因组变异。对炭疽芽孢杆菌埃姆斯祖先菌株的重测序未检测到假阳性单核苷酸多态性(SNP),将读取序列映射到斯特恩菌株上正确识别出了133个未聚类或与重复序列无关的SNP中的98%。发现来自A分支谱系的3株地理上不同的炭疽芽孢杆菌菌株相对于埃姆斯基因组,每株分别有352至471个SNP,且有1株存在基因组扩增。对来自东方谱系的4株鼠疫耶尔森菌进行测序,相对于CO92基因组,每株发现20至54个SNP,其中单一的玻利维亚分离株的SNP数量约为另外3株亲缘关系更近的北美菌株的两倍。覆盖度绘图还揭示了2株菌株中存在一个共同缺失以及玻利维亚菌株中存在一个扩增,这似乎是由插入元件介导的重组事件导致的。通过桑格测序选择进行验证的大多数私有SNP(即仅在这组中的一个菌株中发现的变异)得到了确认,尽管罕见的假阳性SNP与可变核苷酸串联重复有关。
该系统的高通量、多重分析能力和准确性使其适用于法医或流行病学调查期间微生物病原体的快速全基因组分型。通过检测基因组的几乎每个碱基,可以可靠地发现罕见的多态性,从而便于进行高分辨率菌株追踪并加强法医溯源。