Horsfield Samuel T, Fok Basil C T, Fu Yuhan, Turner Paul, Lees John A, Croucher Nicholas J
MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, Imperial College London, London W12 0BZ, United Kingdom;
European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom.
Genome Res. 2025 Apr 14;35(4):1025-1040. doi: 10.1101/gr.279435.124.
Serotype surveillance of (the pneumococcus) is critical for understanding the effectiveness of current vaccination strategies. However, existing methods for serotyping are limited in their ability to identify co-carriage of multiple pneumococci and detect novel serotypes. To develop a scalable and portable serotyping method that overcomes these challenges, we employed nanopore adaptive sampling (NAS), an on-sequencer enrichment method that selects for target DNA in real-time, for direct detection of in complex samples. Whereas NAS targeting the whole genome was ineffective in the presence of nonpathogenic streptococci, the method was both specific and sensitive when targeting the capsular biosynthetic locus (CBL), the operon that determines serotype. NAS significantly improved coverage and yield of the CBL relative to sequencing without NAS and accurately quantified the relative prevalence of serotypes in samples representing co-carriage. To maximize the sensitivity of NAS to detect novel serotypes, we developed and benchmarked a new pangenome-graph algorithm, named GNASTy. We show that GNASTy outperforms the current NAS implementation, which is based on linear genome alignment, when a sample contains a serotype absent from the database of targeted sequences. The methods developed in this work provide an improved approach for novel serotype discovery and routine surveillance that is fast, accurate, and feasible in low-resource settings. Although NAS facilitates whole-genome enrichment under ideal circumstances, GNASTy enables targeted enrichment to optimize serotype surveillance in complex samples.
肺炎球菌的血清型监测对于了解当前疫苗接种策略的有效性至关重要。然而,现有的血清型鉴定方法在识别多种肺炎球菌共携带情况和检测新型血清型方面能力有限。为了开发一种可扩展且便携的血清型鉴定方法以克服这些挑战,我们采用了纳米孔自适应采样(NAS),这是一种测序仪上的富集方法,可实时选择目标DNA,用于直接检测复杂样本中的肺炎球菌。虽然在存在非致病性链球菌的情况下,针对整个肺炎球菌基因组的NAS无效,但当针对荚膜生物合成基因座(CBL)(决定肺炎球菌血清型的操纵子)时,该方法具有特异性和敏感性。相对于无NAS的测序,NAS显著提高了CBL的覆盖率和产量,并准确量化了代表共携带样本中血清型的相对流行率。为了最大限度地提高NAS检测新型血清型的灵敏度,我们开发并测试了一种名为GNASTy的新全基因组图谱算法。我们表明,当样本中包含目标序列数据库中不存在的血清型时,GNASTy优于基于线性基因组比对的当前NAS实施方案。这项工作中开发的方法为新型血清型发现和常规肺炎球菌监测提供了一种改进的方法,该方法在低资源环境中快速、准确且可行。虽然NAS在理想情况下有助于全基因组富集,但GNASTy能够进行靶向富集,以优化复杂样本中的血清型监测。