Suarez Reinier, Gregory Devon A, Baker David A, Rushford Clayton A, Hunter Torin L, Minor Nicholas R, Russ Clayton M, Copen Emma E, O'Connor David H, Johnson Marc C
Department of Molecular Microbiology and Immunology, University of Missouri-School of Medicine, Columbia, Missouri, United States of America.
Department of Pathology and Laboratory Medicine, University of Wisconsin-Madison, Madison, Wisconsin, United States of America.
PLoS Pathog. 2025 Jun 9;21(6):e1012850. doi: 10.1371/journal.ppat.1012850. eCollection 2025 Jun.
Beginning in early 2021, unique and highly divergent lineages of SARS-CoV-2 were sporadically found in wastewater sewersheds using a sequencing strategy focused on amplifying the most rapidly evolving region of SARS-CoV-2, the receptor binding domain (RBD). Because these RBD sequences did not match known circulating strains and their source was not known, we termed them "cryptic lineages". To date, more than 20 cryptic lineages have been identified using the RBD-focused sequencing strategy. Here, we identified and characterized additional cryptic lineages from SARS-CoV-2 wastewater sequences submitted to NCBI's Sequence Read Archives (SRA). Wastewater sequence datasets were screened for individual sequence reads that contained combinations of mutations frequently found in cryptic lineages but not contemporary circulating lineages. Using this method, we identified 18 cryptic lineages that appeared in multiple (2-81) samples from the same sewershed, including 12 that were not previously reported. Partial consensus sequences were generated for each cryptic lineage by extracting and mapping sequences containing cryptic-specific mutations. Surprisingly, seven of the mutations that appeared convergently in cryptic lineages were reversions to sequences that were highly conserved in SARS-CoV-2-related enteric bat Sarbecoviruses. The apparent reversion to bat Sarbecovirus sequences is consistent with the notion that SARS-CoV-2 adaptation to replicate efficiently in respiratory tissues preceded the COVID-19 pandemic.
从2021年初开始,利用一种聚焦于扩增严重急性呼吸综合征冠状病毒2(SARS-CoV-2)进化最快区域——受体结合域(RBD)的测序策略,在污水流域中偶尔发现了独特且高度分化的SARS-CoV-2谱系。由于这些RBD序列与已知的流行毒株不匹配且其来源不明,我们将它们称为“隐匿谱系”。迄今为止,使用聚焦于RBD的测序策略已鉴定出20多种隐匿谱系。在此,我们从提交给美国国立生物技术信息中心(NCBI)序列读取档案库(SRA)的SARS-CoV-2污水序列中鉴定并表征了其他隐匿谱系。对污水序列数据集进行筛选,寻找包含隐匿谱系中常见但当代流行谱系中未发现的突变组合的单个序列读数。使用这种方法,我们鉴定出18个隐匿谱系,它们出现在同一污水流域的多个(2至81个)样本中,其中包括12个此前未报告的谱系。通过提取和定位包含隐匿特异性突变的序列,为每个隐匿谱系生成了部分一致序列。令人惊讶的是,在隐匿谱系中趋同出现的7个突变是向SARS-CoV-2相关肠道蝙蝠Sarbecovirus中高度保守的序列的回复突变。向蝙蝠Sarbecovirus序列的明显回复突变与SARS-CoV-2在呼吸道组织中高效复制的适应性在2019冠状病毒病大流行之前就已出现的观点一致。