Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, USA.
Centers for Disease Control and Preventiongrid.416738.f, Atlanta, Georgia, USA.
mSphere. 2022 Dec 21;7(6):e0040022. doi: 10.1128/msphere.00400-22. Epub 2022 Nov 15.
The reliability of sequence-based inference of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) transmission is not clear. Sequence data from infections among household members can define the expected genomic diversity of a virus along a defined transmission chain. SARS-CoV-2 cases were identified prospectively among 2,369 participants in 706 households. Specimens with a reverse transcription-PCR cycle threshold of ≤30 underwent whole-genome sequencing. Intrahost single-nucleotide variants (iSNV) were identified at a ≥5% frequency. Phylogenetic trees were used to evaluate the relationship of household and community sequences. There were 178 SARS-CoV-2 cases in 706 households. Among 147 specimens sequenced, 106 yielded a whole-genome consensus with coverage suitable for identifying iSNV. Twenty-six households had sequences from multiple cases within 14 days. Consensus sequences were indistinguishable among cases in 15 households, while 11 had ≥1 consensus sequence that differed by 1 to 2 mutations. Sequences from households and the community were often interspersed on phylogenetic trees. Identification of iSNV improved inference in 2 of 15 households with indistinguishable consensus sequences and in 6 of 11 with distinct ones. In multiple-infection households, whole-genome consensus sequences differed by 0 to 1 mutations. Identification of shared iSNV occasionally resolved linkage, but the low genomic diversity of SARS-CoV-2 limits the utility of "sequence-only" transmission inference. We performed whole-genome sequencing of SARS-CoV-2 from prospectively identified cases in three longitudinal household cohorts. In a majority of multi-infection households, SARS-CoV-2 consensus sequences were indistinguishable, and they differed by 1 to 2 mutations in the rest. Importantly, even with modest genomic surveillance of the community (3 to 5% of cases sequenced), it was not uncommon to find community sequences interspersed with household sequences on phylogenetic trees. Identification of shared minority variants only occasionally resolved these ambiguities in transmission linkage. Overall, the low genomic diversity of SARS-CoV-2 limits the utility of "sequence-only" transmission inference. Our work highlights the need to carefully consider both epidemiologic linkage and sequence data to define transmission chains in households, hospitals, and other transmission settings.
基于序列的严重急性呼吸综合征冠状病毒 2(SARS-CoV-2)传播推断的可靠性尚不清楚。家庭内感染的序列数据可以定义沿定义的传播链的病毒的预期基因组多样性。前瞻性地在 706 户家庭的 2369 名参与者中鉴定了 SARS-CoV-2 病例。逆转录-PCR 循环阈值≤30 的标本进行全基因组测序。以≥5%的频率鉴定了宿主内单核苷酸变异(iSNV)。使用系统发育树评估家庭和社区序列之间的关系。在 706 户家庭中有 178 例 SARS-CoV-2 病例。在 147 个测序标本中,有 106 个产生了具有鉴定 iSNV 适合性的全基因组共识。26 个家庭在 14 天内有多个病例的序列。在 15 个家庭中,病例之间的共识序列无法区分,而 11 个家庭有≥1 个共识序列,差异为 1 到 2 个突变。家庭和社区的序列在系统发育树上经常交织在一起。在 15 个共识序列无法区分的家庭中,2 个家庭的 iSNV 鉴定提高了推断,在 11 个共识序列有差异的家庭中,6 个家庭的推断提高了。在多感染家庭中,全基因组共识序列差异为 0 到 1 个突变。共享 iSNV 的鉴定偶尔可以解决连锁问题,但 SARS-CoV-2 的低基因组多样性限制了“仅序列”传播推断的实用性。我们对三个纵向家庭队列中前瞻性鉴定的病例进行了 SARS-CoV-2 的全基因组测序。在大多数多感染家庭中,SARS-CoV-2 的共识序列无法区分,其余家庭的差异为 1 到 2 个突变。重要的是,即使对社区进行适度的基因组监测(3%至 5%的病例测序),在系统发育树上也经常发现社区序列与家庭序列交织在一起。鉴定共享的少数变体偶尔会解决这些传播连锁的模糊性。总体而言,SARS-CoV-2 的低基因组多样性限制了“仅序列”传播推断的实用性。我们的工作强调需要仔细考虑流行病学联系和序列数据,以在家庭、医院和其他传播环境中定义传播链。