Physical Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, United States of America.
Department of Pathology, Microbiology and Immunology, School of Veterinary Medicine, University of California Davis, Davis, California, United States of America.
PLoS One. 2019 Dec 6;14(12):e0225699. doi: 10.1371/journal.pone.0225699. eCollection 2019.
The question of how Zika virus (ZIKV) changed from a seemingly mild virus to a human pathogen capable of microcephaly and sexual transmission remains unanswered. The unexpected emergence of ZIKV's pathogenicity and capacity for sexual transmission may be due to genetic changes, and future changes in phenotype may continue to occur as the virus expands its geographic range. Alternatively, the sheer size of the 2015-16 epidemic may have brought attention to a pre-existing virulent ZIKV phenotype in a highly susceptible population. Thus, it is important to identify patterns of genetic change that may yield a better understanding of ZIKV emergence and evolution. However, because ZIKV has an RNA genome and a polymerase incapable of proofreading, it undergoes rapid mutation which makes it difficult to identify combinations of mutations associated with viral emergence. As next generation sequencing technology has allowed whole genome consensus and variant sequence data to be generated for numerous virus samples, the task of analyzing these genomes for patterns of mutation has become more complex. However, understanding which combinations of mutations spread widely and become established in new geographic regions versus those that disappear relatively quickly is essential for defining the trajectory of an ongoing epidemic. In this study, multiscale analysis of the wealth of genomic data generated over the course of the epidemic combined with in vivo laboratory data allowed trends in mutations and outbreak trajectory to be assessed. Mutations were detected throughout the genome via deep sequencing, and many variants appeared in multiple samples and in some cases become consensus. Similarly, amino acids that were previously consensus in pre-outbreak samples were detected as low frequency variants in epidemic strains. Protein structural models indicate that most of the mutations associated with the epidemic transmission occur on the exposed surface of viral proteins. At the macroscale level, consensus data was organized into large and interactive databases to allow the spread of individual mutations and combinations of mutations to be visualized and assessed for temporal and geographical patterns. Thus, the use of multiscale modeling for identifying mutations or combinations of mutations that impact epidemic transmission and phenotypic impact can aid the formation of hypotheses which can then be tested using reverse genetics.
寨卡病毒(ZIKV)如何从一种看似温和的病毒转变为能够导致小头畸形和性传播的人类病原体,这个问题仍未得到解答。寨卡病毒致病性和性传播能力的意外出现可能是由于遗传变化,随着病毒扩大其地理范围,未来的表型变化可能会继续发生。或者,2015-16 年疫情的规模之大,可能使一种原本就存在的毒力较强的寨卡病毒表型在高度易感人群中受到关注。因此,确定可能有助于更好地了解寨卡病毒出现和进化的遗传变化模式非常重要。然而,由于寨卡病毒具有 RNA 基因组和缺乏校对功能的聚合酶,它会迅速发生突变,这使得很难确定与病毒出现相关的突变组合。随着下一代测序技术的发展,已经可以为大量病毒样本生成全基因组共识和变异序列数据,因此分析这些基因组中的突变模式的任务变得更加复杂。然而,了解哪些突变组合在新的地理区域中广泛传播并得以确立,而哪些突变组合则相对较快地消失,对于定义正在进行的疫情轨迹至关重要。在这项研究中,通过对疫情期间产生的大量基因组数据进行多尺度分析,并结合体内实验室数据,评估了突变趋势和疫情轨迹。通过深度测序检测到整个基因组中的突变,许多变异在多个样本中出现,在某些情况下成为共识。同样,在疫情前样本中之前是共识的氨基酸,在疫情株中也被检测到为低频变异。蛋白质结构模型表明,与疫情传播相关的大多数突变都发生在病毒蛋白的暴露表面上。在宏观尺度上,将共识数据组织成大型交互式数据库,以可视化和评估单个突变和突变组合的传播,并评估其时间和地理模式。因此,使用多尺度建模来识别影响疫情传播和表型影响的突变或突变组合,可以帮助形成假说,然后可以使用反向遗传学来验证这些假说。