Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA.
Department of Medicine, University of Massachusetts Medical School, Worcester, MA.
Mol Biol Evol. 2019 Apr 1;36(4):798-810. doi: 10.1093/molbev/msz022.
The evolution of HIV-1 protein sequences should be governed by a combination of factors including nucleotide mutational probabilities, the genetic code, and fitness. The impact of these factors on protein sequence evolution is interdependent, making it challenging to infer the individual contribution of each factor from phylogenetic analyses alone. We investigated the protein sequence evolution of HIV-1 by determining an experimental fitness landscape of all individual amino acid changes in protease. We compared our experimental results to the frequency of protease variants in a publicly available data set of 32,163 sequenced isolates from drug-naïve individuals. The most common amino acids in sequenced isolates supported robust experimental fitness, indicating that the experimental fitness landscape captured key features of selection acting on protease during viral infections of hosts. Amino acid changes requiring multiple mutations from the likely ancestor were slightly less likely to support robust experimental fitness than single mutations, consistent with the genetic code favoring chemically conservative amino acid changes. Amino acids that were common in sequenced isolates were predominantly accessible by single mutations from the likely protease ancestor. Multiple mutations commonly observed in isolates were accessible by mutational walks with highly fit single mutation intermediates. Our results indicate that the prevalence of multiple-base mutations in HIV-1 protease is strongly influenced by mutational sampling.
HIV-1 蛋白序列的进化应该受到多种因素的共同作用,包括核苷酸突变概率、遗传密码和适应性。这些因素对蛋白质序列进化的影响是相互依存的,仅通过系统发育分析推断每个因素的单独贡献具有挑战性。我们通过确定蛋白酶中所有单个氨基酸变化的实验适应性景观来研究 HIV-1 的蛋白质序列进化。我们将我们的实验结果与来自未经药物治疗的个体中 32163 个测序分离株的公开可用数据集的蛋白酶变体的频率进行了比较。在测序分离株中最常见的氨基酸支持强大的实验适应性,这表明实验适应性景观捕捉到了在宿主病毒感染过程中对蛋白酶起作用的选择的关键特征。与遗传密码有利于化学保守性氨基酸变化一致,从可能的祖先需要多次突变的氨基酸变化支持强大的实验适应性的可能性略低于单突变。在测序分离株中常见的氨基酸主要可以通过从可能的蛋白酶祖先单突变来获得。在分离株中经常观察到的多个突变可以通过具有高度适应性的单突变中间产物的突变步来获得。我们的结果表明,HIV-1 蛋白酶中多碱基突变的普遍性强烈受到突变采样的影响。