Graduate School of Natural Sciences, Nagoya City University, Nagoya-shi, Aichi-ken 467-8501, Japan.
Infect Genet Evol. 2013 Jun;16:93-8. doi: 10.1016/j.meegid.2013.01.017. Epub 2013 Feb 9.
In the evolutionary studies of proteins, the average effect of natural selection operating on amino acid mutations may be examined by comparing the numbers of synonymous (dS) and nonsynonymous (dN) substitutions that have accumulated during the same time period. In this method, destabilizing mutations occurring across protein molecules may interfere with detection of natural selection, particularly positive selection, operating on other mutations. Here an attempt to detect positive selection eliminating effects of structural constraints is demonstrated using hemagglutinin (HA) of H3N2 human influenza A virus as an example. Compatible and incompatible amino acids were inferred at each site from the computational analysis of three-dimensional structure using the thermodynamic stability as an indicator, and natural selection was examined by comparing dS and dN among compatible amino acids. In the analysis of 2701 nucleotide sequences for the entire coding region of HA, the new method identified twice as many positively selected amino acid sites as the ordinary method (16 and 4 sites in the former method without and with correction for multiple testing, respectively, and 8 and 2 sites in the latter method). Positively selected sites were involved in epitopes, receptor-binding pocket, epistasis, and stabilization, which appeared to be biologically reasonable. Nevertheless, there still appeared to be several problems, which may largely render this method conservative. It may be effective to analyze many densely sampled sequences in this method.
在蛋白质的进化研究中,可以通过比较在同一时间段内积累的同义(dS)和非同义(dN)替换数来检查自然选择对氨基酸突变的平均影响。在这种方法中,跨蛋白质分子发生的不稳定突变可能会干扰对其他突变起作用的自然选择的检测,尤其是正选择。本文以 H3N2 人甲型流感病毒血凝素(HA)为例,试图通过检测消除结构约束的正选择来消除这种影响。从使用热力学稳定性作为指标的三维结构计算分析中推断出每个位置的相容和不相容氨基酸,并通过比较相容氨基酸中的 dS 和 dN 来检查自然选择。在对 HA 整个编码区的 2701 个核苷酸序列的分析中,新方法确定的正选择氨基酸位点数量是常规方法的两倍(前者方法无校正和有校正的分别为 16 个和 4 个,后者方法为 8 个和 2 个)。正选择的位点涉及表位、受体结合口袋、上位效应和稳定性,这似乎是合理的。然而,似乎仍然存在一些问题,这可能使该方法过于保守。在该方法中分析许多密集采样的序列可能会很有效。