Sinclair Robert M, Ravantti Janne J, Bamford Dennis H
Mathematical Biology Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan.
Department of Biosciences and Institute of Biotechnology, University of Helsinki, Helsinki, Finland.
J Virol. 2017 Mar 29;91(8). doi: 10.1128/JVI.02275-16. Print 2017 Apr 15.
Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids.
病毒衣壳通过保护内部的核酸来确保病毒基因组的完整性。基因组与衣壳之间以及各个衣壳蛋白之间的相互作用(即衣壳结构)十分紧密,预计具有很强的进化保守性。因此,有人提出基于衣壳结构的病毒分类方法,以便为病毒世界带来秩序。由于缺乏足够的序列相似性来重现这种分类,因此很难排除结构趋同是该分类的基础。我们重新研究了基于结构的二十面体病毒衣壳蛋白分类是否实际上得到了先前未检测到的序列相似性的支持。由于密码子选择可以共翻译地影响新生蛋白质的折叠,我们同时搜索了氨基酸和核苷酸序列的相似性。为了证明该方法的敏感性,我们鉴定了一种泛病毒衣壳蛋白的候选基因。我们表明,基于结构的分类得到了氨基酸以及核苷酸序列相似性的有力支持,这表明这些相似性是由于共同的祖先。此处显示的同一蛋白质基于结构和基于序列的分析之间的对应关系,使得它们可用于未来对线性序列信息与大分子功能之间以及线性序列与蛋白质折叠之间关系的分析。病毒衣壳保护核酸基因组,而核酸基因组又编码衣壳蛋白。蛋白质外壳与核酸的这种紧密耦合,以及对衣壳蛋白折叠和结构的强大功能限制,导致了这样一种假设,即衣壳蛋白编码核苷酸序列可能保留了古代病毒进化的特征。我们已经能够证明,对于形成二十面体衣壳的病毒的主要衣壳蛋白来说,情况确实如此。重要的是,我们在感染属于生命所有三个域的细胞的病毒的衣壳蛋白编码区之间检测到核苷酸水平的相似性,重现了先前建立的基于结构的二十面体病毒衣壳分类。