Overlapping genes produce proteins with unusual sequence properties and offer insight into de novo protein creation.

Author information

Rancurel Corinne, Khosravi Mahvash, Dunker A Keith, Romero Pedro R, Karlin David

Affiliation

Architecture et Fonction des Macromolécules Biologiques, Case 932, Campus de Luminy, 13288 Marseille Cedex 9, France.

Publication information

J Virol. 2009 Oct;83(20):10719-36. doi: 10.1128/JVI.00595-09. Epub 2009 Jul 29.

DOI:10.1128/JVI.00595-09

PMID:19640978

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2753099/

Abstract

It is widely assumed that new proteins are created by duplication, fusion, or fission of existing coding sequences. Another mechanism of protein birth is provided by overlapping genes. They are created de novo by mutations within a coding sequence that lead to the expression of a novel protein in another reading frame, a process called "overprinting." To investigate this mechanism, we have analyzed the sequences of the protein products of manually curated overlapping genes from 43 genera of unspliced RNA viruses infecting eukaryotes. Overlapping proteins have a sequence composition globally biased toward disorder-promoting amino acids and are predicted to contain significantly more structural disorder than nonoverlapping proteins. By analyzing the phylogenetic distribution of overlapping proteins, we were able to confirm that 17 of these had been created de novo and to study them individually. Most proteins created de novo are orphans (i.e., restricted to one species or genus). Almost all are accessory proteins that play a role in viral pathogenicity or spread, rather than proteins central to viral replication or structure. Most proteins created de novo are predicted to be fully disordered and have a highly unusual sequence composition. This suggests that some viral overlapping reading frames encoding hypothetical proteins with highly biased composition, often discarded as noncoding, might in fact encode proteins. Some proteins created de novo are predicted to be ordered, however, and whenever a three-dimensional structure of such a protein has been solved, it corresponds to a fold previously unobserved, suggesting that the study of these proteins could enhance our knowledge of protein space.
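The overprinting idea described in the abstract, where one stretch of nucleotides is read in two different reading frames, can be illustrated with a minimal Python sketch. It assumes the standard genetic code; the `translate` helper and the toy sequence are hypothetical illustrations and are not taken from the paper or its dataset.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq, frame=0):
    """Translate a nucleotide sequence starting at offset `frame` (0, 1, or 2).

    Stop codons are rendered as '*'; trailing bases that do not fill a
    complete codon are ignored.
    """
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA alphabets
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# Hypothetical toy sequence, chosen only to show that shifting the frame
# by one nucleotide yields an unrelated amino acid sequence.
toy = "ATGGCCATTGTAATGGGCCGCTGA"
print("frame 0 :", translate(toy, 0))
print("frame +1:", translate(toy, 1))
```

Comparing the two printed strings shows why a protein born in an alternative frame has a sequence unrelated to that of the ancestral frame, even though both are encoded by the same nucleotides.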
