• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重叠基因产生具有异常序列特性的蛋白质,并为从头蛋白质创造提供了见解。

Overlapping genes produce proteins with unusual sequence properties and offer insight into de novo protein creation.

作者信息

Rancurel Corinne, Khosravi Mahvash, Dunker A Keith, Romero Pedro R, Karlin David

机构信息

Architecture et Fonction des Macromolécules Biologiques, Case 932, Campus de Luminy, 13288 Marseille Cedex 9, France.

出版信息

J Virol. 2009 Oct;83(20):10719-36. doi: 10.1128/JVI.00595-09. Epub 2009 Jul 29.

DOI:10.1128/JVI.00595-09
PMID:19640978
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2753099/
Abstract

It is widely assumed that new proteins are created by duplication, fusion, or fission of existing coding sequences. Another mechanism of protein birth is provided by overlapping genes. They are created de novo by mutations within a coding sequence that lead to the expression of a novel protein in another reading frame, a process called "overprinting." To investigate this mechanism, we have analyzed the sequences of the protein products of manually curated overlapping genes from 43 genera of unspliced RNA viruses infecting eukaryotes. Overlapping proteins have a sequence composition globally biased toward disorder-promoting amino acids and are predicted to contain significantly more structural disorder than nonoverlapping proteins. By analyzing the phylogenetic distribution of overlapping proteins, we were able to confirm that 17 of these had been created de novo and to study them individually. Most proteins created de novo are orphans (i.e., restricted to one species or genus). Almost all are accessory proteins that play a role in viral pathogenicity or spread, rather than proteins central to viral replication or structure. Most proteins created de novo are predicted to be fully disordered and have a highly unusual sequence composition. This suggests that some viral overlapping reading frames encoding hypothetical proteins with highly biased composition, often discarded as noncoding, might in fact encode proteins. Some proteins created de novo are predicted to be ordered, however, and whenever a three-dimensional structure of such a protein has been solved, it corresponds to a fold previously unobserved, suggesting that the study of these proteins could enhance our knowledge of protein space.

摘要

人们普遍认为,新蛋白质是通过现有编码序列的复制、融合或裂变产生的。蛋白质产生的另一种机制是由重叠基因提供的。它们是由编码序列内的突变从头产生的,这些突变导致在另一个阅读框中表达一种新的蛋白质,这一过程称为“套印”。为了研究这一机制,我们分析了来自43个感染真核生物的未剪接RNA病毒属的人工筛选的重叠基因的蛋白质产物序列。重叠蛋白的序列组成在整体上偏向于促进无序的氨基酸,并且预计比非重叠蛋白含有更多的结构无序。通过分析重叠蛋白的系统发育分布,我们能够确认其中17个是从头产生的,并对它们进行单独研究。大多数从头产生的蛋白质是孤儿蛋白(即仅限于一个物种或属)。几乎所有这些都是在病毒致病性或传播中起作用的辅助蛋白,而不是病毒复制或结构核心的蛋白。大多数从头产生的蛋白质预计是完全无序的,并且具有非常不寻常的序列组成。这表明一些编码具有高度偏向组成的假设蛋白的病毒重叠阅读框,通常被当作非编码序列而被丢弃,实际上可能编码蛋白质。然而,一些从头产生的蛋白质预计是有序的,每当解析出这种蛋白质的三维结构时,它都对应于一种以前未观察到的折叠,这表明对这些蛋白质的研究可以增进我们对蛋白质空间的了解。

相似文献

1
Overlapping genes produce proteins with unusual sequence properties and offer insight into de novo protein creation.重叠基因产生具有异常序列特性的蛋白质,并为从头蛋白质创造提供了见解。
J Virol. 2009 Oct;83(20):10719-36. doi: 10.1128/JVI.00595-09. Epub 2009 Jul 29.
2
Gene Birth Contributes to Structural Disorder Encoded by Overlapping Genes.基因诞生导致重叠基因编码的结构无序。
Genetics. 2018 Sep;210(1):303-313. doi: 10.1534/genetics.118.301249. Epub 2018 Jul 19.
3
Evolution of viral proteins originated de novo by overprinting.新起源的病毒蛋白通过重迭进化。
Mol Biol Evol. 2012 Dec;29(12):3767-80. doi: 10.1093/molbev/mss179. Epub 2012 Jul 19.
4
Bioinformatic analysis suggests that the Orbivirus VP6 cistron encodes an overlapping gene.生物信息学分析表明,环状病毒VP6顺反子编码一个重叠基因。
Virol J. 2008 Apr 14;5:48. doi: 10.1186/1743-422X-5-48.
5
Sequential partially overlapping gene arrangement in the tricistronic S1 genome segments of avian reovirus and Nelson Bay reovirus: implications for translation initiation.禽呼肠孤病毒和纳尔逊湾呼肠孤病毒三顺反子S1基因组片段中的连续部分重叠基因排列:对翻译起始的影响
J Virol. 2002 Jan;76(2):609-18. doi: 10.1128/jvi.76.2.609-618.2002.
6
Asymmetric evolution in viral overlapping genes is a source of selective protein adaptation.病毒重叠基因的非对称进化是选择蛋白适应的一个来源。
Virology. 2019 Jun;532:39-47. doi: 10.1016/j.virol.2019.03.017. Epub 2019 Apr 3.
7
Overlapping genes and the proteins they encode differ significantly in their sequence composition from non-overlapping genes.重叠基因及其编码的蛋白质在序列组成上与非重叠基因有显著差异。
PLoS One. 2018 Oct 19;13(10):e0202513. doi: 10.1371/journal.pone.0202513. eCollection 2018.
8
Viral proteins originated de novo by overprinting can be identified by codon usage: application to the "gene nursery" of Deltaretroviruses.新生成的病毒蛋白可以通过密码子使用情况来识别:在 Deltaretroviruses 的“基因苗圃”中的应用。
PLoS Comput Biol. 2013;9(8):e1003162. doi: 10.1371/journal.pcbi.1003162. Epub 2013 Aug 15.
9
Origins of genes: "big bang" or continuous creation?基因的起源:“大爆炸”还是持续创造?
Proc Natl Acad Sci U S A. 1992 Oct 15;89(20):9489-93. doi: 10.1073/pnas.89.20.9489.
10
New insights into the evolutionary features of viral overlapping genes by discriminant analysis.通过判别分析深入了解病毒重叠基因的进化特征。
Virology. 2020 Jul;546:51-66. doi: 10.1016/j.virol.2020.03.007. Epub 2020 Apr 2.

引用本文的文献

1
Design of overlapping genes using deep generative models of protein sequences.使用蛋白质序列深度生成模型设计重叠基因。
bioRxiv. 2025 May 7:2025.05.06.652464. doi: 10.1101/2025.05.06.652464.
2
Creation of the HIV-1 antisense gene coincided with the emergence of the pandemic group M and is associated with faster disease progression.HIV-1 反义基因的产生与流行组 M 的出现同时发生,与疾病的快速进展有关。
Microbiol Spectr. 2024 Feb 6;12(2):e0380223. doi: 10.1128/spectrum.03802-23. Epub 2024 Jan 17.
3
Overlaps Between CDS Regions of Protein-Coding Genes in the Human Genome: A Case Study on the NR1D1-THRA Gene Pair.人类基因组中蛋白编码基因 CDS 区的重叠:以 NR1D1-THRA 基因对为例。
J Mol Evol. 2023 Dec;91(6):963-975. doi: 10.1007/s00239-023-10147-8. Epub 2023 Nov 25.
4
Alternative Reading Frames are an Underappreciated Source of Protein Sequence Novelty.可变阅读框是蛋白质序列新颖性的一个未被充分认识的来源。
J Mol Evol. 2023 Oct;91(5):570-580. doi: 10.1007/s00239-023-10122-3. Epub 2023 Jun 16.
5
A small protein coded within the mitochondrial canonical gene nd4 regulates mitochondrial bioenergetics.一个在线粒体经典基因 nd4 内编码的小蛋白调节线粒体生物能。
BMC Biol. 2023 May 18;21(1):111. doi: 10.1186/s12915-023-01609-y.
6
Three Novel Antisense Overlapping Genes in E. coli O157:H7 EDL933.大肠杆菌 O157:H7 EDL933 中的三个新的反义重叠基因。
Microbiol Spectr. 2023 Feb 14;11(1):e0235122. doi: 10.1128/spectrum.02351-22. Epub 2022 Dec 19.
7
Viral Complexity.病毒复杂性。
Biomolecules. 2022 Jul 30;12(8):1061. doi: 10.3390/biom12081061.
8
Functional and structural segregation of overlapping helices in HIV-1.HIV-1 中重叠螺旋的功能和结构隔离。
Elife. 2022 May 5;11:e72482. doi: 10.7554/eLife.72482.
9
Spacer Domain in Hepatitis B Virus Polymerase: Plugging a Hole or Performing a Role?乙型肝炎病毒聚合酶中的间隔结构域:填补空缺还是发挥作用?
J Virol. 2022 May 11;96(9):e0005122. doi: 10.1128/jvi.00051-22. Epub 2022 Apr 12.
10
Gene Overlapping as a Modulator of Evolution.基因重叠作为进化的一种调节因素。
Microorganisms. 2022 Feb 4;10(2):366. doi: 10.3390/microorganisms10020366.

本文引用的文献

1
Origin of primate orphan genes: a comparative genomics approach.灵长类孤儿基因的起源:一种比较基因组学方法。
Mol Biol Evol. 2009 Mar;26(3):603-12. doi: 10.1093/molbev/msn281. Epub 2008 Dec 8.
2
Structure of flexible filamentous plant viruses.柔性丝状植物病毒的结构。
J Virol. 2008 Oct;82(19):9546-54. doi: 10.1128/JVI.00895-08. Epub 2008 Jul 30.
3
On the origin of new genes in Drosophila.论果蝇中新基因的起源
Genome Res. 2008 Sep;18(9):1446-55. doi: 10.1101/gr.076588.108. Epub 2008 Jun 11.
4
Bioinformatic analysis suggests that a conserved ORF in the waikaviruses encodes an overlapping gene.生物信息学分析表明,怀卡病毒中的一个保守开放阅读框编码一个重叠基因。
Arch Virol. 2008;153(7):1379-83. doi: 10.1007/s00705-008-0119-5. Epub 2008 Jun 6.
5
The twilight zone between protein order and disorder.蛋白质有序与无序之间的过渡区域。
Biophys J. 2008 Aug;95(4):1612-26. doi: 10.1529/biophysj.108.131151. Epub 2008 Apr 25.
6
Bioinformatics prediction of overlapping frameshifted translation products in mammalian transcripts.哺乳动物转录本中重叠移码翻译产物的生物信息学预测
BMC Genomics. 2008 Mar 6;9:122. doi: 10.1186/1471-2164-9-122.
7
Rates of evolutionary change in viruses: patterns and determinants.病毒的进化变化速率:模式与决定因素
Nat Rev Genet. 2008 Apr;9(4):267-76. doi: 10.1038/nrg2323. Epub 2008 Mar 4.
8
Prediction of intrinsic disorder and its use in functional proteomics.内在无序性的预测及其在功能蛋白质组学中的应用。
Methods Mol Biol. 2007;408:69-92. doi: 10.1007/978-1-59745-547-3_5.
9
Fuzzy complexes: polymorphism and structural disorder in protein-protein interactions.模糊复合物:蛋白质-蛋白质相互作用中的多态性与结构无序
Trends Biochem Sci. 2008 Jan;33(1):2-8. doi: 10.1016/j.tibs.2007.10.003. Epub 2007 Nov 28.
10
The Pfam protein families database.Pfam蛋白质家族数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D281-8. doi: 10.1093/nar/gkm960. Epub 2007 Nov 26.