• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因间开放阅读框作为从头起源基因诞生和蛋白质进化的基本结构模块。

Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution.

作者信息

Papadopoulos Chris, Callebaut Isabelle, Gelly Jean-Christophe, Hatin Isabelle, Namy Olivier, Renard Maxime, Lespinet Olivier, Lopes Anne

机构信息

Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France.

Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005 Paris, France.

出版信息

Genome Res. 2021 Dec;31(12):2303-2315. doi: 10.1101/gr.275638.121. Epub 2021 Nov 22.

DOI:10.1101/gr.275638.121
PMID:34810219
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8647833/
Abstract

The noncoding genome plays an important role in de novo gene birth and in the emergence of genetic novelty. Nevertheless, how noncoding sequences' properties could promote the birth of novel genes and shape the evolution and the structural diversity of proteins remains unclear. Therefore, by combining different bioinformatic approaches, we characterized the fold potential diversity of the amino acid sequences encoded by all intergenic open reading frames (ORFs) of with the aim of (1) exploring whether the structural states' diversity of proteomes is already present in noncoding sequences, and (2) estimating the potential of the noncoding genome to produce novel protein bricks that could either give rise to novel genes or be integrated into pre-existing proteins, thus participating in protein structure diversity and evolution. We showed that amino acid sequences encoded by most yeast intergenic ORFs contain the elementary building blocks of protein structures. Moreover, they encompass the large structural state diversity of canonical proteins, with the majority predicted as foldable. Then, we investigated the early stages of de novo gene birth by reconstructing the ancestral sequences of 70 yeast de novo genes and characterized the sequence and structural properties of intergenic ORFs with a strong translation signal. This enabled us to highlight sequence and structural factors determining de novo gene emergence. Finally, we showed a strong correlation between the fold potential of de novo proteins and one of their ancestral amino acid sequences, reflecting the relationship between the noncoding genome and the protein structure universe.

摘要

非编码基因组在新生基因产生和遗传新奇性出现中发挥着重要作用。然而,非编码序列的特性如何促进新基因的产生以及塑造蛋白质的进化和结构多样性仍不清楚。因此,通过结合不同的生物信息学方法,我们对酿酒酵母所有基因间开放阅读框(ORF)编码的氨基酸序列的折叠潜力多样性进行了表征,目的是:(1)探究蛋白质组的结构状态多样性是否已存在于非编码序列中;(2)评估非编码基因组产生新型蛋白质模块的潜力,这些模块既可以产生新基因,也可以整合到现有蛋白质中,从而参与蛋白质结构多样性和进化。我们发现,大多数酵母基因间ORF编码的氨基酸序列包含蛋白质结构的基本构建块。此外,它们涵盖了典型蛋白质的多种结构状态,大多数被预测为可折叠的。然后,我们通过重建70个酵母新生基因的祖先序列,研究了新生基因产生的早期阶段,并对具有强翻译信号的基因间ORF的序列和结构特性进行了表征。这使我们能够突出决定新生基因出现的序列和结构因素。最后,我们发现新生蛋白质的折叠潜力与其一个祖先氨基酸序列之间存在很强的相关性,这反映了非编码基因组与蛋白质结构世界之间的关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/cfb16b9c5156/2303f07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/789bf4c1b4b3/2303f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/6ae6b88478ee/2303f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/b2ee9c924d77/2303f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/478fbf438725/2303f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/525f8b0e64b4/2303f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/cc482f641b3c/2303f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/cfb16b9c5156/2303f07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/789bf4c1b4b3/2303f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/6ae6b88478ee/2303f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/b2ee9c924d77/2303f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/478fbf438725/2303f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/525f8b0e64b4/2303f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/cc482f641b3c/2303f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf6b/8647833/cfb16b9c5156/2303f07.jpg

相似文献

1
Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution.基因间开放阅读框作为从头起源基因诞生和蛋白质进化的基本结构模块。
Genome Res. 2021 Dec;31(12):2303-2315. doi: 10.1101/gr.275638.121. Epub 2021 Nov 22.
2
Turnover of ribosome-associated transcripts from de novo ORFs produces gene-like characteristics available for de novo gene emergence in wild yeast populations.从头 ORF 产生的核糖体相关转录本的周转率产生了可供野生酵母种群中新基因出现的基因特征。
Genome Res. 2019 Jun;29(6):932-943. doi: 10.1101/gr.239822.118. Epub 2019 May 31.
3
Stochastic Gain and Loss of Novel Transcribed Open Reading Frames in the Human Lineage.人类谱系中新型转录开放阅读框的随机增益和损失。
Genome Biol Evol. 2020 Nov 3;12(11):2183-2195. doi: 10.1093/gbe/evaa194.
4
A putative scenario of how de novo protein-coding genes originate in the Saccharomyces cerevisiae lineage.酵母属中从头产生蛋白质编码基因的推测情景。
BMC Genomics. 2024 Sep 5;25(Suppl 3):834. doi: 10.1186/s12864-024-10669-5.
5
Proto-genes and de novo gene birth.原基因和从头基因的诞生。
Nature. 2012 Jul 19;487(7407):370-4. doi: 10.1038/nature11184.
6
Ancestral Sequence Reconstruction as a Tool to Detect and Study De Novo Gene Emergence.祖先序列重建作为一种检测和研究新基因出现的工具。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae151.
7
Genome Plasticity in Papillomaviruses and De Novo Emergence of E5 Oncogenes.乳头瘤病毒的基因组可塑性和 E5 癌基因的从头出现。
Genome Biol Evol. 2019 Jun 1;11(6):1602-1617. doi: 10.1093/gbe/evz095.
8
High Polymorphism Levels of De Novo ORFs in a Yoruba Human Population.新发现的 Yoruba 人群中从头 ORF 的高度多态性。
Genome Biol Evol. 2024 Jul 3;16(7). doi: 10.1093/gbe/evae126.
9
Exploring the Peptide Potential of Genomes.探索基因组中的肽潜力。
Methods Mol Biol. 2022;2405:63-82. doi: 10.1007/978-1-0716-1855-4_3.
10
De novo ORFs in Drosophila are important to organismal fitness and evolved rapidly from previously non-coding sequences.果蝇中的从头 ORF 对生物适应性很重要,并且从先前的非编码序列中快速进化而来。
PLoS Genet. 2013;9(10):e1003860. doi: 10.1371/journal.pgen.1003860. Epub 2013 Oct 17.

引用本文的文献

1
The ribosome profiling landscape of yeast reveals a high diversity in pervasive translation.酵母核糖体图谱揭示了广泛翻译中的高度多样性。
Genome Biol. 2024 Oct 14;25(1):268. doi: 10.1186/s13059-024-03403-7.
2
De Novo Emerged Gene Search in Eukaryotes with DENSE.从头开始在真核生物中搜索新兴基因 DENSE。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae159.
3
Ancestral Sequence Reconstruction as a Tool to Detect and Study De Novo Gene Emergence.祖先序列重建作为一种检测和研究新基因出现的工具。

本文引用的文献

1
Deriving and Using Descriptors of Elementary Functions in Rational Protein Design.在合理蛋白质设计中推导和使用基本函数描述符。
Front Bioinform. 2021 Apr 13;1:657529. doi: 10.3389/fbinf.2021.657529. eCollection 2021.
2
Protlego: a Python package for the analysis and design of chimeric proteins.Protlego:一个用于嵌合蛋白分析与设计的Python软件包。
Bioinformatics. 2021 Oct 11;37(19):3182-3189. doi: 10.1093/bioinformatics/btab253.
3
Structural and functional characterization of a putative de novo gene in Drosophila.在果蝇中一个假定的从头基因的结构和功能特征。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae151.
4
Are Most Human-Specific Proteins Encoded by Long Noncoding RNAs?大多数人类特异性蛋白是否由长非编码 RNA 编码?
J Mol Evol. 2024 Aug;92(4):363-370. doi: 10.1007/s00239-024-10174-z. Epub 2024 Jun 25.
5
Modeling Length Changes in De Novo Open Reading Frames during Neutral Evolution.中性进化过程中从头开始的开放阅读框长度变化的建模。
Genome Biol Evol. 2024 Jul 3;16(7). doi: 10.1093/gbe/evae129.
6
The Rapid Evolution of De Novo Proteins in Structure and Complex.从头蛋白质在结构和复杂性上的快速进化。
Genome Biol Evol. 2024 Jun 4;16(6). doi: 10.1093/gbe/evae107.
7
ProTInSeq: transposon insertion tracking by ultra-deep DNA sequencing to identify translated large and small ORFs.ProTInSeq:通过超深度 DNA 测序进行转座子插入追踪,以鉴定翻译的大、小 ORF。
Nat Commun. 2024 Mar 7;15(1):2091. doi: 10.1038/s41467-024-46112-2.
8
Pervasive translation of Xrn1-sensitive unstable long noncoding RNAs in yeast.酵母中 Xrn1 敏感的不稳定长非编码 RNA 的普遍翻译。
RNA. 2024 May 16;30(6):662-679. doi: 10.1261/rna.079903.123.
9
Allele biased transcription factor binding across human brain regions gives mechanistic insight into eQTLs.全人类大脑区域中偏向性等位基因的转录因子结合为基因表达数量性状位点提供了机制性见解。
bioRxiv. 2023 Oct 9:2023.10.06.561245. doi: 10.1101/2023.10.06.561245.
10
Alternative Reading Frames are an Underappreciated Source of Protein Sequence Novelty.可变阅读框是蛋白质序列新颖性的一个未被充分认识的来源。
J Mol Evol. 2023 Oct;91(5):570-580. doi: 10.1007/s00239-023-10122-3. Epub 2023 Jun 16.
Nat Commun. 2021 Mar 12;12(1):1667. doi: 10.1038/s41467-021-21667-6.
4
Most non-canonical proteins uniquely populate the proteome or immunopeptidome.大多数非规范蛋白是蛋白质组或免疫肽组所特有的。
Cell Rep. 2021 Mar 9;34(10):108815. doi: 10.1016/j.celrep.2021.108815.
5
Structure and function of naturally evolved de novo proteins.天然进化的从头蛋白质的结构与功能。
Curr Opin Struct Biol. 2021 Jun;68:175-183. doi: 10.1016/j.sbi.2020.11.010. Epub 2021 Feb 7.
6
Uncovering de novo gene birth in yeast using deep transcriptomics.利用深度转录组学揭示酵母中的新基因诞生。
Nat Commun. 2021 Jan 27;12(1):604. doi: 10.1038/s41467-021-20911-3.
7
Bridging Themes: Short Protein Segments Found in Different Architectures.桥接主题:不同结构中发现的短蛋白片段。
Mol Biol Evol. 2021 May 19;38(6):2191-2208. doi: 10.1093/molbev/msab017.
8
RCSB Protein Data Bank: powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences.RCSB 蛋白质数据库:用于基础生物学、生物医学、生物技术、生物工程和能源科学等领域的基础研究、应用研究和教育中探索生物大分子三维结构的强大新工具。
Nucleic Acids Res. 2021 Jan 8;49(D1):D437-D451. doi: 10.1093/nar/gkaa1038.
9
Thermodynamic and Evolutionary Coupling between the Native and Amyloid State of Globular Proteins.球状蛋白天然态和淀粉样态之间的热力学和进化耦合。
Cell Rep. 2020 Apr 14;31(2):107512. doi: 10.1016/j.celrep.2020.03.076.
10
A Continuum of Evolving De Novo Genes Drives Protein-Coding Novelty in Drosophila.不断进化的从头基因连续体驱动果蝇的蛋白质编码新功能。
J Mol Evol. 2020 May;88(4):382-398. doi: 10.1007/s00239-020-09939-z. Epub 2020 Apr 7.