• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

祖先序列重建作为一种检测和研究新基因出现的工具。

Ancestral Sequence Reconstruction as a Tool to Detect and Study De Novo Gene Emergence.

机构信息

Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece.

Pittsburgh Center for Evolutionary Biology and Medicine, Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA.

出版信息

Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae151.

DOI:10.1093/gbe/evae151
PMID:39004885
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11299112/
Abstract

New protein-coding genes can evolve from previously noncoding genomic regions through a process known as de novo gene emergence. Evidence suggests that this process has likely occurred throughout evolution and across the tree of life. Yet, confidently identifying de novo emerged genes remains challenging. Ancestral sequence reconstruction is a promising approach for inferring whether a gene has emerged de novo or not, as it allows us to inspect whether a given genomic locus ancestrally harbored protein-coding capacity. However, the use of ancestral sequence reconstruction in the context of de novo emergence is still in its infancy and its capabilities, limitations, and overall potential are largely unknown. Notably, it is difficult to formally evaluate the protein-coding capacity of ancestral sequences, particularly when new gene candidates are short. How well-suited is ancestral sequence reconstruction as a tool for the detection and study of de novo genes? Here, we address this question by designing an ancestral sequence reconstruction workflow incorporating different tools and sets of parameters and by introducing a formal criterion that allows to estimate, within a desired level of confidence, when protein-coding capacity originated at a particular locus. Applying this workflow on ∼2,600 short, annotated budding yeast genes (<1,000 nucleotides), we found that ancestral sequence reconstruction robustly predicts an ancient origin for the most widely conserved genes, which constitute "easy" cases. For less robust cases, we calculated a randomization-based empirical P-value estimating whether the observed conservation between the extant and ancestral reading frame could be attributed to chance. This formal criterion allowed us to pinpoint a branch of origin for most of the less robust cases, identifying 49 genes that can unequivocally be considered de novo originated since the split of the Saccharomyces genus, including 37 Saccharomyces cerevisiae-specific genes. We find that for the remaining equivocal cases we cannot rule out different evolutionary scenarios including rapid evolution, multiple gene losses, or a recent de novo origin. Overall, our findings suggest that ancestral sequence reconstruction is a valuable tool to study de novo gene emergence but should be applied with caution and awareness of its limitations.

摘要

新的蛋白质编码基因可以通过一个称为从头基因出现的过程从先前非编码基因组区域进化而来。有证据表明,这一过程可能在整个进化过程中和生命之树上都发生过。然而,自信地识别新出现的基因仍然具有挑战性。祖先序列重建是一种很有前途的方法,可以推断一个基因是否是从头出现的,因为它允许我们检查给定的基因组位点在祖先是否具有编码能力。然而,在从头出现的背景下使用祖先序列重建仍然处于起步阶段,其能力、限制和总体潜力在很大程度上是未知的。值得注意的是,很难正式评估祖先序列的蛋白质编码能力,特别是当新的基因候选者很短时。祖先序列重建作为检测和研究从头基因的工具的适用性如何?在这里,我们通过设计一个包含不同工具和参数集的祖先序列重建工作流程,并引入一个正式的标准来解决这个问题,该标准允许在给定的置信水平内估计蛋白质编码能力在特定位置起源的时间。将这个工作流程应用于大约 2600 个短的、注释的芽殖酵母基因(<1000 个核苷酸),我们发现,祖先序列重建可以稳健地预测最广泛保守基因的古老起源,这些基因构成了“简单”的情况。对于不太稳健的情况,我们计算了一个基于随机化的经验 P 值,估计在现存和祖先阅读框之间观察到的保守性是否可以归因于偶然。这个正式的标准允许我们确定大多数不太稳健的情况的起源分支,确定了 49 个可以明确被认为是从头起源的基因,包括 37 个酿酒酵母特有的基因。我们发现,对于其余模棱两可的情况,我们不能排除不同的进化情景,包括快速进化、多个基因丢失或最近的从头起源。总的来说,我们的发现表明,祖先序列重建是研究从头基因出现的一种有价值的工具,但应该谨慎应用,并意识到其局限性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/11299112/3fb9ec7b08d0/evae151f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/11299112/fecfc297cb64/evae151f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/11299112/a354a5356dbc/evae151f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/11299112/48a896f1444b/evae151f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/11299112/3fb9ec7b08d0/evae151f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/11299112/fecfc297cb64/evae151f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/11299112/a354a5356dbc/evae151f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/11299112/48a896f1444b/evae151f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ee2/11299112/3fb9ec7b08d0/evae151f4.jpg

相似文献

1
Ancestral Sequence Reconstruction as a Tool to Detect and Study De Novo Gene Emergence.祖先序列重建作为一种检测和研究新基因出现的工具。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae151.
2
A putative scenario of how de novo protein-coding genes originate in the Saccharomyces cerevisiae lineage.酵母属中从头产生蛋白质编码基因的推测情景。
BMC Genomics. 2024 Sep 5;25(Suppl 3):834. doi: 10.1186/s12864-024-10669-5.
3
Tracing the Origin of Protein-Coding Genes in Yeast.追踪酵母中蛋白质编码基因的起源。
mBio. 2018 Jul 31;9(4):e01024-18. doi: 10.1128/mBio.01024-18.
4
A Comprehensive Analysis of Transcript-Supported De Novo Genes in Saccharomyces sensu stricto Yeasts.对酿酒酵母属狭义酵母中转录本支持的从头合成基因的综合分析。
Mol Biol Evol. 2017 Nov 1;34(11):2823-2838. doi: 10.1093/molbev/msx210.
5
Computational Prediction of De Novo Emerged Protein-Coding Genes.从头出现的蛋白质编码基因的计算预测
Methods Mol Biol. 2019;1851:63-81. doi: 10.1007/978-1-4939-8736-8_4.
6
De novo origination of a new protein-coding gene in Saccharomyces cerevisiae.酿酒酵母中新蛋白质编码基因的从头起源。
Genetics. 2008 May;179(1):487-96. doi: 10.1534/genetics.107.084491.
7
A Continuum of Evolving De Novo Genes Drives Protein-Coding Novelty in Drosophila.不断进化的从头基因连续体驱动果蝇的蛋白质编码新功能。
J Mol Evol. 2020 May;88(4):382-398. doi: 10.1007/s00239-020-09939-z. Epub 2020 Apr 7.
8
From De Novo to "De Nono": The Majority of Novel Protein-Coding Genes Identified with Phylostratigraphy Are Old Genes or Recent Duplicates.从头新到“第九新”:系统发生分类学鉴定的大多数新蛋白编码基因是旧基因或近期重复基因。
Genome Biol Evol. 2018 Nov 1;10(11):2906-2918. doi: 10.1093/gbe/evy231.
9
fagin: synteny-based phylostratigraphy and finer classification of young genes.法金:基于同线性的系统发生地层学和年轻基因的更精细分类。
BMC Bioinformatics. 2019 Aug 27;20(1):440. doi: 10.1186/s12859-019-3023-y.
10
The origin and structural evolution of genes in .……中基因的起源与结构演化 。 (你提供的原文不完整,“in”后面缺少具体内容,以上是根据现有内容尽量准确翻译的结果 。)
bioRxiv. 2023 Jun 27:2023.03.13.532420. doi: 10.1101/2023.03.13.532420.

引用本文的文献

1
De novo gene birth and the conundrum of ORFan genes in bacteria.细菌中的从头基因诞生与孤儿基因难题
Genome Res. 2025 Aug 1;35(8):1679-1688. doi: 10.1101/gr.280157.124.
2
Annotation matters: the effect of structural gene annotation on orthology inference.注释很重要:结构基因注释对直系同源推断的影响。
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf365.
3
Robust phylogenetic profile clustering for proteins.用于蛋白质的稳健系统发育谱聚类

本文引用的文献

1
The origin and structural evolution of de novo genes in Drosophila.果蝇中从头起源基因的起源与结构演化
Nat Commun. 2024 Jan 27;15(1):810. doi: 10.1038/s41467-024-45028-1.
2
A vast evolutionarily transient translatome contributes to phenotype and fitness.大量进化上短暂的翻译组(translatome)有助于表型和适应性。
Cell Syst. 2023 May 17;14(5):363-381.e8. doi: 10.1016/j.cels.2023.04.002. Epub 2023 May 9.
3
Evolutionary origins and interactomes of human, young microproteins and small peptides translated from short open reading frames.
PeerJ. 2025 Apr 28;13:e19370. doi: 10.7717/peerj.19370. eCollection 2025.
4
De Novo Emerged Gene Search in Eukaryotes with DENSE.从头开始在真核生物中搜索新兴基因 DENSE。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae159.
人类、年轻微蛋白和小肽的进化起源和互作组,这些微蛋白和小肽是由短开放阅读框翻译而来的。
Mol Cell. 2023 Mar 16;83(6):994-1011.e18. doi: 10.1016/j.molcel.2023.01.023. Epub 2023 Feb 17.
4
De novo birth of functional microproteins in the human lineage.人类谱系中功能性微蛋白的从头生成。
Cell Rep. 2022 Dec 20;41(12):111808. doi: 10.1016/j.celrep.2022.111808.
5
Distinct chromosomal "niches" in the genome of provide the background for genomic innovation and shape the fate of gene duplicates.基因组中不同的染色体“生态位”为基因组创新提供了背景,并塑造了基因复制的命运。
NAR Genom Bioinform. 2022 Nov 14;4(4):lqac086. doi: 10.1093/nargab/lqac086. eCollection 2022 Dec.
6
Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution.基因间开放阅读框作为从头起源基因诞生和蛋白质进化的基本结构模块。
Genome Res. 2021 Dec;31(12):2303-2315. doi: 10.1101/gr.275638.121. Epub 2021 Nov 22.
7
Gotree/Goalign: toolkit and Go API to facilitate the development of phylogenetic workflows.Gotree/Goalign:用于促进系统发育工作流程开发的工具包和Go API。
NAR Genom Bioinform. 2021 Aug 11;3(3):lqab075. doi: 10.1093/nargab/lqab075. eCollection 2021 Sep.
8
Structural and functional characterization of a putative de novo gene in Drosophila.在果蝇中一个假定的从头基因的结构和功能特征。
Nat Commun. 2021 Mar 12;12(1):1667. doi: 10.1038/s41467-021-21667-6.
9
Many, but not all, lineage-specific genes can be explained by homology detection failure.许多(但不是全部)谱系特异性基因可以通过同源性检测失败来解释。
PLoS Biol. 2020 Nov 2;18(11):e3000862. doi: 10.1371/journal.pbio.3000862. eCollection 2020 Nov.
10
Synteny-based analyses indicate that sequence divergence is not the main source of orphan genes.基于基因共线性的分析表明,序列分化并不是孤基因的主要来源。
Elife. 2020 Feb 18;9:e53500. doi: 10.7554/eLife.53500.