• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从头开始在真核生物中搜索新兴基因 DENSE。

De Novo Emerged Gene Search in Eukaryotes with DENSE.

机构信息

Institute for Integrative Biology of the Cell (I2BC), Université Paris-Saclay, CEA, CNRS, 91198 Gif-sur-Yvette, France.

Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany.

出版信息

Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae159.

DOI:10.1093/gbe/evae159
PMID:39212967
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11363675/
Abstract

The discovery of de novo emerged genes, originating from previously noncoding DNA regions, challenges traditional views of species evolution. Indeed, the hypothesis of neutrally evolving sequences giving rise to functional proteins is highly unlikely. This conundrum has sparked numerous studies to quantify and characterize these genes, aiming to understand their functional roles and contributions to genome evolution. Yet, no fully automated pipeline for their identification is available. Therefore, we introduce DENSE (DE Novo emerged gene SEarch), an automated Nextflow pipeline based on two distinct steps: detection of taxonomically restricted genes (TRGs) through phylostratigraphy, and filtering of TRGs for de novo emerged genes via genome comparisons and synteny search. DENSE is available as a user-friendly command-line tool, while the second step is accessible through a web server upon providing a list of TRGs. Highly flexible, DENSE provides various strategy and parameter combinations, enabling users to adapt to specific configurations or define their own strategy through a rational framework, facilitating protocol communication, and study interoperability. We apply DENSE to seven model organisms, exploring the impact of its strategies and parameters on de novo gene predictions. This thorough analysis across species with different evolutionary rates reveals useful metrics for users to define input datasets, identify favorable/unfavorable conditions for de novo gene detection, and control potential biases in genome annotations. Additionally, predictions made for the seven model organisms are compiled into a requestable database, which we hope will serve as a reference for de novo emerged gene lists generated with specific criteria combinations.

摘要

从头出现的基因的发现,起源于以前的非编码 DNA 区域,挑战了物种进化的传统观点。事实上,序列中性进化产生功能蛋白的假说极不可能成立。这一难题引发了众多研究,旨在量化和描述这些基因,以了解它们的功能作用和对基因组进化的贡献。然而,目前还没有完全自动化的鉴定这些基因的流程。因此,我们引入了 DENSE(从头出现的基因搜索),这是一个基于两个不同步骤的自动化 Nextflow 流程:通过系统发生学限制基因(TRGs)检测,以及通过基因组比较和同线性搜索过滤 TRGs 以获得从头出现的基因。DENSE 提供了一个用户友好的命令行工具,而第二步可以通过提供 TRGs 列表在 Web 服务器上访问。高度灵活的 DENSE 提供了各种策略和参数组合,使用户能够适应特定的配置或通过合理的框架定义自己的策略,从而促进协议的交流和研究的互操作性。我们将 DENSE 应用于七个模式生物,探索其策略和参数对从头基因预测的影响。这种对具有不同进化率的物种的全面分析揭示了有用的指标,供用户定义输入数据集,识别从头基因检测的有利/不利条件,并控制基因组注释中的潜在偏差。此外,为七个模式生物进行的预测被编译到一个可请求的数据库中,我们希望该数据库将成为根据特定标准组合生成的从头出现的基因列表的参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/3a778132afdc/evae159f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/588404b7a363/evae159f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/a69e4fa059c1/evae159f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/c3cd6819d014/evae159f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/258f8ce8af4f/evae159f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/df351bd15198/evae159f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/6fa73202561f/evae159f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/3a778132afdc/evae159f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/588404b7a363/evae159f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/a69e4fa059c1/evae159f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/c3cd6819d014/evae159f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/258f8ce8af4f/evae159f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/df351bd15198/evae159f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/6fa73202561f/evae159f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fac8/11363675/3a778132afdc/evae159f7.jpg

相似文献

1
De Novo Emerged Gene Search in Eukaryotes with DENSE.从头开始在真核生物中搜索新兴基因 DENSE。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae159.
2
fagin: synteny-based phylostratigraphy and finer classification of young genes.法金:基于同线性的系统发生地层学和年轻基因的更精细分类。
BMC Bioinformatics. 2019 Aug 27;20(1):440. doi: 10.1186/s12859-019-3023-y.
3
Only a Single Taxonomically Restricted Gene Family in the Drosophila melanogaster Subgroup Can Be Identified with High Confidence.仅有一个在果蝇亚组中具有严格分类学限制的基因家族可以被高度确信地识别。
Genome Biol Evol. 2020 Aug 1;12(8):1355-1366. doi: 10.1093/gbe/evaa127.
4
Computational Prediction of De Novo Emerged Protein-Coding Genes.从头出现的蛋白质编码基因的计算预测
Methods Mol Biol. 2019;1851:63-81. doi: 10.1007/978-1-4939-8736-8_4.
5
From De Novo to "De Nono": The Majority of Novel Protein-Coding Genes Identified with Phylostratigraphy Are Old Genes or Recent Duplicates.从头新到“第九新”:系统发生分类学鉴定的大多数新蛋白编码基因是旧基因或近期重复基因。
Genome Biol Evol. 2018 Nov 1;10(11):2906-2918. doi: 10.1093/gbe/evy231.
6
Ancestral Sequence Reconstruction as a Tool to Detect and Study De Novo Gene Emergence.祖先序列重建作为一种检测和研究新基因出现的工具。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae151.
7
No Evidence for Phylostratigraphic Bias Impacting Inferences on Patterns of Gene Emergence and Evolution.没有证据表明系统发育地层学偏差会影响对基因出现和进化模式的推断。
Mol Biol Evol. 2017 Apr 1;34(4):843-856. doi: 10.1093/molbev/msw284.
8
Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences.芽孢杆菌中分类受限的基因可能形成同源物簇,并可追溯到大量非编码序列的文库。
Genome Biol Evol. 2023 Mar 3;15(3). doi: 10.1093/gbe/evad023.
9
De novo ORFs in Drosophila are important to organismal fitness and evolved rapidly from previously non-coding sequences.果蝇中的从头 ORF 对生物适应性很重要,并且从先前的非编码序列中快速进化而来。
PLoS Genet. 2013;9(10):e1003860. doi: 10.1371/journal.pgen.1003860. Epub 2013 Oct 17.
10
A De-Novo Genome Analysis Pipeline (DeNoGAP) for large-scale comparative prokaryotic genomics studies.一种用于大规模比较原核生物基因组学研究的从头基因组分析流程(DeNoGAP)。
BMC Bioinformatics. 2016 Jun 30;17(1):260. doi: 10.1186/s12859-016-1142-2.

引用本文的文献

1
Robust phylogenetic profile clustering for proteins.用于蛋白质的稳健系统发育谱聚类
PeerJ. 2025 Apr 28;13:e19370. doi: 10.7717/peerj.19370. eCollection 2025.
2
Orphan genes are not a distinct biological entity.孤儿基因并非一个独特的生物学实体。
Bioessays. 2025 Jan;47(1):e2400146. doi: 10.1002/bies.202400146. Epub 2024 Nov 3.

本文引用的文献

1
Ancestral Sequence Reconstruction as a Tool to Detect and Study De Novo Gene Emergence.祖先序列重建作为一种检测和研究新基因出现的工具。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae151.
2
The origin and structural evolution of de novo genes in Drosophila.果蝇中从头起源基因的起源与结构演化
Nat Commun. 2024 Jan 27;15(1):810. doi: 10.1038/s41467-024-45028-1.
3
What Can Ribo-Seq, Immunopeptidomics, and Proteomics Tell Us About the Noncanonical Proteome?核糖体测序、免疫肽组学和蛋白质组学能告诉我们非典型蛋白质组的什么信息?
Mol Cell Proteomics. 2023 Sep;22(9):100631. doi: 10.1016/j.mcpro.2023.100631. Epub 2023 Aug 11.
4
Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in .群体基因组学揭示了. 中从头表达的开放阅读框出现的机制和动态。
Genome Res. 2023 Jun;33(6):872-890. doi: 10.1101/gr.277482.122. Epub 2023 Jul 13.
5
A vast evolutionarily transient translatome contributes to phenotype and fitness.大量进化上短暂的翻译组(translatome)有助于表型和适应性。
Cell Syst. 2023 May 17;14(5):363-381.e8. doi: 10.1016/j.cels.2023.04.002. Epub 2023 May 9.
6
Evolutionary Trajectories of New Duplicated and Putative De Novo Genes.新复制和推定从头基因的进化轨迹。
Mol Biol Evol. 2023 May 2;40(5). doi: 10.1093/molbev/msad098.
7
Uncovering gene-family founder events during major evolutionary transitions in animals, plants and fungi using GenEra.利用 GenEra 揭示动物、植物和真菌主要进化转变过程中的基因家族起源事件。
Genome Biol. 2023 Mar 24;24(1):54. doi: 10.1186/s13059-023-02895-z.
8
Evaluating impacts of syntenic block detection strategies on rearrangement phylogeny using Mycobacterium tuberculosis isolates.评估基于连锁块检测策略对结核分枝杆菌分离株重排系统发育的影响。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad024.
9
TimeTree 5: An Expanded Resource for Species Divergence Times.TimeTree 5:物种分化时间的扩展资源。
Mol Biol Evol. 2022 Aug 6;39(8). doi: 10.1093/molbev/msac174.
10
The Origins and Functions of De Novo Genes: Against All Odds?从头基因的起源和功能:逆势而上?
J Mol Evol. 2022 Aug;90(3-4):244-257. doi: 10.1007/s00239-022-10055-3. Epub 2022 Apr 22.