• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从表达数据预测基因缺失的进化靶点和参数。

Predicting evolutionary targets and parameters of gene deletion from expression data.

作者信息

Campelo Dos Santos Andre Luiz, DeGiorgio Michael, Assis Raquel

机构信息

Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, United States.

Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, FL 33431, United States.

出版信息

Bioinform Adv. 2024 Jan 17;4(1):vbae002. doi: 10.1093/bioadv/vbae002. eCollection 2024.

DOI:10.1093/bioadv/vbae002
PMID:38282974
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10812876/
Abstract

MOTIVATION

Gene deletion is traditionally thought of as a nonadaptive process that removes functional redundancy from genomes, such that it generally receives less attention than duplication in evolutionary turnover studies. Yet, mounting evidence suggests that deletion may promote adaptation via the "less-is-more" evolutionary hypothesis, as it often targets genes harboring unique sequences, expression profiles, and molecular functions. Hence, predicting the relative prevalence of redundant and unique functions among genes targeted by deletion, as well as the parameters underlying their evolution, can shed light on the role of gene deletion in adaptation.

RESULTS

Here, we present CLOUDe, a suite of machine learning methods for predicting evolutionary targets of gene deletion events from expression data. Specifically, CLOUDe models expression evolution as an Ornstein-Uhlenbeck process, and uses multi-layer neural network, extreme gradient boosting, random forest, and support vector machine architectures to predict whether deleted genes are "redundant" or "unique", as well as several parameters underlying their evolution. We show that CLOUDe boasts high power and accuracy in differentiating between classes, and high accuracy and precision in estimating evolutionary parameters, with optimal performance achieved by its neural network architecture. Application of CLOUDe to empirical data from suggests that deletion primarily targets genes with unique functions, with further analysis showing these functions to be enriched for protein deubiquitination. Thus, CLOUDe represents a key advance in learning about the role of gene deletion in functional evolution and adaptation.

AVAILABILITY AND IMPLEMENTATION

CLOUDe is freely available on GitHub (https://github.com/anddssan/CLOUDe).

摘要

动机

传统上认为基因缺失是一个非适应性过程,它从基因组中去除功能冗余,因此在进化更替研究中,它通常比基因复制受到的关注更少。然而,越来越多的证据表明,基因缺失可能通过“少即是多”的进化假说促进适应性,因为它通常靶向具有独特序列、表达谱和分子功能的基因。因此,预测基因缺失靶向基因中冗余和独特功能的相对流行程度,以及它们进化的潜在参数,有助于阐明基因缺失在适应性中的作用。

结果

在这里,我们展示了CLOUDe,这是一套用于从表达数据预测基因缺失事件进化靶点的机器学习方法。具体来说,CLOUDe将表达进化建模为奥恩斯坦 - 乌伦贝克过程,并使用多层神经网络、极端梯度提升、随机森林和支持向量机架构来预测缺失基因是“冗余的”还是“独特的”,以及它们进化的几个潜在参数。我们表明,CLOUDe在区分类别方面具有高功效和准确性,在估计进化参数方面具有高精度和精确性,其神经网络架构实现了最佳性能。将CLOUDe应用于来自[具体来源未给出]的实证数据表明,基因缺失主要靶向具有独特功能的基因,进一步分析表明这些功能在蛋白质去泛素化方面富集。因此,CLOUDe代表了在了解基因缺失在功能进化和适应性中的作用方面的一项关键进展。

可用性和实现方式

CLOUDe可在GitHub(https://github.com/anddssan/CLOUDe)上免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/9a620f089309/vbae002f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/4e5896089266/vbae002f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/d00b28a52d29/vbae002f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/ac73e8dacfa3/vbae002f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/2830a7452881/vbae002f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/f96e9ebacb84/vbae002f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/9a620f089309/vbae002f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/4e5896089266/vbae002f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/d00b28a52d29/vbae002f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/ac73e8dacfa3/vbae002f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/2830a7452881/vbae002f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/f96e9ebacb84/vbae002f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/963d/10812876/9a620f089309/vbae002f6.jpg

相似文献

1
Predicting evolutionary targets and parameters of gene deletion from expression data.从表达数据预测基因缺失的进化靶点和参数。
Bioinform Adv. 2024 Jan 17;4(1):vbae002. doi: 10.1093/bioadv/vbae002. eCollection 2024.
2
Predicting gene expression divergence between single-copy orthologs in two species.预测两个物种中单拷贝直系同源基因之间的基因表达差异。
Genome Biol Evol. 2023 May 12;15(5). doi: 10.1093/gbe/evad078.
3
Learning Retention Mechanisms and Evolutionary Parameters of Duplicate Genes from Their Expression Data.从表达数据中学习重复基因的保留机制和进化参数。
Mol Biol Evol. 2021 Mar 9;38(3):1209-1224. doi: 10.1093/molbev/msaa267.
4
Out of the testis, into the ovary: biased outcomes of gene duplication and deletion in Drosophila.从睾丸到卵巢:果蝇中基因重复和缺失的偏向性结果。
Evolution. 2019 Sep;73(9):1850-1862. doi: 10.1111/evo.13820.
5
GSEA-SDBE: A gene selection method for breast cancer classification based on GSEA and analyzing differences in performance metrics.GSEA-SDBE:一种基于基因集富集分析(GSEA)并分析性能指标差异的乳腺癌分类基因选择方法。
PLoS One. 2022 Apr 26;17(4):e0263171. doi: 10.1371/journal.pone.0263171. eCollection 2022.
6
Neurogenomics and the role of a large mutational target on rapid behavioral change.神经基因组学以及大突变靶点在快速行为变化中的作用。
Biol Direct. 2016 Nov 8;11(1):60. doi: 10.1186/s13062-016-0162-1.
7
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
8
Predicting genome-wide redundancy using machine learning.使用机器学习预测全基因组冗余。
BMC Evol Biol. 2010 Nov 18;10:357. doi: 10.1186/1471-2148-10-357.
9
Invariant transformers of Robinson and Foulds distance matrices for Convolutional Neural Network.不变的 Robinson 和 Foulds 距离矩阵变换用于卷积神经网络。
J Bioinform Comput Biol. 2022 Aug;20(4):2250012. doi: 10.1142/S0219720022500123. Epub 2022 Jul 6.
10
End-to-end learning of evolutionary models to find coding regions in genome alignments.用于在基因组比对中寻找编码区域的进化模型的端到端学习。
Bioinformatics. 2022 Mar 28;38(7):1857-1862. doi: 10.1093/bioinformatics/btac028.

引用本文的文献

1
Discriminating models of trait evolution.性状进化的判别模型。
bioRxiv. 2025 Jun 13:2025.06.12.659377. doi: 10.1101/2025.06.12.659377.

本文引用的文献

1
Predicting gene expression divergence between single-copy orthologs in two species.预测两个物种中单拷贝直系同源基因之间的基因表达差异。
Genome Biol Evol. 2023 May 12;15(5). doi: 10.1093/gbe/evad078.
2
The Gene Ontology knowledgebase in 2023.2023 版基因本体论知识库。
Genetics. 2023 May 4;224(1). doi: 10.1093/genetics/iyad031.
3
DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update).DAVID:一个用于基因列表功能富集分析和功能注释的网络服务器(2021 更新)。
Nucleic Acids Res. 2022 Jul 5;50(W1):W216-W221. doi: 10.1093/nar/gkac194.
4
FlyBase: a guided tour of highlighted features.FlyBase:特色功能导览
Genetics. 2022 Apr 4;220(4). doi: 10.1093/genetics/iyac035.
5
Nuclear transport genes recurrently duplicate by means of RNA intermediates in Drosophila but not in other insects.在果蝇中,核转运基因通过 RNA 中间体频繁复制,但在其他昆虫中则不然。
BMC Genomics. 2021 Dec 5;22(1):876. doi: 10.1186/s12864-021-08170-4.
6
Learning Retention Mechanisms and Evolutionary Parameters of Duplicate Genes from Their Expression Data.从表达数据中学习重复基因的保留机制和进化参数。
Mol Biol Evol. 2021 Mar 9;38(3):1209-1224. doi: 10.1093/molbev/msaa267.
7
Ran GTPase: A Key Player in Tumor Progression and Metastasis.Ran GTP酶:肿瘤进展和转移中的关键因子
Front Cell Dev Biol. 2020 May 26;8:345. doi: 10.3389/fcell.2020.00345. eCollection 2020.
8
Out of the testis, into the ovary: biased outcomes of gene duplication and deletion in Drosophila.从睾丸到卵巢:果蝇中基因重复和缺失的偏向性结果。
Evolution. 2019 Sep;73(9):1850-1862. doi: 10.1111/evo.13820.
9
Rapid functional divergence after small-scale gene duplication in grasses.禾本科中小规模基因复制后的快速功能分化。
BMC Evol Biol. 2019 May 2;19(1):97. doi: 10.1186/s12862-019-1415-2.
10
Phylogenetic Comparative Analysis: A Modeling Approach for Adaptive Evolution.系统发育比较分析:一种适应性进化的建模方法。
Am Nat. 2004 Dec;164(6):683-695. doi: 10.1086/426002.