• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用带有结构推理的对比学习改进酶功能注释预测。

Improved enzyme functional annotation prediction using contrastive learning with structural inference.

作者信息

Yang Yuxin, Jerger Abby, Feng Song, Wang Zixu, Brasfield Christina, Cheung Margaret S, Zucker Jeremy, Guan Qiang

机构信息

Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic, 9500 Euclid Ave, Cleveland, OH, 44195, USA.

Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, 9500 Euclid Ave, Cleveland, OH, 44195, USA.

出版信息

Commun Biol. 2024 Dec 23;7(1):1690. doi: 10.1038/s42003-024-07359-z.

DOI:10.1038/s42003-024-07359-z
PMID:39715863
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11666736/
Abstract

Recent years have witnessed the remarkable progress of deep learning within the realm of scientific disciplines, yielding a wealth of promising outcomes. A prominent challenge within this domain has been the task of predicting enzyme function, a complex problem that has seen the development of numerous computational methods, particularly those rooted in deep learning techniques. However, the majority of these methods have primarily focused on either amino acid sequence data or protein structure data, neglecting the potential synergy of combining both modalities. To address this gap, we propose a Contrastive Learning framework for Enzyme functional ANnotation prediction combined with protein amino acid sequences and Contact maps (CLEAN-Contact). We rigorously evaluate the performance of our CLEAN-Contact framework against the state-of-the-art enzyme function prediction models using multiple benchmark datasets. Using CLEAN-Contact, we predict previously unknown enzyme functions within the proteome of Prochlorococcus marinus MED4. Our findings convincingly demonstrate the substantial superiority of our CLEAN-Contact framework, marking a significant step forward in enzyme function prediction accuracy.

摘要

近年来,深度学习在科学学科领域取得了显著进展,产生了大量有前景的成果。该领域一个突出的挑战是预测酶功能的任务,这是一个复杂的问题,已经有许多计算方法得到发展,特别是那些基于深度学习技术的方法。然而,这些方法大多数主要集中在氨基酸序列数据或蛋白质结构数据上,忽略了结合这两种模态的潜在协同作用。为了弥补这一差距,我们提出了一种用于酶功能注释预测的对比学习框架,该框架结合了蛋白质氨基酸序列和接触图(CLEAN-Contact)。我们使用多个基准数据集,严格评估了我们的CLEAN-Contact框架相对于当前最先进的酶功能预测模型的性能。使用CLEAN-Contact,我们预测了海洋原绿球藻MED4蛋白质组中以前未知的酶功能。我们的研究结果令人信服地证明了我们的CLEAN-Contact框架的显著优越性,标志着酶功能预测准确性向前迈出了重要一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e87/11666736/cf5008a43dd7/42003_2024_7359_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e87/11666736/830f4dc2911e/42003_2024_7359_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e87/11666736/5fba5ba1fa84/42003_2024_7359_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e87/11666736/6931232810f6/42003_2024_7359_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e87/11666736/cf5008a43dd7/42003_2024_7359_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e87/11666736/830f4dc2911e/42003_2024_7359_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e87/11666736/5fba5ba1fa84/42003_2024_7359_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e87/11666736/6931232810f6/42003_2024_7359_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e87/11666736/cf5008a43dd7/42003_2024_7359_Fig4_HTML.jpg

相似文献

1
Improved enzyme functional annotation prediction using contrastive learning with structural inference.使用带有结构推理的对比学习改进酶功能注释预测。
Commun Biol. 2024 Dec 23;7(1):1690. doi: 10.1038/s42003-024-07359-z.
2
Enzyme function prediction using contrastive learning.使用对比学习进行酶功能预测。
Science. 2023 Mar 31;379(6639):1358-1363. doi: 10.1126/science.adf2465. Epub 2023 Mar 30.
3
Parallel Convolutional Contrastive Learning Method for Enzyme Function Prediction.用于酶功能预测的并行卷积对比学习方法
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2604-2609. doi: 10.1109/TCBB.2024.3447037. Epub 2024 Dec 10.
4
Multi-modal deep learning enables efficient and accurate annotation of enzymatic active sites.多模态深度学习可实现酶活性位点的高效准确标注。
Nat Commun. 2024 Aug 27;15(1):7348. doi: 10.1038/s41467-024-51511-6.
5
Comparative Assessment of Protein Large Language Models for Enzyme Commission Number Prediction.用于酶委员会编号预测的蛋白质大语言模型的比较评估
BMC Bioinformatics. 2025 Feb 27;26(1):68. doi: 10.1186/s12859-025-06081-9.
6
DEEPre: sequence-based enzyme EC number prediction by deep learning.DEEPre:基于深度学习的酶 EC 号序列预测。
Bioinformatics. 2018 Mar 1;34(5):760-769. doi: 10.1093/bioinformatics/btx680.
7
Using deep learning to annotate the protein universe.利用深度学习标注蛋白质宇宙。
Nat Biotechnol. 2022 Jun;40(6):932-937. doi: 10.1038/s41587-021-01179-w. Epub 2022 Feb 21.
8
Functional profiling of the sequence stockpile: a protein pair-based assessment of in silico prediction tools.序列储备的功能分析:基于蛋白质对的计算机预测工具评估
Bioinformatics. 2025 Feb 4;41(2). doi: 10.1093/bioinformatics/btaf035.
9
Use of Chou's 5-steps rule to predict the subcellular localization of gram-negative and gram-positive bacterial proteins by multi-label learning based on gene ontology annotation and profile alignment.利用 Chou 的 5 步规则,通过基于基因本体论注释和序列比对的多标签学习,预测革兰氏阴性和革兰氏阳性细菌蛋白质的亚细胞定位。
J Integr Bioinform. 2020 Jun 29;18(1):51-79. doi: 10.1515/jib-2019-0091.
10
ProtNote: a multimodal method for protein-function annotation.ProtNote:一种用于蛋白质功能注释的多模态方法。
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf170.

引用本文的文献

1
EZpred: improving deep learning-based enzyme function prediction using unlabeled sequence homologs.EZpred:利用未标记的序列同源物改进基于深度学习的酶功能预测
bioRxiv. 2025 Jul 14:2025.07.09.663945. doi: 10.1101/2025.07.09.663945.
2
Discovery, design, and engineering of enzymes based on molecular retrobiosynthesis.基于分子逆向生物合成的酶的发现、设计与工程
mLife. 2025 Mar 28;4(2):107-125. doi: 10.1002/mlf2.70009. eCollection 2025 Apr.
3
FGeneBERT: function-driven pre-trained gene language model for metagenomics.FGeneBERT:用于宏基因组学的功能驱动型预训练基因语言模型

本文引用的文献

1
Functional annotation of enzyme-encoding genes using deep learning with transformer layers.利用带有转换器层的深度学习对酶编码基因进行功能注释。
Nat Commun. 2023 Nov 14;14(1):7370. doi: 10.1038/s41467-023-43216-z.
2
Fast and accurate protein structure search with Foldseek.使用 Foldseek 进行快速准确的蛋白质结构搜索。
Nat Biotechnol. 2024 Feb;42(2):243-246. doi: 10.1038/s41587-023-01773-0. Epub 2023 May 8.
3
Enzyme function prediction using contrastive learning.使用对比学习进行酶功能预测。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf149.
4
ProCeSa: Contrast-Enhanced Structure-Aware Network for Thermostability Prediction with Protein Language Models.ProCeSa:用于蛋白质语言模型热稳定性预测的对比增强结构感知网络。
J Chem Inf Model. 2025 Mar 10;65(5):2304-2313. doi: 10.1021/acs.jcim.4c01752. Epub 2025 Feb 23.
Science. 2023 Mar 31;379(6639):1358-1363. doi: 10.1126/science.adf2465. Epub 2023 Mar 30.
4
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
5
ProteInfer, deep neural networks for protein functional inference.蛋白推断,用于蛋白质功能推断的深度神经网络。
Elife. 2023 Feb 27;12:e80942. doi: 10.7554/eLife.80942.
6
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库:用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。
Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.
7
Critical assessment of methods of protein structure prediction (CASP)-Round XIV.蛋白质结构预测方法的关键性评估(CASP)-第十四轮。
Proteins. 2021 Dec;89(12):1607-1617. doi: 10.1002/prot.26237. Epub 2021 Oct 7.
8
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
9
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning.ProtTrans:通过自监督学习理解生命语言。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7112-7127. doi: 10.1109/TPAMI.2021.3095381. Epub 2022 Sep 14.
10
Structure-based protein function prediction using graph convolutional networks.基于结构的蛋白质功能预测使用图卷积网络。
Nat Commun. 2021 May 26;12(1):3168. doi: 10.1038/s41467-021-23303-9.