• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ProteinF3S:通过融合蛋白质序列、结构和表面特征增强酶功能预测

ProteinF3S: boosting enzyme function prediction by fusing protein sequence, structure, and surface.

作者信息

Yuan Mingzhi, Shen Ao, Ma Yingfan, Du Jie, An Bohan, Wang Manning

机构信息

Digital Medical Research Center, School of Basic Medical Sciences, Fudan University, 131 Dong'an Road, 200032 Shanghai, China.

Shanghai Key Laboratory of Medical Image Computing and Computer Assisted Intervention, Fudan University, 131 Dong'an Road, 200032 Shanghai, China.

出版信息

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae695.

DOI:10.1093/bib/bbae695
PMID:39750023
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11697223/
Abstract

Proteins can be represented in different data forms, including sequence, structure, and surface, each of which has unique advantages and certain limitations. It is promising to fuse the complementary information among them. In this work, we propose a framework called ProteinF3S for enzyme function prediction that fuses the complementary information across protein sequence, structure, and surface. To achieve more effective fusion, we propose a multi-scale bidirectional fusion strategy between protein structure and surface, in which the hierarchical features of a surface encoder and a structure encoder interact with each other bidirectionally. Based on these interactions, more distinctive features can be obtained. After that, we achieve further fusion by concatenating the sequence features with the features containing structure and surface information, so that better performance can be achieved. To validate our method, we conduct extensive experiments on tasks including enzyme reaction classification and enzyme commission number prediction. Our method achieves new state-of-the-art performance and shows that fusing different forms of data is effective in enzyme function prediction.

摘要

蛋白质可以用不同的数据形式表示,包括序列、结构和表面,每种形式都有独特的优势和一定的局限性。融合它们之间的互补信息很有前景。在这项工作中,我们提出了一个名为ProteinF3S的框架用于酶功能预测,该框架融合了蛋白质序列、结构和表面的互补信息。为了实现更有效的融合,我们提出了一种蛋白质结构和表面之间的多尺度双向融合策略,其中表面编码器和结构编码器的层次特征相互双向交互。基于这些交互,可以获得更具特色的特征。之后,我们通过将序列特征与包含结构和表面信息的特征连接起来实现进一步融合,从而获得更好的性能。为了验证我们的方法,我们在酶反应分类和酶委员会编号预测等任务上进行了广泛的实验。我们的方法取得了新的最优性能,并表明融合不同形式的数据在酶功能预测中是有效的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd80/11697223/72660817999c/bbae695f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd80/11697223/e6653da8e463/bbae695f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd80/11697223/c7206a328ccf/bbae695f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd80/11697223/c530e8bc4bb3/bbae695f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd80/11697223/72660817999c/bbae695f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd80/11697223/e6653da8e463/bbae695f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd80/11697223/c7206a328ccf/bbae695f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd80/11697223/c530e8bc4bb3/bbae695f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd80/11697223/72660817999c/bbae695f4.jpg

相似文献

1
ProteinF3S: boosting enzyme function prediction by fusing protein sequence, structure, and surface.ProteinF3S:通过融合蛋白质序列、结构和表面特征增强酶功能预测
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae695.
2
CrossPredGO: A Novel Light-Weight Cross-Modal Multi-Attention Framework for Protein Function Prediction.CrossPredGO:一种用于蛋白质功能预测的新型轻量级跨模态多注意力框架
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):1709-1720. doi: 10.1109/TCBB.2024.3410696. Epub 2024 Dec 10.
3
GloEC: a hierarchical-aware global model for predicting enzyme function.GloEC:一种用于预测酶功能的层次感知全局模型。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae365.
4
EC number prediction of protein sequences based on combination of hierarchical and global features.基于层次化和全局特征组合的蛋白质序列 EC 数预测。
Yi Chuan. 2024 Aug;46(8):661-669. doi: 10.16288/j.yczz.24-102.
5
PMSFF: Improved Protein Binding Residues Prediction through Multi-Scale Sequence-Based Feature Fusion Strategy.PMSFF:通过多尺度序列的基于特征融合策略来提高蛋白质结合残基预测。
Biomolecules. 2024 Sep 27;14(10):1220. doi: 10.3390/biom14101220.
6
Comparative Assessment of Protein Large Language Models for Enzyme Commission Number Prediction.用于酶委员会编号预测的蛋白质大语言模型的比较评估
BMC Bioinformatics. 2025 Feb 27;26(1):68. doi: 10.1186/s12859-025-06081-9.
7
Predicting protein-protein interaction with interpretable bilinear attention network.使用可解释双线性注意力网络预测蛋白质-蛋白质相互作用。
Comput Methods Programs Biomed. 2025 Jun;265:108756. doi: 10.1016/j.cmpb.2025.108756. Epub 2025 Mar 30.
8
SST-ResNet: A Sequence and Structure Information Integration Model for Protein Property Prediction.SST-ResNet:一种用于蛋白质属性预测的序列与结构信息整合模型。
Int J Mol Sci. 2025 Mar 19;26(6):2783. doi: 10.3390/ijms26062783.
9
ILMCNet: A Deep Neural Network Model That Uses PLM to Process Features and Employs CRF to Predict Protein Secondary Structure.ILMCNet:一种利用 PLM 处理特征并采用 CRF 预测蛋白质二级结构的深度神经网络模型。
Genes (Basel). 2024 Oct 21;15(10):1350. doi: 10.3390/genes15101350.
10
PCP-GC-LM: single-sequence-based protein contact prediction using dual graph convolutional neural network and convolutional neural network.PCP-GC-LM:基于双图卷积神经网络和卷积神经网络的单序列蛋白质接触预测。
BMC Bioinformatics. 2024 Sep 2;25(1):287. doi: 10.1186/s12859-024-05914-3.

本文引用的文献

1
Complementary multi-modality molecular self-supervised learning via non-overlapping masking for property prediction.通过非重叠掩蔽进行互补多模态分子自监督学习以进行性质预测。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae256.
2
ProteinMAE: masked autoencoder for protein surface self-supervised learning.蛋白质 MAE:用于蛋白质表面自监督学习的掩蔽自动编码器。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad724.
3
A Multimodal Protein Representation Framework for Quantifying Transferability Across Biochemical Downstream Tasks.
一种用于量化跨生化下游任务可转移性的多模态蛋白质表示框架。
Adv Sci (Weinh). 2023 Aug;10(22):e2301223. doi: 10.1002/advs.202301223. Epub 2023 May 30.
4
Structure-aware protein self-supervised learning.基于结构的蛋白质自监督学习。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad189.
5
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
6
LM-GVP: an extensible sequence and structure informed deep learning framework for protein property prediction.LM-GVP:一个可扩展的序列和结构信息深度学习框架,用于蛋白质性质预测。
Sci Rep. 2022 Apr 27;12(1):6832. doi: 10.1038/s41598-022-10775-y.
7
ProteinBERT: a universal deep-learning model of protein sequence and function.蛋白质 BERT:一种通用的蛋白质序列和功能深度学习模型。
Bioinformatics. 2022 Apr 12;38(8):2102-2110. doi: 10.1093/bioinformatics/btac020.
8
In silico methods and tools for drug discovery.基于计算机的药物研发方法和工具。
Comput Biol Med. 2021 Oct;137:104851. doi: 10.1016/j.compbiomed.2021.104851. Epub 2021 Sep 8.
9
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
10
Structure-based protein function prediction using graph convolutional networks.基于结构的蛋白质功能预测使用图卷积网络。
Nat Commun. 2021 May 26;12(1):3168. doi: 10.1038/s41467-021-23303-9.