• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于Transformer的蛋白质-蛋白质相互作用位点预测集成框架。

A Transformer-Based Ensemble Framework for the Prediction of Protein-Protein Interaction Sites.

作者信息

Mou Minjie, Pan Ziqi, Zhou Zhimeng, Zheng Lingyan, Zhang Hanyu, Shi Shuiyang, Li Fengcheng, Sun Xiuna, Zhu Feng

机构信息

College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China.

Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China.

出版信息

Research (Wash D C). 2023 Sep 27;6:0240. doi: 10.34133/research.0240. eCollection 2023.

DOI:10.34133/research.0240
PMID:37771850
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10528219/
Abstract

The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have been developed to accelerate the identification of PPI sites. However, existing methods suffer from the low predictive accuracy or the limited scope of application. Specifically, some methods learned only global or local sequential features, leading to low predictive accuracy, while others achieved improved performance by extracting residue interactions from structures but were limited in their application scope for the serious dependence on precise structure information. There is an urgent need to develop a method that integrates comprehensive information to realize proteome-wide accurate profiling of PPI sites. Herein, a novel ensemble framework for PPI sites prediction, EnsemPPIS, was therefore proposed based on transformer and gated convolutional networks. EnsemPPIS can effectively capture not only global and local patterns but also residue interactions. Specifically, EnsemPPIS was unique in (a) extracting residue interactions from protein sequences with transformer and (b) further integrating global and local sequential features with the ensemble learning strategy. Compared with various existing methods, EnsemPPIS exhibited either superior performance or broader applicability on multiple PPI sites prediction tasks. Moreover, pattern analysis based on the interpretability of EnsemPPIS demonstrated that EnsemPPIS was fully capable of learning residue interactions within the local structure of PPI sites using only sequence information. The web server of EnsemPPIS is freely available at http://idrblab.org/ensemppis.

摘要

蛋白质-蛋白质相互作用(PPI)位点的识别在蛋白质功能研究和新药发现中至关重要。到目前为止,已经开发了多种基于机器学习的计算工具来加速PPI位点的识别。然而,现有方法存在预测准确率低或应用范围有限的问题。具体而言,一些方法仅学习全局或局部序列特征,导致预测准确率较低,而其他方法通过从结构中提取残基相互作用来提高性能,但由于严重依赖精确的结构信息,其应用范围受到限制。迫切需要开发一种整合综合信息的方法,以实现全蛋白质组范围内PPI位点的准确分析。在此,基于Transformer和门控卷积网络提出了一种用于PPI位点预测的新型集成框架EnsemPPIS。EnsemPPIS不仅可以有效地捕获全局和局部模式,还可以捕获残基相互作用。具体而言,EnsemPPIS的独特之处在于:(a)使用Transformer从蛋白质序列中提取残基相互作用;(b)通过集成学习策略进一步整合全局和局部序列特征。与各种现有方法相比,EnsemPPIS在多个PPI位点预测任务中表现出优异的性能或更广泛的适用性。此外,基于EnsemPPIS可解释性的模式分析表明,EnsemPPIS完全能够仅使用序列信息学习PPI位点局部结构内的残基相互作用。EnsemPPIS的网络服务器可在http://idrblab.org/ensemppis免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/4dbc690b556d/research.0240.fig.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/56fdb782c8e0/research.0240.fig.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/c8e7a2ed7690/research.0240.fig.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/2da5321ccec8/research.0240.fig.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/eb0d738747b6/research.0240.fig.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/76ce38d92a9d/research.0240.fig.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/2df7a84cdda8/research.0240.fig.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/4dbc690b556d/research.0240.fig.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/56fdb782c8e0/research.0240.fig.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/c8e7a2ed7690/research.0240.fig.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/2da5321ccec8/research.0240.fig.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/eb0d738747b6/research.0240.fig.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/76ce38d92a9d/research.0240.fig.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/2df7a84cdda8/research.0240.fig.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9936/10528219/4dbc690b556d/research.0240.fig.007.jpg

相似文献

1
A Transformer-Based Ensemble Framework for the Prediction of Protein-Protein Interaction Sites.一种基于Transformer的蛋白质-蛋白质相互作用位点预测集成框架。
Research (Wash D C). 2023 Sep 27;6:0240. doi: 10.34133/research.0240. eCollection 2023.
2
Protein-protein interaction site prediction through combining local and global features with deep neural networks.通过结合局部和全局特征与深度神经网络进行蛋白质-蛋白质相互作用位点预测。
Bioinformatics. 2020 Feb 15;36(4):1114-1120. doi: 10.1093/bioinformatics/btz699.
3
DCSE:Double-Channel-Siamese-Ensemble model for protein protein interaction prediction.DCSE:用于蛋白质相互作用预测的双通道暹罗集成模型。
BMC Genomics. 2022 Aug 4;23(1):555. doi: 10.1186/s12864-022-08772-6.
4
Improving protein-protein interaction site prediction using deep residual neural network.利用深度残差神经网络提高蛋白质-蛋白质相互作用位点预测
Anal Biochem. 2023 Jun 1;670:115132. doi: 10.1016/j.ab.2023.115132. Epub 2023 Mar 28.
5
HN-PPISP: a hybrid network based on MLP-Mixer for protein-protein interaction site prediction.HN-PPISP:一种基于MLP-Mixer的用于蛋白质-蛋白质相互作用位点预测的混合网络。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac480.
6
PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein-protein interaction information.PhosIDN:一种集成深度学习神经网络,通过结合序列和蛋白质-蛋白质相互作用信息来提高蛋白质磷酸化位点预测。
Bioinformatics. 2021 Dec 11;37(24):4668-4676. doi: 10.1093/bioinformatics/btab551.
7
Prediction of protein-protein interaction sites from weakly homologous template structures using meta-threading and machine learning.利用元线程和机器学习从弱同源模板结构预测蛋白质-蛋白质相互作用位点。
J Mol Recognit. 2015 Jan;28(1):35-48. doi: 10.1002/jmr.2410.
8
Residue-Frustration-Based Prediction of Protein-Protein Interactions Using Machine Learning.基于机器学习的基于残基受挫的蛋白质-蛋白质相互作用预测
J Phys Chem B. 2022 Mar 3;126(8):1719-1727. doi: 10.1021/acs.jpcb.1c10525. Epub 2022 Feb 16.
9
Critical assessment and performance improvement of plant-pathogen protein-protein interaction prediction methods.植物-病原体蛋白-蛋白相互作用预测方法的关键评估和性能改进。
Brief Bioinform. 2019 Jan 18;20(1):274-287. doi: 10.1093/bib/bbx123.
10
Computational probing protein-protein interactions targeting small molecules.针对小分子的蛋白质-蛋白质相互作用的计算探测
Bioinformatics. 2016 Jan 15;32(2):226-34. doi: 10.1093/bioinformatics/btv528. Epub 2015 Sep 28.

引用本文的文献

1
DeepPhosPPI: a deep learning framework with attention-CNN and transformer for predicting phosphorylation effects on protein-protein interactions.DeepPhosPPI:一种用于预测磷酸化对蛋白质-蛋白质相互作用影响的深度学习框架,该框架采用了注意力卷积神经网络和变换器。
Brief Bioinform. 2025 Sep 6;26(5). doi: 10.1093/bib/bbaf462.
2
Mechanisms of IGF1R signaling in type 2 diabetes-related neurodegeneration and therapeutic implications of exercise.2型糖尿病相关神经退行性变中IGF1R信号传导机制及运动的治疗意义
Eur J Med Res. 2025 Aug 30;30(1):825. doi: 10.1186/s40001-025-03087-0.
3
SALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning.

本文引用的文献

1
iAMPCN: a deep-learning approach for identifying antimicrobial peptides and their functional activities.iAMPCN:一种用于识别抗菌肽及其功能活性的深度学习方法。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad240.
2
Enzyme Commission Number Prediction and Benchmarking with Hierarchical Dual-core Multitask Learning Framework.基于分层双核多任务学习框架的酶委员会编号预测与基准测试
Research (Wash D C). 2023 May 31;6:0153. doi: 10.34133/research.0153. eCollection 2023.
3
PeSTo: parameter-free geometric deep learning for accurate prediction of protein binding interfaces.
SALM:用于全面抗体表征学习的序列-结构预训练大语言模型。
Research (Wash D C). 2025 Aug 19;8:0721. doi: 10.34133/research.0721. eCollection 2025.
4
LocPro: A deep learning-based prediction of protein subcellular localization for promoting multi-directional pharmaceutical research.LocPro:基于深度学习的蛋白质亚细胞定位预测,以促进多方向药物研究。
J Pharm Anal. 2025 Aug;15(8):101255. doi: 10.1016/j.jpha.2025.101255. Epub 2025 Mar 5.
5
ASCE-PPIS: a protein-protein interaction sites predictor based on equivariant graph neural network with fusion of structure-aware pooling and graph collapse.ASCE-PPIS:一种基于等变图神经网络的蛋白质-蛋白质相互作用位点预测器,融合了结构感知池化和图折叠。
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf423.
6
Adaptive multi-view learning method for enhanced drug repurposing using chemical-induced transcriptional profiles, knowledge graphs, and large language models.基于化学诱导转录谱、知识图谱和大语言模型的用于增强药物再利用的自适应多视图学习方法
J Pharm Anal. 2025 Jun;15(6):101275. doi: 10.1016/j.jpha.2025.101275. Epub 2025 Mar 21.
7
A review of transformer models in drug discovery and beyond.药物发现及其他领域中变压器模型综述。
J Pharm Anal. 2025 Jun;15(6):101081. doi: 10.1016/j.jpha.2024.101081. Epub 2024 Aug 30.
8
Large Context, Deeper Insights: Harnessing Large Language Models for Advancing Protein-Protein Interaction Analysis.大背景,更深刻的见解:利用大语言模型推动蛋白质-蛋白质相互作用分析
Methods Mol Biol. 2025;2941:243-267. doi: 10.1007/978-1-0716-4623-6_15.
9
Recent advances in deep learning for protein-protein interaction: a review.深度学习在蛋白质-蛋白质相互作用研究中的最新进展:综述
BioData Min. 2025 Jun 16;18(1):43. doi: 10.1186/s13040-025-00457-6.
10
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
PeSTo:用于准确预测蛋白质结合界面的无参几何深度学习。
Nat Commun. 2023 Apr 18;14(1):2175. doi: 10.1038/s41467-023-37701-8.
4
Next-generation large-scale binary protein interaction network for Drosophila melanogaster.用于黑腹果蝇的下一代大规模二进制蛋白质相互作用网络。
Nat Commun. 2023 Apr 15;14(1):2162. doi: 10.1038/s41467-023-37876-0.
5
Trends and Potential of Machine Learning and Deep Learning in Drug Study at Single-Cell Level.单细胞水平药物研究中机器学习与深度学习的趋势及潜力
Research (Wash D C). 2023;6:0050. doi: 10.34133/research.0050. Epub 2023 Mar 9.
6
SARS-CoV-2 Spike Protein Post-Translational Modification Landscape and Its Impact on Protein Structure and Function via Computational Prediction.严重急性呼吸综合征冠状病毒2刺突蛋白的翻译后修饰图谱及其通过计算预测对蛋白质结构和功能的影响
Research (Wash D C). 2023 Mar 8;6:0078. doi: 10.34133/research.0078.
7
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
8
AlphaFold2 and its applications in the fields of biology and medicine.AlphaFold2 及其在生物学和医学领域的应用。
Signal Transduct Target Ther. 2023 Mar 14;8(1):115. doi: 10.1038/s41392-023-01381-z.
9
DeepBIO: an automated and interpretable deep-learning platform for high-throughput biological sequence prediction, functional annotation and visualization analysis.DeepBIO:一个自动化的、可解释的深度学习平台,用于高通量生物序列预测、功能注释和可视化分析。
Nucleic Acids Res. 2023 Apr 24;51(7):3017-3029. doi: 10.1093/nar/gkad055.
10
Deep-learning based approach to identify substrates of human E3 ubiquitin ligases and deubiquitinases.基于深度学习的方法来识别人类E3泛素连接酶和去泛素化酶的底物。
Comput Struct Biotechnol J. 2023 Jan 18;21:1014-1021. doi: 10.1016/j.csbj.2023.01.021. eCollection 2023.