• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SENSDeep:一种用于蛋白质-蛋白质相互作用位点预测的集成深度学习方法。

SENSDeep: An Ensemble Deep Learning Method for Protein-Protein Interaction Sites Prediction.

作者信息

Aybey Engin, Gümüş Özgür

机构信息

Department of Health Bioinformatics, Ege University, 35100, Bornova, Izmir, Turkey.

Rectorate, Marmara University, 34722, Kadıköy, Istanbul, Turkey.

出版信息

Interdiscip Sci. 2023 Mar;15(1):55-87. doi: 10.1007/s12539-022-00543-x. Epub 2022 Nov 8.

DOI:10.1007/s12539-022-00543-x
PMID:36346583
Abstract

PURPOSE

The determination of which amino acid in a protein interacts with other proteins is important in understanding the functional mechanism of that protein. Although there are experimental methods to detect protein-protein interaction sites (PPISs), these are costly, time-consuming, and require expertise. Therefore, many computational methods have been proposed to accelerate this type of research, but they are generally insufficient to predict PPISs accurately. There is a need for development in this field.

METHODS

In this study, we introduce a new PPISs prediction method. This method is a sequence-based Stacking ENSemble Deep (SENSDeep) learning method that has an ensemble learning model including the models of RNN, CNN, GRU sequence to sequence (GRUs2s), GRU sequence to sequence with an attention layer (GRUs2satt) and a multilayer perceptron. Two embedded features, secondary structure, and protein sequence information are added to the training data set in addition to twelve existing features to improve the prediction performance of the method.

RESULTS

SENSDeep trained on the training data set without two extra features obtains a better performance on some of the independent testing data sets than that of the other methods in the literature, especially on scoring metrics of sensitivity, F1, MCC, and AUPRC, having increments up to 63.5%, 19.3%, 18.5%, 11.4%, respectively. It is shown that the added extra features improve the performance of the method by having almost the same performance with less data as the method trained on the data set without these added features. On the other hand, different sizes of the sliding window are tried on the data sets and an optimal sliding window size for SENSDeep is found. Moreover, SENSDeep has also been compared to structure-based methods. Some of these methods have been found to perform better. Using SENSDeep obtained by training with both training data sets, PPISs prediction examples of various proteins that are not in these training data sets are also presented. Furthermore, execution times for SENSDeep and its submodels are shown.

AVAILABILITY AND IMPLEMENTATION

https://github.com/enginaybey/SENSDeep.

摘要

目的

确定蛋白质中的哪种氨基酸与其他蛋白质相互作用,对于理解该蛋白质的功能机制至关重要。尽管存在检测蛋白质 - 蛋白质相互作用位点(PPISs)的实验方法,但这些方法成本高昂、耗时且需要专业知识。因此,人们提出了许多计算方法来加速这类研究,但它们通常不足以准确预测PPISs。该领域仍有待发展。

方法

在本研究中,我们引入了一种新的PPISs预测方法。此方法是一种基于序列的堆叠集成深度(SENSDeep)学习方法,具有一个集成学习模型,该模型包括循环神经网络(RNN)、卷积神经网络(CNN)、门控循环单元序列到序列(GRUs2s)、带有注意力层的门控循环单元序列到序列(GRUs2satt)以及多层感知器的模型。除了十二个现有特征外,还将两个嵌入特征,即二级结构和蛋白质序列信息添加到训练数据集中,以提高该方法的预测性能。

结果

在没有两个额外特征的训练数据集上训练的SENSDeep,在一些独立测试数据集上的表现优于文献中的其他方法,特别是在灵敏度、F1、马修斯相关系数(MCC)和曲线下面积(AUPRC)等评分指标上,分别提高了63.5%、19.3%、18.5%、11.4%。结果表明,添加的额外特征提高了该方法的性能,在数据量几乎相同的情况下,其性能与在没有这些添加特征的数据集上训练的方法相近。另一方面,在数据集上尝试了不同大小的滑动窗口,并为SENSDeep找到了最佳滑动窗口大小。此外,还将SENSDeep与基于结构的方法进行了比较。发现其中一些方法表现更好。使用通过两个训练数据集训练得到的SENSDeep,还展示了不在这些训练数据集中的各种蛋白质的PPISs预测示例。此外,还给出了SENSDeep及其子模型的执行时间。

可用性和实现方式

https://github.com/enginaybey/SENSDeep。

相似文献

1
SENSDeep: An Ensemble Deep Learning Method for Protein-Protein Interaction Sites Prediction.SENSDeep:一种用于蛋白质-蛋白质相互作用位点预测的集成深度学习方法。
Interdiscip Sci. 2023 Mar;15(1):55-87. doi: 10.1007/s12539-022-00543-x. Epub 2022 Nov 8.
2
EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction.EDLMFC:一种具有多尺度特征组合的集成深度学习框架,用于 ncRNA-蛋白质相互作用预测。
BMC Bioinformatics. 2021 Mar 19;22(1):133. doi: 10.1186/s12859-021-04069-9.
3
DELPHI: accurate deep ensemble model for protein interaction sites prediction.DELPHI:用于蛋白质相互作用位点预测的准确深度集成模型。
Bioinformatics. 2021 May 17;37(7):896-904. doi: 10.1093/bioinformatics/btaa750.
4
Predicting protein-ligand binding residues with deep convolutional neural networks.使用深度卷积神经网络预测蛋白质-配体结合残基。
BMC Bioinformatics. 2019 Feb 26;20(1):93. doi: 10.1186/s12859-019-2672-1.
5
HN-PPISP: a hybrid network based on MLP-Mixer for protein-protein interaction site prediction.HN-PPISP:一种基于MLP-Mixer的用于蛋白质-蛋白质相互作用位点预测的混合网络。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac480.
6
DP-site: A dual deep learning-based method for protein-peptide interaction site prediction.DP-site:一种基于双重深度学习的蛋白质-肽相互作用位点预测方法。
Methods. 2024 Sep;229:17-29. doi: 10.1016/j.ymeth.2024.06.001. Epub 2024 Jun 12.
7
DCSE:Double-Channel-Siamese-Ensemble model for protein protein interaction prediction.DCSE:用于蛋白质相互作用预测的双通道暹罗集成模型。
BMC Genomics. 2022 Aug 4;23(1):555. doi: 10.1186/s12864-022-08772-6.
8
Improving protein-protein interaction site prediction using deep residual neural network.利用深度残差神经网络提高蛋白质-蛋白质相互作用位点预测
Anal Biochem. 2023 Jun 1;670:115132. doi: 10.1016/j.ab.2023.115132. Epub 2023 Mar 28.
9
Protein-protein interaction site prediction through combining local and global features with deep neural networks.通过结合局部和全局特征与深度神经网络进行蛋白质-蛋白质相互作用位点预测。
Bioinformatics. 2020 Feb 15;36(4):1114-1120. doi: 10.1093/bioinformatics/btz699.
10
ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction.ncDENSE:一种基于深度学习框架的新型计算方法,用于预测非编码 RNA 家族。
BMC Bioinformatics. 2023 Feb 27;24(1):68. doi: 10.1186/s12859-023-05191-6.

引用本文的文献

1
Bag-of-words is competitive with sum-of-embeddings language-inspired representations on protein inference.词袋模型在蛋白质推理方面与基于语言启发的词嵌入求和表示法具有竞争力。
PLoS One. 2025 Aug 6;20(8):e0325531. doi: 10.1371/journal.pone.0325531. eCollection 2025.
2
Optimizing lipocalin sequence classification with ensemble deep learning models.使用集成深度学习模型优化脂钙蛋白序列分类
PLoS One. 2025 Apr 16;20(4):e0319329. doi: 10.1371/journal.pone.0319329. eCollection 2025.
3
Development and validation of preeclampsia predictive models using key genes from bioinformatics and machine learning approaches.

本文引用的文献

1
Structure-aware protein-protein interaction site prediction using deep graph convolutional network.使用深度图卷积网络进行结构感知的蛋白质-蛋白质相互作用位点预测。
Bioinformatics. 2021 Dec 22;38(1):125-132. doi: 10.1093/bioinformatics/btab643.
2
Ensembling graph attention networks for human microbe-drug association prediction.基于图注意力网络的微生物-药物关联预测方法
Bioinformatics. 2020 Dec 30;36(Suppl_2):i779-i786. doi: 10.1093/bioinformatics/btaa891.
3
, Encoding Iron-Sulfur Subunit SDH2-1 of Succinate Dehydrogenase, Affects Leaf Senescence and Grain Yield in Rice.
基于生物信息学和机器学习方法的关键基因开发和验证子痫前期预测模型。
Front Immunol. 2024 Oct 31;15:1416297. doi: 10.3389/fimmu.2024.1416297. eCollection 2024.
4
Prediction of Protein-Protein Interactions Based on Integrating Deep Learning and Feature Fusion.基于深度学习和特征融合的蛋白质-蛋白质相互作用预测。
Int J Mol Sci. 2024 May 27;25(11):5820. doi: 10.3390/ijms25115820.
5
EnsembleDL-ATG: Identifying autophagy proteins by integrating their sequence and evolutionary information using an ensemble deep learning framework.集成深度学习自噬蛋白识别方法(EnsembleDL-ATG):使用集成深度学习框架整合自噬蛋白的序列和进化信息来识别自噬蛋白。
Comput Struct Biotechnol J. 2023 Sep 29;21:4836-4848. doi: 10.1016/j.csbj.2023.09.036. eCollection 2023.
6
Spatial Layouts of Low-Entropy Hydration Shells Guide Protein Binding.低熵水合壳层的空间布局指导蛋白质结合。
Glob Chall. 2023 May 2;7(7):2300022. doi: 10.1002/gch2.202300022. eCollection 2023 Jul.
7
Recent Advances in Deep Learning for Protein-Protein Interaction Analysis: A Comprehensive Review.深度学习在蛋白质-蛋白质相互作用分析中的最新进展:全面综述。
Molecules. 2023 Jul 2;28(13):5169. doi: 10.3390/molecules28135169.
编码琥珀酸脱氢酶铁硫亚单位 SDH2-1 的基因,影响水稻叶片衰老和籽粒产量。
Int J Mol Sci. 2020 Dec 25;22(1):157. doi: 10.3390/ijms22010157.
4
Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning.利用几何深度学习破译蛋白质分子表面的相互作用指纹。
Nat Methods. 2020 Feb;17(2):184-192. doi: 10.1038/s41592-019-0666-6. Epub 2019 Dec 9.
5
Protein-protein interaction site prediction through combining local and global features with deep neural networks.通过结合局部和全局特征与深度神经网络进行蛋白质-蛋白质相互作用位点预测。
Bioinformatics. 2020 Feb 15;36(4):1114-1120. doi: 10.1093/bioinformatics/btz699.
6
IntPred: a structure-based predictor of protein-protein interaction sites.IntPred:一种基于结构的蛋白质-蛋白质相互作用位点预测工具。
Bioinformatics. 2018 Jan 15;34(2):223-229. doi: 10.1093/bioinformatics/btx585.
7
PDBsum: Structural summaries of PDB entries.PDBsum:蛋白质数据库(PDB)条目的结构摘要。
Protein Sci. 2018 Jan;27(1):129-134. doi: 10.1002/pro.3289. Epub 2017 Oct 27.
8
Seeing the trees through the forest: sequence-based homo- and heteromeric protein-protein interaction sites prediction using random forest.透过森林看树木:使用随机森林基于序列预测同源和异源多聚体蛋白质-蛋白质相互作用位点
Bioinformatics. 2017 May 15;33(10):1479-1487. doi: 10.1093/bioinformatics/btx005.
9
Fundamentals of protein interaction network mapping.蛋白质相互作用网络图谱的基础
Mol Syst Biol. 2015 Dec 17;11(12):848. doi: 10.15252/msb.20156351.
10
Sequence-based prediction of protein-protein interaction sites with L1-logreg classifier.基于序列的 L1-logreg 分类器预测蛋白质-蛋白质相互作用位点。
J Theor Biol. 2014 May 7;348:47-54. doi: 10.1016/j.jtbi.2014.01.028. Epub 2014 Jan 31.