• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于深度学习和XGBoost的蛋白质-蛋白质相互作用位点预测方法。

A Deep Learning and XGBoost-Based Method for Predicting Protein-Protein Interaction Sites.

作者信息

Wang Pan, Zhang Guiyang, Yu Zu-Guo, Huang Guohua

机构信息

School of Electrical Engineering, Shaoyang University, Shaoyang, China.

Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, China.

出版信息

Front Genet. 2021 Oct 26;12:752732. doi: 10.3389/fgene.2021.752732. eCollection 2021.

DOI:10.3389/fgene.2021.752732
PMID:34764983
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8576272/
Abstract

Knowledge about protein-protein interactions is beneficial in understanding cellular mechanisms. Protein-protein interactions are usually determined according to their protein-protein interaction sites. Due to the limitations of current techniques, it is still a challenging task to detect protein-protein interaction sites. In this article, we presented a method based on deep learning and XGBoost (called DeepPPISP-XGB) for predicting protein-protein interaction sites. The deep learning model served as a feature extractor to remove redundant information from protein sequences. The Extreme Gradient Boosting algorithm was used to construct a classifier for predicting protein-protein interaction sites. The DeepPPISP-XGB achieved the following results: area under the receiver operating characteristic curve of 0.681, a recall of 0.624, and area under the precision-recall curve of 0.339, being competitive with the state-of-the-art methods. We also validated the positive role of global features in predicting protein-protein interaction sites.

摘要

了解蛋白质-蛋白质相互作用有助于理解细胞机制。蛋白质-蛋白质相互作用通常根据其蛋白质-蛋白质相互作用位点来确定。由于当前技术的局限性,检测蛋白质-蛋白质相互作用位点仍然是一项具有挑战性的任务。在本文中,我们提出了一种基于深度学习和XGBoost的方法(称为DeepPPISP-XGB)来预测蛋白质-蛋白质相互作用位点。深度学习模型作为特征提取器,用于去除蛋白质序列中的冗余信息。采用极端梯度提升算法构建预测蛋白质-蛋白质相互作用位点的分类器。DeepPPISP-XGB取得了以下结果:受试者工作特征曲线下面积为0.681,召回率为0.624,精确率-召回率曲线下面积为0.339,与现有最先进方法具有竞争力。我们还验证了全局特征在预测蛋白质-蛋白质相互作用位点中的积极作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/897cfe7c01ca/fgene-12-752732-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/e9f7b9cbf582/fgene-12-752732-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/39e245f58617/fgene-12-752732-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/09b9226b2d27/fgene-12-752732-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/20a4394b3ec9/fgene-12-752732-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/bfae48caefe2/fgene-12-752732-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/897cfe7c01ca/fgene-12-752732-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/e9f7b9cbf582/fgene-12-752732-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/39e245f58617/fgene-12-752732-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/09b9226b2d27/fgene-12-752732-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/20a4394b3ec9/fgene-12-752732-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/bfae48caefe2/fgene-12-752732-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/8576272/897cfe7c01ca/fgene-12-752732-g006.jpg

相似文献

1
A Deep Learning and XGBoost-Based Method for Predicting Protein-Protein Interaction Sites.一种基于深度学习和XGBoost的蛋白质-蛋白质相互作用位点预测方法。
Front Genet. 2021 Oct 26;12:752732. doi: 10.3389/fgene.2021.752732. eCollection 2021.
2
A machine learning-based model for predicting distant metastasis in patients with rectal cancer.一种基于机器学习的预测直肠癌患者远处转移的模型。
Front Oncol. 2023 Aug 15;13:1235121. doi: 10.3389/fonc.2023.1235121. eCollection 2023.
3
Prediction of protein-protein interaction sites through eXtreme gradient boosting with kernel principal component analysis.通过核主成分分析的极端梯度提升预测蛋白质-蛋白质相互作用位点。
Comput Biol Med. 2021 Jul;134:104516. doi: 10.1016/j.compbiomed.2021.104516. Epub 2021 Jun 1.
4
Ultrasound deep learning radiomics and clinical machine learning models to predict low nuclear grade, ER, PR, and HER2 receptor status in pure ductal carcinoma .超声深度学习影像组学和临床机器学习模型预测纯导管癌的低核分级、雌激素受体、孕激素受体和人表皮生长因子受体2受体状态
Gland Surg. 2024 Apr 29;13(4):512-527. doi: 10.21037/gs-23-417. Epub 2024 Apr 11.
5
Predicting Measles Outbreaks in the United States: Evaluation of Machine Learning Approaches.预测美国的麻疹疫情:机器学习方法评估
JMIR Form Res. 2023 Apr 4;7:e42832. doi: 10.2196/42832.
6
Machine Learning Model for Predicting Mortality Risk in Patients With Complex Chronic Conditions: Retrospective Analysis.用于预测复杂慢性病患者死亡风险的机器学习模型:回顾性分析
Online J Public Health Inform. 2023 Dec 28;15:e52782. doi: 10.2196/52782. eCollection 2023.
7
T4SE-XGB: Interpretable Sequence-Based Prediction of Type IV Secreted Effectors Using eXtreme Gradient Boosting Algorithm.T4SE-XGB:使用极端梯度提升算法对IV型分泌效应蛋白进行基于序列的可解释预测。
Front Microbiol. 2020 Sep 24;11:580382. doi: 10.3389/fmicb.2020.580382. eCollection 2020.
8
DeepStack-DTIs: Predicting Drug-Target Interactions Using LightGBM Feature Selection and Deep-Stacked Ensemble Classifier.DeepStack-DTIs:使用 LightGBM 特征选择和深度堆叠集成分类器预测药物-靶标相互作用。
Interdiscip Sci. 2022 Jun;14(2):311-330. doi: 10.1007/s12539-021-00488-7. Epub 2021 Nov 3.
9
IdentPMP: identification of moonlighting proteins in plants using sequence-based learning models.IdentPMP:使用基于序列的学习模型鉴定植物中的兼职蛋白
PeerJ. 2021 Aug 6;9:e11900. doi: 10.7717/peerj.11900. eCollection 2021.
10
Prediction of Critical Care Outcome for Adult Patients Presenting to Emergency Department Using Initial Triage Information: An XGBoost Algorithm Analysis.利用初始分诊信息预测急诊科成年患者的重症监护结局:一种XGBoost算法分析
JMIR Med Inform. 2021 Sep 20;9(9):e30770. doi: 10.2196/30770.

引用本文的文献

1
GNNs and ensemble models enhance the prediction of new sRNA-mRNA interactions in unseen conditions.图神经网络(GNNs)和集成模型增强了在未知条件下对新的小RNA-信使核糖核酸(sRNA-mRNA)相互作用的预测。
BMC Bioinformatics. 2025 May 21;26(1):131. doi: 10.1186/s12859-025-06153-w.
2
Feature fusion with attributed deepwalk for protein-protein interaction prediction.基于属性深度游走的特征融合用于蛋白质-蛋白质相互作用预测
Sci Rep. 2025 Apr 10;15(1):12255. doi: 10.1038/s41598-025-96510-9.
3
Ayu: a machine intelligence tool for identification of extracellular proteins in the marine secretome.

本文引用的文献

1
Prediction of protein-protein interaction sites through eXtreme gradient boosting with kernel principal component analysis.通过核主成分分析的极端梯度提升预测蛋白质-蛋白质相互作用位点。
Comput Biol Med. 2021 Jul;134:104516. doi: 10.1016/j.compbiomed.2021.104516. Epub 2021 Jun 1.
2
Using Recursive Feature Selection with Random Forest to Improve Protein Structural Class Prediction for Low-Similarity Sequences.使用递归特征选择和随机森林提高低相似度序列的蛋白质结构分类预测。
Comput Math Methods Med. 2021 May 7;2021:5529389. doi: 10.1155/2021/5529389. eCollection 2021.
3
Classification and prediction of protein-protein interaction interface using machine learning algorithm.
阿玉:一种用于识别海洋分泌组中细胞外蛋白质的机器智能工具。
Nat Commun. 2025 Mar 21;16(1):2793. doi: 10.1038/s41467-025-57974-5.
4
Machine Learning Methods in Protein-Protein Docking.机器学习方法在蛋白质-蛋白质对接中的应用。
Methods Mol Biol. 2024;2780:107-126. doi: 10.1007/978-1-0716-3985-6_7.
5
Fast end-to-end surface interpretation of SARS-CoV-2 variants by differentiable molecular surface interaction fingerprinting method.通过可微分子表面相互作用指纹识别方法对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)变体进行快速端到端表面解释
Comput Struct Biotechnol J. 2023 Sep 28;21:4816-4824. doi: 10.1016/j.csbj.2023.09.033. eCollection 2023.
6
Recent Advances in Deep Learning for Protein-Protein Interaction Analysis: A Comprehensive Review.深度学习在蛋白质-蛋白质相互作用分析中的最新进展:全面综述。
Molecules. 2023 Jul 2;28(13):5169. doi: 10.3390/molecules28135169.
7
Identification of novel interacts partners of ADAR1 enzyme mediating the oncogenic process in aggressive breast cancer.鉴定 ADAR1 酶介导侵袭性乳腺癌致癌过程的新型相互作用伙伴。
Sci Rep. 2023 May 23;13(1):8341. doi: 10.1038/s41598-023-35517-6.
基于机器学习算法的蛋白质-蛋白质相互作用界面分类与预测。
Sci Rep. 2021 Jan 19;11(1):1761. doi: 10.1038/s41598-020-80900-2.
4
SNB-PSSM: A spatial neighbor-based PSSM used for protein-RNA binding site prediction.SNB-PSSM:一种基于空间邻居的 PSSM,用于蛋白质-RNA 结合位点预测。
J Mol Recognit. 2021 Jun;34(6):e2887. doi: 10.1002/jmr.2887. Epub 2021 Jan 14.
5
EPTool: A New Enhancing PSSM Tool for Protein Secondary Structure Prediction.EPTool:一种用于蛋白质二级结构预测的新型增强 PSSM 工具。
J Comput Biol. 2021 Apr;28(4):362-364. doi: 10.1089/cmb.2020.0417. Epub 2020 Dec 1.
6
'It will change everything': DeepMind's AI makes gigantic leap in solving protein structures.“它将改变一切”:深度思维公司的人工智能在解决蛋白质结构问题上取得巨大飞跃。
Nature. 2020 Dec;588(7837):203-204. doi: 10.1038/d41586-020-03348-4.
7
Developing Computational Model to Predict Protein-Protein Interaction Sites Based on the XGBoost Algorithm.基于 XGBoost 算法开发用于预测蛋白质-蛋白质相互作用位点的计算模型。
Int J Mol Sci. 2020 Mar 25;21(7):2274. doi: 10.3390/ijms21072274.
8
Semi-supervised prediction of protein interaction sites from unlabeled sample information.基于未标记样本信息的蛋白质相互作用位点的半监督预测。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):699. doi: 10.1186/s12859-019-3274-7.
9
Imbalance Data Processing Strategy for Protein Interaction Sites Prediction.蛋白质相互作用位点预测的不平衡数据处理策略。
IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):985-994. doi: 10.1109/TCBB.2019.2953908. Epub 2021 Jun 3.
10
Protein-protein interaction site prediction through combining local and global features with deep neural networks.通过结合局部和全局特征与深度神经网络进行蛋白质-蛋白质相互作用位点预测。
Bioinformatics. 2020 Feb 15;36(4):1114-1120. doi: 10.1093/bioinformatics/btz699.