• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用随机森林预测序列和三维结构中的蛋白质-蛋白质相互作用位点。

Prediction of protein-protein interaction sites in sequences and 3D structures by random forests.

作者信息

Sikić Mile, Tomić Sanja, Vlahovicek Kristian

机构信息

Department of Electronic Systems and Information Processing, Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia.

出版信息

PLoS Comput Biol. 2009 Jan;5(1):e1000278. doi: 10.1371/journal.pcbi.1000278. Epub 2009 Jan 30.

DOI:10.1371/journal.pcbi.1000278
PMID:19180183
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2621338/
Abstract

Identifying interaction sites in proteins provides important clues to the function of a protein and is becoming increasingly relevant in topics such as systems biology and drug discovery. Although there are numerous papers on the prediction of interaction sites using information derived from structure, there are only a few case reports on the prediction of interaction residues based solely on protein sequence. Here, a sliding window approach is combined with the Random Forests method to predict protein interaction sites using (i) a combination of sequence- and structure-derived parameters and (ii) sequence information alone. For sequence-based prediction we achieved a precision of 84% with a 26% recall and an F-measure of 40%. When combined with structural information, the prediction performance increases to a precision of 76% and a recall of 38% with an F-measure of 51%. We also present an attempt to rationalize the sliding window size and demonstrate that a nine-residue window is the most suitable for predictor construction. Finally, we demonstrate the applicability of our prediction methods by modeling the Ras-Raf complex using predicted interaction sites as target binding interfaces. Our results suggest that it is possible to predict protein interaction sites with quite a high accuracy using only sequence information.

摘要

识别蛋白质中的相互作用位点可为蛋白质功能提供重要线索,并且在系统生物学和药物发现等领域变得越来越重要。尽管有大量关于利用结构衍生信息预测相互作用位点的论文,但仅基于蛋白质序列预测相互作用残基的案例报告却很少。在此,一种滑动窗口方法与随机森林方法相结合,用于预测蛋白质相互作用位点,具体使用(i)基于序列和结构的参数组合,以及(ii)仅使用序列信息。对于基于序列的预测,我们实现了84%的精度、26%的召回率和40%的F值。当与结构信息相结合时,预测性能提高到76%的精度、38%的召回率和51%的F值。我们还尝试对滑动窗口大小进行合理化,并证明九个残基的窗口最适合构建预测器。最后,我们通过将预测的相互作用位点作为目标结合界面来模拟Ras-Raf复合物,证明了我们预测方法的适用性。我们的结果表明,仅使用序列信息就有可能以相当高的准确率预测蛋白质相互作用位点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/7b3d5d31f1bd/pcbi.1000278.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/217852ba712c/pcbi.1000278.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/74010de827c4/pcbi.1000278.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/7b1c532b8b80/pcbi.1000278.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/80ec6bf4e9f8/pcbi.1000278.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/03a0ab5bc261/pcbi.1000278.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/6ebe18924a4b/pcbi.1000278.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/8f6d5c76e67b/pcbi.1000278.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/7b3d5d31f1bd/pcbi.1000278.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/217852ba712c/pcbi.1000278.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/74010de827c4/pcbi.1000278.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/7b1c532b8b80/pcbi.1000278.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/80ec6bf4e9f8/pcbi.1000278.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/03a0ab5bc261/pcbi.1000278.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/6ebe18924a4b/pcbi.1000278.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/8f6d5c76e67b/pcbi.1000278.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b03e/2621338/7b3d5d31f1bd/pcbi.1000278.g008.jpg

相似文献

1
Prediction of protein-protein interaction sites in sequences and 3D structures by random forests.利用随机森林预测序列和三维结构中的蛋白质-蛋白质相互作用位点。
PLoS Comput Biol. 2009 Jan;5(1):e1000278. doi: 10.1371/journal.pcbi.1000278. Epub 2009 Jan 30.
2
Prediction of protein-protein interaction sites using patch-based residue characterization.基于补丁的残基特征预测蛋白质-蛋白质相互作用位点。
J Theor Biol. 2012 Jan 21;293:143-50. doi: 10.1016/j.jtbi.2011.10.021. Epub 2011 Oct 25.
3
Prediction of protein-protein interaction sites using support vector machines.使用支持向量机预测蛋白质-蛋白质相互作用位点。
Protein Eng Des Sel. 2004 Feb;17(2):165-73. doi: 10.1093/protein/gzh020. Epub 2004 Jan 20.
4
A novel feature extraction scheme for prediction of protein-protein interaction sites.一种用于预测蛋白质-蛋白质相互作用位点的新型特征提取方案。
Mol Biosyst. 2015 Feb;11(2):475-85. doi: 10.1039/c4mb00625a. Epub 2014 Nov 21.
5
Prediction of interacting protein residues using sequence and structure data.利用序列和结构数据预测相互作用的蛋白质残基。
Methods Mol Biol. 2012;819:233-51. doi: 10.1007/978-1-61779-465-0_16.
6
Seeing the trees through the forest: sequence-based homo- and heteromeric protein-protein interaction sites prediction using random forest.透过森林看树木:使用随机森林基于序列预测同源和异源多聚体蛋白质-蛋白质相互作用位点
Bioinformatics. 2017 May 15;33(10):1479-1487. doi: 10.1093/bioinformatics/btx005.
7
Prediction of lipid-binding sites based on support vector machine and position specific scoring matrix.基于支持向量机和位置特异性评分矩阵预测脂质结合位点。
Protein J. 2010 Aug;29(6):427-31. doi: 10.1007/s10930-010-9269-x.
8
Integrating statistical predictions and experimental verifications for enhancing protein-chemical interaction predictions in virtual screening.整合统计预测与实验验证以增强虚拟筛选中的蛋白质-化学相互作用预测。
PLoS Comput Biol. 2009 Jun;5(6):e1000397. doi: 10.1371/journal.pcbi.1000397. Epub 2009 Jun 5.
9
Prediction of Ras-effector interactions using position energy matrices.使用位置能量矩阵预测Ras效应器相互作用。
Bioinformatics. 2007 Sep 1;23(17):2226-30. doi: 10.1093/bioinformatics/btm336. Epub 2007 Jun 28.
10
Imbalance Data Processing Strategy for Protein Interaction Sites Prediction.蛋白质相互作用位点预测的不平衡数据处理策略。
IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):985-994. doi: 10.1109/TCBB.2019.2953908. Epub 2021 Jun 3.

引用本文的文献

1
The identification of metabolites from gut microbiota in autism spectrum disorder via network pharmacology.通过网络药理学鉴定自闭症谱系障碍中肠道微生物群的代谢产物
Sci Rep. 2025 Aug 28;15(1):31765. doi: 10.1038/s41598-025-15921-w.
2
Structure-Based Approaches for Protein-Protein Interaction Prediction Using Machine Learning and Deep Learning.基于结构的机器学习和深度学习蛋白质-蛋白质相互作用预测方法
Biomolecules. 2025 Jan 17;15(1):141. doi: 10.3390/biom15010141.
3
Comprehensive bioinformatics and machine learning analyses for breast cancer staging using TCGA dataset.

本文引用的文献

1
PSAIA - protein structure and interaction analyzer.PSAIA - 蛋白质结构与相互作用分析仪
BMC Struct Biol. 2008 Apr 9;8:21. doi: 10.1186/1472-6807-8-21.
2
The universal protein resource (UniProt).通用蛋白质资源(UniProt)。
Nucleic Acids Res. 2008 Jan;36(Database issue):D190-5. doi: 10.1093/nar/gkm895. Epub 2007 Nov 27.
3
ISIS: interaction sites identified from sequence.ISIS:从序列中识别出的相互作用位点。
使用TCGA数据集进行乳腺癌分期的综合生物信息学和机器学习分析。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae628.
4
A comprehensive review of protein-centric predictors for biomolecular interactions: from proteins to nucleic acids and beyond.蛋白质中心预测因子在生物分子相互作用研究中的综合综述:从蛋白质到核酸及其他。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae162.
5
Potential of Gut Microbial Metabolites in Treating Osteoporosis and Obesity: A Network Pharmacology and Bioinformatics Approach.肠道微生物代谢产物在治疗骨质疏松症和肥胖症中的潜力:网络药理学和生物信息学方法。
Med Sci Monit. 2024 Mar 10;30:e942899. doi: 10.12659/MSM.942899.
6
Predicting Immunotherapy Outcomes in Glioblastoma Patients through Machine Learning.通过机器学习预测胶质母细胞瘤患者的免疫治疗结果
Cancers (Basel). 2024 Jan 18;16(2):408. doi: 10.3390/cancers16020408.
7
A Network Pharmacology and Molecular-Docking-Based Approach to Identify the Probable Targets of Short-Chain Fatty-Acid-Producing Microbial Metabolites against Kidney Cancer and Inflammation.基于网络药理学和分子对接的方法鉴定短链脂肪酸产生菌代谢产物治疗肾癌和炎症的潜在靶点
Biomolecules. 2023 Nov 20;13(11):1678. doi: 10.3390/biom13111678.
8
Computational approaches for the design of modulators targeting protein-protein interactions.针对靶向蛋白质-蛋白质相互作用的调节剂设计的计算方法。
Expert Opin Drug Discov. 2023 Mar;18(3):315-333. doi: 10.1080/17460441.2023.2171396. Epub 2023 Feb 23.
9
Integrated Analysis of Bulk RNA-Seq and Single-Cell RNA-Seq Unravels the Influences of SARS-CoV-2 Infections to Cancer Patients. bulk RNA-Seq 和单细胞 RNA-Seq 的综合分析揭示了 SARS-CoV-2 感染对癌症患者的影响。
Int J Mol Sci. 2022 Dec 10;23(24):15698. doi: 10.3390/ijms232415698.
10
SENSDeep: An Ensemble Deep Learning Method for Protein-Protein Interaction Sites Prediction.SENSDeep:一种用于蛋白质-蛋白质相互作用位点预测的集成深度学习方法。
Interdiscip Sci. 2023 Mar;15(1):55-87. doi: 10.1007/s12539-022-00543-x. Epub 2022 Nov 8.
Bioinformatics. 2007 Jan 15;23(2):e13-6. doi: 10.1093/bioinformatics/btl303.
4
Insights into protein-protein interfaces using a Bayesian network prediction method.使用贝叶斯网络预测方法洞察蛋白质-蛋白质相互作用界面
J Mol Biol. 2006 Sep 15;362(2):365-86. doi: 10.1016/j.jmb.2006.07.028. Epub 2006 Jul 21.
5
Predicting protein interaction sites: binding hot-spots in protein-protein and protein-ligand interfaces.预测蛋白质相互作用位点:蛋白质-蛋白质和蛋白质-配体界面中的结合热点
Bioinformatics. 2006 Jun 1;22(11):1335-42. doi: 10.1093/bioinformatics/btl079. Epub 2006 Mar 7.
6
Predicting protein interaction sites from residue spatial sequence profile and evolution rate.基于残基空间序列轮廓和进化速率预测蛋白质相互作用位点。
FEBS Lett. 2006 Jan 23;580(2):380-4. doi: 10.1016/j.febslet.2005.11.081. Epub 2005 Dec 19.
7
ROCR: visualizing classifier performance in R.ROCR:在R语言中可视化分类器性能
Bioinformatics. 2005 Oct 15;21(20):3940-1. doi: 10.1093/bioinformatics/bti623. Epub 2005 Aug 11.
8
Prediction of protein-protein interactions by combining structure and sequence conservation in protein interfaces.通过结合蛋白质界面中的结构和序列保守性来预测蛋白质-蛋白质相互作用。
Bioinformatics. 2005 Jun 15;21(12):2850-5. doi: 10.1093/bioinformatics/bti443. Epub 2005 Apr 26.
9
An evolution based classifier for prediction of protein interfaces without using protein structures.一种无需使用蛋白质结构即可预测蛋白质界面的基于进化的分类器。
Bioinformatics. 2005 May 15;21(10):2496-501. doi: 10.1093/bioinformatics/bti340. Epub 2005 Feb 22.
10
Improved prediction of protein-protein binding sites using a support vector machines approach.使用支持向量机方法改进蛋白质-蛋白质结合位点的预测。
Bioinformatics. 2005 Apr 15;21(8):1487-94. doi: 10.1093/bioinformatics/bti242. Epub 2004 Dec 21.