• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 mRMR 和 IFS 的多特征融合和 RF 预测蛋白质-蛋白质相互作用位点。

Prediction of Protein-Protein Interaction Sites by Multifeature Fusion and RF with mRMR and IFS.

机构信息

School of Information Science and Technology, Northeast Normal University, Changchun, 130024 Jilin, China.

Graduate School, Northeast Normal University, Changchun 130024, Jilin, China.

出版信息

Dis Markers. 2022 Oct 4;2022:5892627. doi: 10.1155/2022/5892627. eCollection 2022.

DOI:10.1155/2022/5892627
PMID:36246558
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9553539/
Abstract

Prediction of protein-protein interaction (PPI) sites is one of the most perplexing problems in drug discovery and computational biology. Although significant progress has been made by combining different machine learning techniques with a variety of distinct characteristics, the problem still remains unresolved. In this study, a technique for PPI sites is presented using a random forest (RF) algorithm followed by the minimum redundancy maximal relevance (mRMR) approach, and the method of incremental feature selection (IFS). Physicochemical properties of proteins and the features of the residual disorder, sequence conservation, secondary structure, and solvent accessibility are incorporated. Five 3D structural characteristics are also used to predict PPI sites. Analysis of features shows that 3D structural features such as relative solvent-accessible surface area (RASA) and surface curvature (SC) help in the prediction of PPI sites. Results show that the performance of the proposed predictor is superior to several other state-of-the-art predictors, whose average prediction accuracy is 81.44%, sensitivity is 82.17%, and specificity is 80.71%, respectively. The proposed predictor is expected to become a helpful tool for finding PPI sites, and the feature analysis presented in this study will give useful insights into protein interaction mechanisms.

摘要

蛋白质-蛋白质相互作用(PPI)位点的预测是药物发现和计算生物学中最棘手的问题之一。尽管通过将不同的机器学习技术与各种不同的特征相结合已经取得了重大进展,但该问题仍然没有得到解决。在这项研究中,提出了一种使用随机森林(RF)算法结合最小冗余最大相关性(mRMR)方法和增量特征选择(IFS)方法的 PPI 位点预测技术。该方法结合了蛋白质的物理化学性质和残差无序、序列保守性、二级结构和溶剂可及性的特征。还使用了五个 3D 结构特征来预测 PPI 位点。特征分析表明,相对溶剂可及表面积(RASA)和表面曲率(SC)等 3D 结构特征有助于预测 PPI 位点。结果表明,所提出的预测器的性能优于其他几种最先进的预测器,其平均预测精度为 81.44%,灵敏度为 82.17%,特异性为 80.71%。预计该预测器将成为寻找 PPI 位点的有用工具,本研究中提出的特征分析将为蛋白质相互作用机制提供有用的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/43b2805278f7/DM2022-5892627.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/77e61169591c/DM2022-5892627.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/2cf215cd1d97/DM2022-5892627.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/322e4b697379/DM2022-5892627.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/ec5ce3b73515/DM2022-5892627.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/40eb8fdec833/DM2022-5892627.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/84a458cae6c1/DM2022-5892627.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/43b2805278f7/DM2022-5892627.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/77e61169591c/DM2022-5892627.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/2cf215cd1d97/DM2022-5892627.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/322e4b697379/DM2022-5892627.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/ec5ce3b73515/DM2022-5892627.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/40eb8fdec833/DM2022-5892627.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/84a458cae6c1/DM2022-5892627.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65dd/9553539/43b2805278f7/DM2022-5892627.007.jpg

相似文献

1
Prediction of Protein-Protein Interaction Sites by Multifeature Fusion and RF with mRMR and IFS.基于 mRMR 和 IFS 的多特征融合和 RF 预测蛋白质-蛋白质相互作用位点。
Dis Markers. 2022 Oct 4;2022:5892627. doi: 10.1155/2022/5892627. eCollection 2022.
2
Prediction of protein-protein interaction sites by random forest algorithm with mRMR and IFS.基于 mRMR 和 IFS 的随机森林算法预测蛋白质相互作用位点
PLoS One. 2012;7(8):e43927. doi: 10.1371/journal.pone.0043927. Epub 2012 Aug 28.
3
Prediction of active sites of enzymes by maximum relevance minimum redundancy (mRMR) feature selection.通过最大相关最小冗余(mRMR)特征选择预测酶的活性位点。
Mol Biosyst. 2013 Jan 27;9(1):61-9. doi: 10.1039/c2mb25327e. Epub 2012 Nov 2.
4
Prediction of tyrosine sulfation with mRMR feature selection and analysis.酪氨酸硫酸化的预测与 mRMR 特征选择和分析。
J Proteome Res. 2010 Dec 3;9(12):6490-7. doi: 10.1021/pr1007152. Epub 2010 Nov 11.
5
Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection.基于序列的RNA结合蛋白预测:使用具有最小冗余最大相关特征选择的随机森林算法
Biomed Res Int. 2015;2015:425810. doi: 10.1155/2015/425810. Epub 2015 Oct 12.
6
Analysis and Prediction of Myristoylation Sites Using the mRMR Method, the IFS Method and an Extreme Learning Machine Algorithm.使用最大相关最小冗余(mRMR)方法、迭代特征选择(IFS)方法和极限学习机算法对肉豆蔻酰化位点进行分析与预测
Comb Chem High Throughput Screen. 2017;20(2):96-106. doi: 10.2174/1386207319666161220114424.
7
Predicting DNA-binding sites of proteins based on sequential and 3D structural information.基于序列和 3D 结构信息预测蛋白质的 DNA 结合位点。
Mol Genet Genomics. 2014 Jun;289(3):489-99. doi: 10.1007/s00438-014-0812-x. Epub 2014 Jan 22.
8
Prediction of protein modification sites of pyrrolidone carboxylic acid using mRMR feature selection and analysis.使用 mRMR 特征选择和分析预测吡咯烷酮羧酸的蛋白质修饰位点。
PLoS One. 2011;6(12):e28221. doi: 10.1371/journal.pone.0028221. Epub 2011 Dec 9.
9
Predict and analyze S-nitrosylation modification sites with the mRMR and IFS approaches.采用 mRMR 和 IFS 方法预测和分析 S-亚硝基化修饰位点。
J Proteomics. 2012 Feb 16;75(5):1654-65. doi: 10.1016/j.jprot.2011.12.003. Epub 2011 Dec 11.
10
Prediction of protein domain with mRMR feature selection and analysis.基于 mRMR 特征选择的蛋白质结构域预测及分析。
PLoS One. 2012;7(6):e39308. doi: 10.1371/journal.pone.0039308. Epub 2012 Jun 15.

引用本文的文献

1
Unravelling the human taste receptor interactome: machine learning and molecular modelling insights into protein-protein interactions.解析人类味觉受体相互作用组:机器学习与蛋白质-蛋白质相互作用的分子建模见解
NPJ Sci Food. 2025 Jul 1;9(1):113. doi: 10.1038/s41538-025-00478-9.
2
Accelerating the Screening of Small Peptide Ligands by Combining Peptide-Protein Docking and Machine Learning.通过肽-蛋白对接和机器学习加速小分子肽配体的筛选。
Int J Mol Sci. 2023 Jul 29;24(15):12144. doi: 10.3390/ijms241512144.

本文引用的文献

1
MoRF-FUNCpred: Molecular Recognition Feature Function Prediction Based on Multi-Label Learning and Ensemble Learning.MoRF-FUNCpred:基于多标签学习和集成学习的分子识别特征功能预测
Front Pharmacol. 2022 Mar 8;13:856417. doi: 10.3389/fphar.2022.856417. eCollection 2022.
2
CWLy-RF: A novel approach for identifying cell wall lyases based on random forest classifier.CWLy-RF:一种基于随机森林分类器识别细胞壁裂解酶的新方法。
Genomics. 2021 Sep;113(5):2919-2924. doi: 10.1016/j.ygeno.2021.06.038. Epub 2021 Jun 27.
3
Computationally identifying hot spots in protein-DNA binding interfaces using an ensemble approach.
使用集成方法计算蛋白质-DNA 结合界面中的热点。
BMC Bioinformatics. 2020 Sep 17;21(Suppl 13):384. doi: 10.1186/s12859-020-03675-3.
4
How is structural divergence related to evolutionary information?结构分歧与进化信息有何关系?
Mol Phylogenet Evol. 2018 Oct;127:859-866. doi: 10.1016/j.ympev.2018.06.033. Epub 2018 Jun 25.
5
Prediction of protein-protein interaction sites in sequences and 3D structures by random forests.利用随机森林预测序列和三维结构中的蛋白质-蛋白质相互作用位点。
PLoS Comput Biol. 2009 Jan;5(1):e1000278. doi: 10.1371/journal.pcbi.1000278. Epub 2009 Jan 30.
6
Prediction of protein-protein binding site by using core interface residue and support vector machine.利用核心界面残基和支持向量机预测蛋白质-蛋白质结合位点
BMC Bioinformatics. 2008 Dec 22;9:553. doi: 10.1186/1471-2105-9-553.
7
Structural descriptor database: a new tool for sequence-based functional site prediction.结构描述符数据库:一种基于序列的功能位点预测新工具。
BMC Bioinformatics. 2008 Nov 25;9:492. doi: 10.1186/1471-2105-9-492.
8
How proteins get in touch: interface prediction in the study of biomolecular complexes.蛋白质如何相互作用:生物分子复合物研究中的界面预测
Curr Protein Pept Sci. 2008 Aug;9(4):394-406. doi: 10.2174/138920308785132712.
9
ISIS: interaction sites identified from sequence.ISIS:从序列中识别出的相互作用位点。
Bioinformatics. 2007 Jan 15;23(2):e13-6. doi: 10.1093/bioinformatics/btl303.
10
Protein-protein interaction site prediction based on conditional random fields.基于条件随机场的蛋白质-蛋白质相互作用位点预测
Bioinformatics. 2007 Mar 1;23(5):597-604. doi: 10.1093/bioinformatics/btl660. Epub 2007 Jan 18.