• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于序列的ATP结合残基预测器,采用随机森林和mRMR-IFS特征选择方法。

Sequence-based predictor of ATP-binding residues using random forest and mRMR-IFS feature selection.

作者信息

Ma Xin, Sun Xiao

机构信息

Golden Audit College, Nanjing Audit University, Nanjing 210029, China.

State Key Laboratory of Bioelectronics, Southeast University, Nanjing 210096, China.

出版信息

J Theor Biol. 2014 Nov 7;360:59-66. doi: 10.1016/j.jtbi.2014.06.037. Epub 2014 Jul 8.

DOI:10.1016/j.jtbi.2014.06.037
PMID:25014477
Abstract

We develop a computational and statistical approach (ATPBR) for predicting ATP-binding residues in proteins from amino acid sequences by using random forests with a novel hybrid feature. The hybrid feature incorporates a new feature called PSSMPP, the predicted secondary structure and orthogonal binary vectors. The mRMR-IFS feature selection method is utilized to construct the best prediction model. At last, ATPBR achieves significantly improved performance over existing methods, with 87.53% accuracy and a Matthew׳s correlation coefficient of 0.554. In addition, our further analysis demonstrates that PSSMPP distinguishes more effectively between ATP-binding and non-binding residues. Besides, the optimal features selected by the mRMR-IFS method improve the prediction performance and may provide useful insights for revealing the mechanisms of ATP and proteins interactions.

摘要

我们开发了一种计算和统计方法(ATPBR),通过使用具有新型混合特征的随机森林,从氨基酸序列预测蛋白质中的ATP结合残基。该混合特征包含一种名为PSSMPP的新特征、预测的二级结构和正交二元向量。利用mRMR-IFS特征选择方法构建最佳预测模型。最后,ATPBR相对于现有方法实现了显著提高的性能,准确率为87.53%,马修斯相关系数为0.554。此外,我们的进一步分析表明,PSSMPP能更有效地区分ATP结合残基和非结合残基。此外,mRMR-IFS方法选择的最佳特征提高了预测性能,并可能为揭示ATP与蛋白质相互作用的机制提供有用的见解。

相似文献

1
Sequence-based predictor of ATP-binding residues using random forest and mRMR-IFS feature selection.基于序列的ATP结合残基预测器,采用随机森林和mRMR-IFS特征选择方法。
J Theor Biol. 2014 Nov 7;360:59-66. doi: 10.1016/j.jtbi.2014.06.037. Epub 2014 Jul 8.
2
Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection.基于序列的RNA结合蛋白预测:使用具有最小冗余最大相关特征选择的随机森林算法
Biomed Res Int. 2015;2015:425810. doi: 10.1155/2015/425810. Epub 2015 Oct 12.
3
DNABP: Identification of DNA-Binding Proteins Based on Feature Selection Using a Random Forest and Predicting Binding Residues.DNABP:基于随机森林特征选择识别DNA结合蛋白并预测结合残基
PLoS One. 2016 Dec 1;11(12):e0167345. doi: 10.1371/journal.pone.0167345. eCollection 2016.
4
Prediction of protein-protein interaction sites by random forest algorithm with mRMR and IFS.基于 mRMR 和 IFS 的随机森林算法预测蛋白质相互作用位点
PLoS One. 2012;7(8):e43927. doi: 10.1371/journal.pone.0043927. Epub 2012 Aug 28.
5
Prediction of RNA-binding residues in proteins from primary sequence using an enriched random forest model with a novel hybrid feature.基于新型混合特征的富集随机森林模型预测蛋白质中 RNA 结合残基的一级序列
Proteins. 2011 Apr;79(4):1230-9. doi: 10.1002/prot.22958. Epub 2011 Jan 25.
6
Predict and analyze S-nitrosylation modification sites with the mRMR and IFS approaches.采用 mRMR 和 IFS 方法预测和分析 S-亚硝基化修饰位点。
J Proteomics. 2012 Feb 16;75(5):1654-65. doi: 10.1016/j.jprot.2011.12.003. Epub 2011 Dec 11.
7
A novel method of predicting protein disordered regions based on sequence features.基于序列特征预测蛋白质无规则区域的新方法。
Biomed Res Int. 2013;2013:414327. doi: 10.1155/2013/414327. Epub 2013 Apr 22.
8
Prediction of interactiveness of proteins and nucleic acids based on feature selections.基于特征选择的蛋白质和核酸相互作用预测。
Mol Divers. 2010 Nov;14(4):627-33. doi: 10.1007/s11030-009-9198-9. Epub 2009 Oct 9.
9
Prediction of tyrosine sulfation with mRMR feature selection and analysis.酪氨酸硫酸化的预测与 mRMR 特征选择和分析。
J Proteome Res. 2010 Dec 3;9(12):6490-7. doi: 10.1021/pr1007152. Epub 2010 Nov 11.
10
Prediction of DNA-binding residues in proteins from amino acid sequences using a random forest model with a hybrid feature.使用具有混合特征的随机森林模型从氨基酸序列预测蛋白质中的DNA结合残基。
Bioinformatics. 2009 Jan 1;25(1):30-5. doi: 10.1093/bioinformatics/btn583. Epub 2008 Nov 12.

引用本文的文献

1
CPPred: coding potential prediction based on the global description of RNA sequence.CPPred:基于 RNA 序列全局描述的编码潜能预测。
Nucleic Acids Res. 2019 May 7;47(8):e43. doi: 10.1093/nar/gkz087.
2
DNABP: Identification of DNA-Binding Proteins Based on Feature Selection Using a Random Forest and Predicting Binding Residues.DNABP:基于随机森林特征选择识别DNA结合蛋白并预测结合残基
PLoS One. 2016 Dec 1;11(12):e0167345. doi: 10.1371/journal.pone.0167345. eCollection 2016.
3
Protein ligand-specific binding residue predictions by an ensemble classifier.
通过集成分类器预测蛋白质配体特异性结合残基
BMC Bioinformatics. 2016 Nov 17;17(1):470. doi: 10.1186/s12859-016-1348-3.
4
Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection.基于序列的RNA结合蛋白预测:使用具有最小冗余最大相关特征选择的随机森林算法
Biomed Res Int. 2015;2015:425810. doi: 10.1155/2015/425810. Epub 2015 Oct 12.