• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PETs: A Stable and Accurate Predictor of Protein-Protein Interacting Sites Based on Extremely-Randomized Trees.

作者信息

Xia Bin, Zhang Hong, Li Qianmu, Li Tao

出版信息

IEEE Trans Nanobioscience. 2015 Dec;14(8):882-93. doi: 10.1109/TNB.2015.2491303. Epub 2015 Oct 27.

DOI:10.1109/TNB.2015.2491303
PMID:26529772
Abstract

Protein-protein interaction (PPI) plays crucial roles in the performance of various biological processes. A variety of methods are dedicated to identify whether proteins have interaction residues, but it is often more crucial to recognize each amino acid. In practical applications, the stability of a prediction model is as important as its accuracy. However, random sampling, which is widely used in previous prediction models, often brings large difference between each training model. In this paper, a Predictor of protein-protein interaction sites based on Extremely-randomized Trees (PETs) is proposed to improve the prediction accuracy while maintaining the prediction stability. In PETs, a cluster-based sampling strategy is proposed to ensure the model stability: first, the training dataset is divided into subsets using specific features; second, the subsets are clustered using K-means; and finally the samples are selected from each cluster. Using the proposed sampling strategy, samples which have different types of significant features could be selected independently from different clusters. The evaluation shows that PETs is able to achieve better accuracy while maintaining a good stability. The source code and toolkit are available at https://github.com/BinXia/PETs.

摘要

相似文献

1
PETs: A Stable and Accurate Predictor of Protein-Protein Interacting Sites Based on Extremely-Randomized Trees.
IEEE Trans Nanobioscience. 2015 Dec;14(8):882-93. doi: 10.1109/TNB.2015.2491303. Epub 2015 Oct 27.
2
A Cascade Random Forests Algorithm for Predicting Protein-Protein Interaction Sites.一种用于预测蛋白质-蛋白质相互作用位点的级联随机森林算法。
IEEE Trans Nanobioscience. 2015 Oct;14(7):746-60. doi: 10.1109/TNB.2015.2475359. Epub 2015 Sep 28.
3
Imbalance Data Processing Strategy for Protein Interaction Sites Prediction.蛋白质相互作用位点预测的不平衡数据处理策略。
IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):985-994. doi: 10.1109/TCBB.2019.2953908. Epub 2021 Jun 3.
4
Prediction of protein-protein interactions based on PseAA composition and hybrid feature selection.基于伪氨基酸组成和混合特征选择的蛋白质-蛋白质相互作用预测
Biochem Biophys Res Commun. 2009 Mar 6;380(2):318-22. doi: 10.1016/j.bbrc.2009.01.077. Epub 2009 Jan 24.
5
Computational design, construction, and characterization of a set of specificity determining residues in protein-protein interactions.计算设计、构建和鉴定蛋白质-蛋白质相互作用中一组特异性决定残基。
Proteins. 2012 Oct;80(10):2426-36. doi: 10.1002/prot.24127. Epub 2012 Jul 10.
6
Large scale statistical prediction of protein-protein interaction by potentially interacting domain (PID) pair.通过潜在相互作用结构域(PID)对进行蛋白质-蛋白质相互作用的大规模统计预测。
Genome Inform. 2002;13:42-50.
7
Predicting interacting residues using long-distance information and novel decoding in hidden Markov models.利用长程信息和隐马尔可夫模型中的新型解码技术预测相互作用残基。
IEEE Trans Nanobioscience. 2013 Sep;12(3):158-64. doi: 10.1109/TNB.2013.2263810. Epub 2013 Aug 15.
8
Heterogeneous data integration by tree-augmented naïve Bayes for protein-protein interactions prediction.基于树增强朴素贝叶斯的异质数据集成在蛋白质-蛋白质相互作用预测中的应用。
Proteomics. 2013 Jan;13(2):261-8. doi: 10.1002/pmic.201200326. Epub 2012 Dec 3.
9
Sequence-based prediction of protein-protein interaction sites with L1-logreg classifier.基于序列的 L1-logreg 分类器预测蛋白质-蛋白质相互作用位点。
J Theor Biol. 2014 May 7;348:47-54. doi: 10.1016/j.jtbi.2014.01.028. Epub 2014 Jan 31.
10
Discovery of stable and significant binding motif pairs from PDB complexes and protein interaction datasets.从蛋白质数据银行复合物和蛋白质相互作用数据集中发现稳定且显著的结合基序对。
Bioinformatics. 2005 Feb 1;21(3):314-24. doi: 10.1093/bioinformatics/bti019. Epub 2004 Sep 16.

引用本文的文献

1
Tailored machine learning for evaluating the long-term diabetes risk in older individuals: findings from the Irish Longitudinal Study on Ageing (TILDA).个体化机器学习评估老年人长期糖尿病风险:爱尔兰老龄化纵向研究(TILDA)的结果。
BMJ Open. 2023 May 30;13(5):e072991. doi: 10.1136/bmjopen-2023-072991.
2
An improved deep forest model for prediction of e-commerce consumers' repurchase behavior.一种改进的深度森林模型,用于预测电子商务消费者的回购行为。
PLoS One. 2021 Sep 20;16(9):e0255906. doi: 10.1371/journal.pone.0255906. eCollection 2021.
3
Extremely-randomized-tree-based Prediction of N-Methyladenosine Sites in .
基于极端随机树的……中N-甲基腺苷位点预测
Curr Genomics. 2020 Jan;21(1):26-33. doi: 10.2174/1389202921666200219125625.