• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

提高一种基于支持向量机的蛋白质-蛋白质相互作用预测方法的性能。

Improving the performance of an SVM-based method for predicting protein-protein interactions.

作者信息

Dohkan Shinsuke, Koike Asako, Takagi Toshihisa

机构信息

Department of Computational Biology, Graduate School of Frontier Sciences, University of Tokyo, (CB01) 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba 277-8581, Japan.

出版信息

In Silico Biol. 2006;6(6):515-29.

PMID:17518762
Abstract

Predicting the interactions between all the possible pairs of proteins in a given organism (making a protein-protein interaction map) is a crucial subject in bioinformatics. Most of the previous methods based on supervised machine learning use datasets containing approximately the same number of interacting pairs of proteins (positives) and non-interacting pairs of proteins (negatives) for training a classifier and are estimated to yield a large number of false positives. Thinking that the negatives used in previous studies cannot adequately represent all the negatives that need to be taken into account, we have developed a method based on multiple Support Vector Machines (SVMs) that uses more negatives than positives for predicting interactions between pairs of yeast proteins and pairs of human proteins. We show that the performance of a single SVM improved as we increased the number of negatives used for training and that, if more than one CPU is available, an approach using multiple SVMs is useful not only for improving the performance of classifiers but also for reducing the time required for training them. Our approach can also be applied to assessing the reliability of high-throughput interactions.

摘要

预测给定生物体中所有可能蛋白质对之间的相互作用(构建蛋白质 - 蛋白质相互作用图谱)是生物信息学中的一个关键课题。以前大多数基于监督机器学习的方法使用的数据集包含大致相同数量的相互作用蛋白质对(阳性)和非相互作用蛋白质对(阴性)来训练分类器,据估计会产生大量假阳性。考虑到先前研究中使用的阴性不能充分代表所有需要考虑的阴性,我们开发了一种基于多个支持向量机(SVM)的方法,该方法在预测酵母蛋白质对和人类蛋白质对之间的相互作用时使用的阴性比阳性更多。我们表明,随着用于训练的阴性数量增加,单个支持向量机的性能会提高,并且,如果有多个CPU可用,使用多个支持向量机的方法不仅有助于提高分类器的性能,还能减少训练所需的时间。我们的方法也可用于评估高通量相互作用的可靠性。

相似文献

1
Improving the performance of an SVM-based method for predicting protein-protein interactions.提高一种基于支持向量机的蛋白质-蛋白质相互作用预测方法的性能。
In Silico Biol. 2006;6(6):515-29.
2
Predicting protein-protein interactions from sequence using correlation coefficient and high-quality interaction dataset.利用相关系数和高质量的交互数据集从序列预测蛋白质-蛋白质相互作用。
Amino Acids. 2010 Mar;38(3):891-9. doi: 10.1007/s00726-009-0295-y. Epub 2009 Apr 24.
3
Kernel methods for predicting protein-protein interactions.用于预测蛋白质-蛋白质相互作用的核方法。
Bioinformatics. 2005 Jun;21 Suppl 1:i38-46. doi: 10.1093/bioinformatics/bti1016.
4
Feature-based classification of native and non-native protein-protein interactions: Comparing supervised and semi-supervised learning approaches.基于特征的天然和非天然蛋白质-蛋白质相互作用分类:比较监督学习和半监督学习方法。
Proteomics. 2011 Nov;11(22):4321-30. doi: 10.1002/pmic.201100217. Epub 2011 Oct 17.
5
Training based on ligand efficiency improves prediction of bioactivities of ligands and drug target proteins in a machine learning approach.基于配体效率的训练可以提高机器学习方法中配体和药物靶标蛋白生物活性预测的准确性。
J Chem Inf Model. 2013 Oct 28;53(10):2525-37. doi: 10.1021/ci400240u. Epub 2013 Sep 24.
6
Increasing confidence of protein interactomes using network topological metrics.利用网络拓扑指标提高蛋白质相互作用组的可信度
Bioinformatics. 2006 Aug 15;22(16):1998-2004. doi: 10.1093/bioinformatics/btl335. Epub 2006 Jun 20.
7
A machine learning based method for the prediction of secretory proteins using amino acid composition, their order and similarity-search.一种基于机器学习的方法,利用氨基酸组成、顺序和相似性搜索来预测分泌蛋白。
In Silico Biol. 2008;8(2):129-40.
8
Quantitative assessment of the structural bias in protein-protein interaction assays.蛋白质-蛋白质相互作用检测中结构偏差的定量评估。
Proteomics. 2008 Nov;8(22):4657-67. doi: 10.1002/pmic.200800150.
9
Probabilistic prediction and ranking of human protein-protein interactions.人类蛋白质-蛋白质相互作用的概率预测与排序
BMC Bioinformatics. 2007 Jul 5;8:239. doi: 10.1186/1471-2105-8-239.
10
Prediction of protein structure class by coupling improved genetic algorithm and support vector machine.结合改进遗传算法与支持向量机预测蛋白质结构类别
Amino Acids. 2008 Oct;35(3):581-90. doi: 10.1007/s00726-008-0084-z. Epub 2008 Apr 22.

引用本文的文献

1
Graph-based prediction of Protein-protein interactions with attributed signed graph embedding.基于属性有向图嵌入的蛋白质-蛋白质相互作用的图预测。
BMC Bioinformatics. 2020 Jul 21;21(1):323. doi: 10.1186/s12859-020-03646-8.
2
The Helitron family classification using SVM based on Fourier transform features applied on an unbalanced dataset.基于傅里叶变换特征的支持向量机在不平衡数据集上的Helitron 家族分类。
Med Biol Eng Comput. 2019 Oct;57(10):2289-2304. doi: 10.1007/s11517-019-02027-5. Epub 2019 Aug 17.
3
A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences.
一种用于从氨基酸序列预测蛋白质-蛋白质相互作用的新型序列描述符矩阵。
PLoS One. 2019 Jun 7;14(6):e0217312. doi: 10.1371/journal.pone.0217312. eCollection 2019.
4
Evolution of In Silico Strategies for Protein-Protein Interaction Drug Discovery.基于计算机的策略在蛋白质-蛋白质相互作用药物研发中的发展。
Molecules. 2018 Aug 6;23(8):1963. doi: 10.3390/molecules23081963.
5
in silico identification of protein-protein interactions in Silkworm, Bombyx mori.家蚕蛋白质-蛋白质相互作用的计算机模拟鉴定
Bioinformation. 2014 Feb 19;10(2):56-62. doi: 10.6026/97320630010056. eCollection 2014.
6
Computational Prediction of Protein-Protein Interaction Networks: Algo-rithms and Resources.计算蛋白质-蛋白质相互作用网络的预测:算法和资源。
Curr Genomics. 2013 Sep;14(6):397-414. doi: 10.2174/1389202911314060004.
7
Comparative pathogenesis and systems biology for biodefense virus vaccine development.用于生物防御病毒疫苗开发的比较发病机制与系统生物学
J Biomed Biotechnol. 2010;2010:236528. doi: 10.1155/2010/236528. Epub 2010 Jun 6.
8
Predicting protein-protein interactions in unbalanced data using the primary structure of proteins.利用蛋白质的一级结构预测不平衡数据中的蛋白质-蛋白质相互作用。
BMC Bioinformatics. 2010 Apr 2;11:167. doi: 10.1186/1471-2105-11-167.
9
Large-scale prediction of protein-protein interactions from structures.从结构大规模预测蛋白质-蛋白质相互作用。
BMC Bioinformatics. 2010 Mar 18;11:144. doi: 10.1186/1471-2105-11-144.
10
Predicting the protein-protein interactions using primary structures with predicted protein surface.利用预测的蛋白质表面的一级结构预测蛋白质-蛋白质相互作用。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2105-11-S1-S3.