• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种利用网络结构进行蛋白质特性预测的统计方法。

A statistical approach using network structure in the prediction of protein characteristics.

作者信息

Chen Pao-Yang, Deane Charlotte M, Reinert Gesine

机构信息

Department of Statistics, University of Oxford, Oxford, OX1 3TG, UK.

出版信息

Bioinformatics. 2007 Sep 1;23(17):2314-21. doi: 10.1093/bioinformatics/btm342. Epub 2007 Jun 28.

DOI:10.1093/bioinformatics/btm342
PMID:17599931
Abstract

MOTIVATION

The Majority Vote approach has demonstrated that protein-protein interactions can be used to predict the structure or function of a protein. In this article we propose a novel method for the prediction of such protein characteristics based on frequencies of pairwise interactions. In addition, we study a second new approach using the pattern frequencies of triplets of proteins, thus for the first time taking network structure explicitly into account. Both these methods are extended to jointly consider multiple organisms and multiple characteristics.

RESULTS

Compared to the standard non-network-based method, namely the Majority Vote method, in large networks our predictions tend to be more accurate. For structure prediction, the Frequency-based method reaches up to 71% accuracy, and the Triplet-based method reaches up to 72% accuracy, whereas for function prediction, both the Triplet-based method and the Frequency-based method reach up to 90% accuracy. Function prediction on proteins without homologues showed slightly less but comparable accuracies. Including partially annotated proteins substantially increases the number of proteins for which our methods predict their characteristics with reasonable accuracy. We find that the enhanced Triplet-based method does not currently yield significantly better results than the enhanced Frequency-based method, suggesting that triplets of interactions do not contain substantially more information about protein characteristics than interaction pairs. Our methods offer two main improvements over current approaches--first, multiple protein characteristics are considered simultaneously, and second, data is integrated from multiple species. In addition, the Triplet-based method includes network structure more explicitly than the Majority Vote and the Frequency-based method.

AVAILABILITY

The program is available upon request.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

多数投票法已证明蛋白质-蛋白质相互作用可用于预测蛋白质的结构或功能。在本文中,我们提出了一种基于成对相互作用频率预测此类蛋白质特征的新方法。此外,我们研究了另一种使用蛋白质三联体模式频率的新方法,从而首次明确考虑了网络结构。这两种方法都被扩展到联合考虑多种生物体和多种特征。

结果

与标准的非基于网络的方法(即多数投票法)相比,在大型网络中我们的预测往往更准确。对于结构预测,基于频率的方法准确率高达71%,基于三联体的方法准确率高达72%;而对于功能预测,基于三联体的方法和基于频率的方法准确率均高达90%。对无同源物的蛋白质进行功能预测时,准确率略低但相当。纳入部分注释的蛋白质显著增加了我们的方法能够以合理准确率预测其特征的蛋白质数量。我们发现,目前增强后的基于三联体的方法并未比增强后的基于频率的方法产生明显更好的结果,这表明相互作用三联体所包含的关于蛋白质特征的信息并不比相互作用对多得多。我们的方法相对于当前方法有两个主要改进——第一,同时考虑了多种蛋白质特征;第二,整合了来自多个物种的数据。此外,基于三联体的方法比多数投票法和基于频率的方法更明确地纳入了网络结构。

可用性

可应要求提供该程序。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
A statistical approach using network structure in the prediction of protein characteristics.一种利用网络结构进行蛋白质特性预测的统计方法。
Bioinformatics. 2007 Sep 1;23(17):2314-21. doi: 10.1093/bioinformatics/btm342. Epub 2007 Jun 28.
2
Efficient estimation of graphlet frequency distributions in protein-protein interaction networks.蛋白质-蛋白质相互作用网络中图形频率分布的高效估计
Bioinformatics. 2006 Apr 15;22(8):974-80. doi: 10.1093/bioinformatics/btl030. Epub 2006 Feb 1.
3
Fast protein classification with multiple networks.使用多个网络进行快速蛋白质分类。
Bioinformatics. 2005 Sep 1;21 Suppl 2:ii59-65. doi: 10.1093/bioinformatics/bti1110.
4
Edge-based scoring and searching method for identifying condition-responsive protein-protein interaction sub-network.用于识别疾病状态响应性蛋白质-蛋白质相互作用子网的基于边缘的评分和搜索方法
Bioinformatics. 2007 Aug 15;23(16):2121-8. doi: 10.1093/bioinformatics/btm294. Epub 2007 Jun 1.
5
Selective integration of multiple biological data for supervised network inference.用于监督网络推理的多生物数据的选择性整合。
Bioinformatics. 2005 May 15;21(10):2488-95. doi: 10.1093/bioinformatics/bti339. Epub 2005 Feb 22.
6
Statistical prediction of protein chemical interactions based on chemical structure and mass spectrometry data.基于化学结构和质谱数据的蛋白质化学相互作用的统计预测
Bioinformatics. 2007 Aug 1;23(15):2004-12. doi: 10.1093/bioinformatics/btm266. Epub 2007 May 17.
7
Improving protein protein interaction prediction based on phylogenetic information using a least-squares support vector machine.基于系统发育信息,使用最小二乘支持向量机改进蛋白质-蛋白质相互作用预测。
Ann N Y Acad Sci. 2007 Dec;1115:154-67. doi: 10.1196/annals.1407.005. Epub 2007 Oct 9.
8
Protein complex prediction via cost-based clustering.基于成本聚类的蛋白质复合物预测
Bioinformatics. 2004 Nov 22;20(17):3013-20. doi: 10.1093/bioinformatics/bth351. Epub 2004 Jun 4.
9
Fitting a geometric graph to a protein-protein interaction network.将几何图拟合到蛋白质-蛋白质相互作用网络。
Bioinformatics. 2008 Apr 15;24(8):1093-9. doi: 10.1093/bioinformatics/btn079. Epub 2008 Mar 14.
10
Prediction of Ras-effector interactions using position energy matrices.使用位置能量矩阵预测Ras效应器相互作用。
Bioinformatics. 2007 Sep 1;23(17):2226-30. doi: 10.1093/bioinformatics/btm336. Epub 2007 Jun 28.

引用本文的文献

1
Identification of immunity-related genes in Arabidopsis and cassava using genomic data.利用基因组数据鉴定拟南芥和木薯中的免疫相关基因。
Genomics Proteomics Bioinformatics. 2013 Dec;11(6):345-53. doi: 10.1016/j.gpb.2013.09.010. Epub 2013 Dec 6.
2
Predicting and validating protein interactions using network structure.利用网络结构预测和验证蛋白质相互作用。
PLoS Comput Biol. 2008 Jul 25;4(7):e1000118. doi: 10.1371/journal.pcbi.1000118.