• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用概率链图和基因本体结构进行蛋白质分类。

Protein classification using probabilistic chain graphs and the Gene Ontology structure.

作者信息

Carroll Steven, Pavlovic Vladimir

机构信息

Department of Computer Science, Rutgers University, Piscataway, NJ 08854, USA.

出版信息

Bioinformatics. 2006 Aug 1;22(15):1871-8. doi: 10.1093/bioinformatics/btl187. Epub 2006 May 16.

DOI:10.1093/bioinformatics/btl187
PMID:16705013
Abstract

MOTIVATION

Probabilistic graphical models have been developed in the past for the task of protein classification. In many cases, classifications obtained from the Gene Ontology have been used to validate these models. In this work we directly incorporate the structure of the Gene Ontology into the graphical representation for protein classification. We present a method in which each protein is represented by a replicate of the Gene Ontology structure, effectively modeling each protein in its own 'annotation space'. Proteins are also connected to one another according to different measures of functional similarity, after which belief propagation is run to make predictions at all ontology terms.

RESULTS

The proposed method was evaluated on a set of 4879 proteins from the Saccharomyces Genome Database whose interactions were also recorded in the GRID project. Results indicate that direct utilization of the Gene Ontology improves predictive ability, outperforming traditional models that do not take advantage of dependencies among functional terms. Average increase in accuracy (precision) of positive and negative term predictions of 27.8% (2.0%) over three different similarity measures and three subontologies was observed.

AVAILABILITY

C/C++/Perl implementation is available from authors upon request.

摘要

动机

过去已经开发了概率图形模型用于蛋白质分类任务。在许多情况下,从基因本体获得的分类已被用于验证这些模型。在这项工作中,我们将基因本体的结构直接纳入用于蛋白质分类的图形表示中。我们提出了一种方法,其中每个蛋白质由基因本体结构的一个副本表示,有效地在其自己的“注释空间”中对每个蛋白质进行建模。蛋白质还根据不同的功能相似性度量相互连接,然后运行信念传播以在所有本体术语上进行预测。

结果

在一组来自酵母基因组数据库的4879个蛋白质上评估了所提出的方法,这些蛋白质的相互作用也记录在GRID项目中。结果表明,直接利用基因本体提高了预测能力,优于不利用功能术语之间依赖性的传统模型。在三种不同的相似性度量和三个子本体上,观察到正项和负项预测的准确率(精确率)平均提高了27.8%(2.0%)。

可用性

作者可应要求提供C/C++/Perl实现。

相似文献

1
Protein classification using probabilistic chain graphs and the Gene Ontology structure.使用概率链图和基因本体结构进行蛋白质分类。
Bioinformatics. 2006 Aug 1;22(15):1871-8. doi: 10.1093/bioinformatics/btl187. Epub 2006 May 16.
2
How to decide which are the most pertinent overly-represented features during gene set enrichment analysis.如何在基因集富集分析中确定哪些是最相关的过度表达特征。
BMC Bioinformatics. 2007 Sep 11;8:332. doi: 10.1186/1471-2105-8-332.
3
Graph sharpening plus graph integration: a synergy that improves protein functional classification.图谱锐化加图谱整合:一种改善蛋白质功能分类的协同作用。
Bioinformatics. 2007 Dec 1;23(23):3217-24. doi: 10.1093/bioinformatics/btm511. Epub 2007 Oct 31.
4
Assessing protein similarity with Gene Ontology and its use in subnuclear localization prediction.利用基因本体论评估蛋白质相似性及其在核亚定位预测中的应用。
BMC Bioinformatics. 2006 Nov 7;7:491. doi: 10.1186/1471-2105-7-491.
5
A genetic similarity algorithm for searching the Gene Ontology terms and annotating anonymous protein sequences.一种用于搜索基因本体术语和注释匿名蛋白质序列的遗传相似性算法。
J Biomed Inform. 2008 Feb;41(1):65-81. doi: 10.1016/j.jbi.2007.05.010. Epub 2007 Jun 27.
6
Protein classification using ontology classification.使用本体分类法进行蛋白质分类。
Bioinformatics. 2006 Jul 15;22(14):e530-8. doi: 10.1093/bioinformatics/btl208.
7
Automatic extension of Gene Ontology with flexible identification of candidate terms.通过灵活识别候选术语自动扩展基因本体论
Bioinformatics. 2006 Mar 15;22(6):665-70. doi: 10.1093/bioinformatics/btl010. Epub 2006 Jan 21.
8
Annotating proteins by mining protein interaction networks.通过挖掘蛋白质相互作用网络对蛋白质进行注释。
Bioinformatics. 2006 Jul 15;22(14):e260-70. doi: 10.1093/bioinformatics/btl221.
9
Literature-based concept profiles for gene annotation: the issue of weighting.基于文献的基因注释概念概况:加权问题。
Int J Med Inform. 2008 May;77(5):354-62. doi: 10.1016/j.ijmedinf.2007.07.004. Epub 2007 Sep 10.
10
Support Vector Machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs.基于支持向量机,利用氨基酸残基和氨基酸残基对的结构特性对蛋白质折叠进行分类。
Bioinformatics. 2007 Dec 15;23(24):3320-7. doi: 10.1093/bioinformatics/btm527. Epub 2007 Nov 7.

引用本文的文献

1
Effusion: prediction of protein function from sequence similarity networks.积液:从序列相似性网络预测蛋白质功能。
Bioinformatics. 2019 Feb 1;35(3):442-451. doi: 10.1093/bioinformatics/bty672.
2
Detection of Interactions between Proteins by Using Legendre Moments Descriptor to Extract Discriminatory Information Embedded in PSSM.利用勒让德矩描述符提取PSSM中嵌入的鉴别信息来检测蛋白质之间的相互作用
Molecules. 2017 Aug 18;22(8):1366. doi: 10.3390/molecules22081366.
3
A combined approach for genome wide protein function annotation/prediction.
一种用于全基因组蛋白质功能注释/预测的联合方法。
Proteome Sci. 2013 Nov 7;11(Suppl 1):S1. doi: 10.1186/1477-5956-11-S1-S1.
4
Prediction of drugs target groups based on ChEBI ontology.基于 ChEBI 本体论预测药物靶标群体。
Biomed Res Int. 2013;2013:132724. doi: 10.1155/2013/132724. Epub 2013 Nov 20.
5
Ontologies for bioinformatics.用于生物信息学的本体论。
Bioinform Biol Insights. 2008 Mar 12;2:187-200. doi: 10.4137/bbi.s451.
6
Structural Learning of Chain Graphs via Decomposition.通过分解进行链图的结构学习
J Mach Learn Res. 2008 Dec 1;9:2847-2880.
7
Incorporating functional inter-relationships into protein function prediction algorithms.将功能相互关系纳入蛋白质功能预测算法。
BMC Bioinformatics. 2009 May 12;10:142. doi: 10.1186/1471-2105-10-142.
8
ProLoc-GO: utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization.ProLoc-GO:利用信息丰富的基因本体术语进行基于序列的蛋白质亚细胞定位预测。
BMC Bioinformatics. 2008 Feb 1;9:80. doi: 10.1186/1471-2105-9-80.
9
Predicting gene ontology functions from protein's regional surface structures.从蛋白质的区域表面结构预测基因本体功能。
BMC Bioinformatics. 2007 Dec 11;8:475. doi: 10.1186/1471-2105-8-475.
10
ProbCD: enrichment analysis accounting for categorization uncertainty.ProbCD:考虑分类不确定性的富集分析。
BMC Bioinformatics. 2007 Oct 12;8:383. doi: 10.1186/1471-2105-8-383.