• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于网络数据的蛋白质功能预测的贝叶斯马尔可夫随机场分析。

Bayesian Markov Random Field analysis for protein function prediction based on network data.

机构信息

Biometris, Wageningen University and Research Centre, Wageningen, The Netherlands.

出版信息

PLoS One. 2010 Feb 24;5(2):e9293. doi: 10.1371/journal.pone.0009293.

DOI:10.1371/journal.pone.0009293
PMID:20195360
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2827541/
Abstract

Inference of protein functions is one of the most important aims of modern biology. To fully exploit the large volumes of genomic data typically produced in modern-day genomic experiments, automated computational methods for protein function prediction are urgently needed. Established methods use sequence or structure similarity to infer functions but those types of data do not suffice to determine the biological context in which proteins act. Current high-throughput biological experiments produce large amounts of data on the interactions between proteins. Such data can be used to infer interaction networks and to predict the biological process that the protein is involved in. Here, we develop a probabilistic approach for protein function prediction using network data, such as protein-protein interaction measurements. We take a Bayesian approach to an existing Markov Random Field method by performing simultaneous estimation of the model parameters and prediction of protein functions. We use an adaptive Markov Chain Monte Carlo algorithm that leads to more accurate parameter estimates and consequently to improved prediction performance compared to the standard Markov Random Fields method. We tested our method using a high quality S. cereviciae validation network with 1622 proteins against 90 Gene Ontology terms of different levels of abstraction. Compared to three other protein function prediction methods, our approach shows very good prediction performance. Our method can be directly applied to protein-protein interaction or coexpression networks, but also can be extended to use multiple data sources. We apply our method to physical protein interaction data from S. cerevisiae and provide novel predictions, using 340 Gene Ontology terms, for 1170 unannotated proteins and we evaluate the predictions using the available literature.

摘要

蛋白质功能推断是现代生物学最重要的目标之一。为了充分利用现代基因组实验通常产生的大量基因组数据,迫切需要自动化的计算方法来预测蛋白质功能。已建立的方法使用序列或结构相似性来推断功能,但这些类型的数据不足以确定蛋白质作用的生物背景。当前的高通量生物学实验产生了大量关于蛋白质相互作用的数据。这些数据可用于推断相互作用网络,并预测蛋白质参与的生物学过程。在这里,我们使用网络数据(如蛋白质-蛋白质相互作用测量值)开发了一种基于概率的蛋白质功能预测方法。我们通过对模型参数进行同时估计和对蛋白质功能进行预测,采用了一种现有的马尔可夫随机场方法的贝叶斯方法。我们使用自适应马尔可夫链蒙特卡罗算法,与标准的马尔可夫随机场方法相比,该算法可以得到更准确的参数估计,从而提高预测性能。我们使用具有 1622 个蛋白质和 90 个不同抽象级别的基因本体术语的高质量 S. cereviciae 验证网络对我们的方法进行了测试。与其他三种蛋白质功能预测方法相比,我们的方法表现出非常好的预测性能。我们的方法可以直接应用于蛋白质相互作用或共表达网络,也可以扩展到使用多个数据源。我们将我们的方法应用于来自 S. cerevisiae 的物理蛋白质相互作用数据,并使用 340 个基因本体术语,对 1170 个未注释的蛋白质进行了新的预测,并使用可用的文献对预测进行了评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/0729ed7a6b69/pone.0009293.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/0e009ef4a376/pone.0009293.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/d741c521ee03/pone.0009293.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/85f02ccf327c/pone.0009293.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/bf33497f9e12/pone.0009293.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/b6ca83511817/pone.0009293.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/0729ed7a6b69/pone.0009293.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/0e009ef4a376/pone.0009293.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/d741c521ee03/pone.0009293.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/85f02ccf327c/pone.0009293.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/bf33497f9e12/pone.0009293.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/b6ca83511817/pone.0009293.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f9e/2827541/0729ed7a6b69/pone.0009293.g006.jpg

相似文献

1
Bayesian Markov Random Field analysis for protein function prediction based on network data.基于网络数据的蛋白质功能预测的贝叶斯马尔可夫随机场分析。
PLoS One. 2010 Feb 24;5(2):e9293. doi: 10.1371/journal.pone.0009293.
2
Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.利用一种新颖的无监督方法从加权蛋白质 - 蛋白质相互作用图预测蛋白质复合物:进化增强的马尔可夫聚类。
Artif Intell Med. 2015 Mar;63(3):181-9. doi: 10.1016/j.artmed.2014.12.012. Epub 2015 Feb 18.
3
A transdimensional Bayesian model for pattern recognition in DNA sequences.一种用于DNA序列模式识别的跨维度贝叶斯模型。
Biostatistics. 2008 Oct;9(4):668-85. doi: 10.1093/biostatistics/kxm058. Epub 2008 Mar 18.
4
Predicting protein function from protein/protein interaction data: a probabilistic approach.从蛋白质/蛋白质相互作用数据预测蛋白质功能:一种概率方法。
Bioinformatics. 2003;19 Suppl 1:i197-204. doi: 10.1093/bioinformatics/btg1026.
5
Markov chain Monte Carlo simulation of a Bayesian mixture model for gene network inference.贝叶斯混合模型的马尔可夫链蒙特卡罗模拟在基因网络推断中的应用。
Genes Genomics. 2019 May;41(5):547-555. doi: 10.1007/s13258-019-00789-8. Epub 2019 Feb 11.
6
Markov clustering versus affinity propagation for the partitioning of protein interaction graphs.用于蛋白质相互作用图划分的马尔可夫聚类与亲和传播算法
BMC Bioinformatics. 2009 Mar 30;10:99. doi: 10.1186/1471-2105-10-99.
7
Resolving the structure of interactomes with hierarchical agglomerative clustering.利用层次凝聚聚类解析互作组学结构。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S44. doi: 10.1186/1471-2105-12-S1-S44.
8
A full bayesian approach for boolean genetic network inference.一种用于布尔基因网络推理的全贝叶斯方法。
PLoS One. 2014 Dec 31;9(12):e115806. doi: 10.1371/journal.pone.0115806. eCollection 2014.
9
WNP: a novel algorithm for gene products annotation from weighted functional networks.WNP:一种从加权功能网络中注释基因产物的新算法。
PLoS One. 2012;7(6):e38767. doi: 10.1371/journal.pone.0038767. Epub 2012 Jun 28.
10
Joint clustering of protein interaction networks through Markov random walk.通过马尔可夫随机游走对蛋白质相互作用网络进行联合聚类。
BMC Syst Biol. 2014;8 Suppl 1(Suppl 1):S9. doi: 10.1186/1752-0509-8-S1-S9. Epub 2014 Jan 24.

引用本文的文献

1
GhostBuster: A Deep-Learning-based, Literature-Unbiased Gene Prioritization Tool for Gene Annotation Prediction.幽灵克星:一种基于深度学习、不受文献偏差影响的用于基因注释预测的基因优先级排序工具。
bioRxiv. 2025 Jun 27:2025.06.22.660948. doi: 10.1101/2025.06.22.660948.
2
A Gated Recurrent Unit based architecture for recognizing ontology concepts from biological literature.一种基于门控循环单元的架构,用于从生物文献中识别本体概念。
BioData Min. 2022 Sep 28;15(1):22. doi: 10.1186/s13040-022-00310-0.
3
Integrated Deadenylase Genetic Association Network and Transcriptome Analysis in Thoracic Carcinomas.

本文引用的文献

1
Stochastic relaxation, gibbs distributions, and the bayesian restoration of images.随机松弛,吉布斯分布,以及贝叶斯图像恢复。
IEEE Trans Pattern Anal Mach Intell. 1984 Jun;6(6):721-41. doi: 10.1109/tpami.1984.4767596.
2
Joining forces in the quest for orthologs.联手寻找直系同源物。
Genome Biol. 2009;10(9):403. doi: 10.1186/gb-2009-10-9-403. Epub 2009 Sep 29.
3
The rough guide to in silico function prediction, or how to use sequence and structure information to predict protein function.计算机模拟功能预测的简要指南,或如何利用序列和结构信息预测蛋白质功能。
胸癌中整合去腺苷酸化基因关联网络和转录组分析。
Molecules. 2022 May 12;27(10):3102. doi: 10.3390/molecules27103102.
4
Computational Identification of Guillain-Barré Syndrome-Related Genes by an mRNA Gene Expression Profile and a Protein-Protein Interaction Network.通过mRNA基因表达谱和蛋白质-蛋白质相互作用网络对吉兰-巴雷综合征相关基因进行计算识别
Front Mol Neurosci. 2022 Mar 17;15:850209. doi: 10.3389/fnmol.2022.850209. eCollection 2022.
5
Protein function prediction with gene ontology: from traditional to deep learning models.利用基因本体进行蛋白质功能预测:从传统模型到深度学习模型
PeerJ. 2021 Aug 24;9:e12019. doi: 10.7717/peerj.12019. eCollection 2021.
6
Metric Labeling and Semimetric Embedding for Protein Annotation Prediction.蛋白质标注预测的度量标记和半度量嵌入。
J Comput Biol. 2021 May;28(5):514-525. doi: 10.1089/cmb.2020.0425. Epub 2020 Dec 23.
7
Defects in the rice aconitase-encoding OsACO1 gene alter iron homeostasis.水稻 aconitase 编码基因 OsACO1 的缺陷改变了铁的稳态。
Plant Mol Biol. 2020 Dec;104(6):629-645. doi: 10.1007/s11103-020-01065-0. Epub 2020 Sep 9.
8
Common Functions of Disordered Proteins across Evolutionary Distant Organisms.进化上远缘生物体中无序蛋白质的常见功能。
Int J Mol Sci. 2020 Mar 19;21(6):2105. doi: 10.3390/ijms21062105.
9
Identification of Triple-Negative Breast Cancer Genes and a Novel High-Risk Breast Cancer Prediction Model Development Based on PPI Data and Support Vector Machines.基于蛋白质-蛋白质相互作用数据和支持向量机的三阴性乳腺癌基因鉴定及新型高危乳腺癌预测模型开发
Front Genet. 2019 Mar 15;10:180. doi: 10.3389/fgene.2019.00180. eCollection 2019.
10
Improving protein function prediction using protein sequence and GO-term similarities.利用蛋白质序列和 GO 术语相似性提高蛋白质功能预测。
Bioinformatics. 2019 Apr 1;35(7):1116-1124. doi: 10.1093/bioinformatics/bty751.
PLoS Comput Biol. 2008 Oct;4(10):e1000160. doi: 10.1371/journal.pcbi.1000160. Epub 2008 Oct 31.
4
The quest for orthologs: finding the corresponding gene across genomes.寻找直系同源基因:在不同基因组中找到对应的基因。
Trends Genet. 2008 Nov;24(11):539-51. doi: 10.1016/j.tig.2008.08.009. Epub 2008 Sep 24.
5
Genetic and structural analysis of Hmg2p-induced endoplasmic reticulum remodeling in Saccharomyces cerevisiae.酿酒酵母中Hmg2p诱导的内质网重塑的遗传与结构分析
Mol Biol Cell. 2008 Oct;19(10):4506-20. doi: 10.1091/mbc.e07-11-1188. Epub 2008 Jul 30.
6
From E-MAPs to module maps: dissecting quantitative genetic interactions using physical interactions.从E-MAPs到模块图谱:利用物理相互作用剖析定量遗传相互作用
Mol Syst Biol. 2008;4:209. doi: 10.1038/msb.2008.42. Epub 2008 Jul 15.
7
GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function.GeneMANIA:一种用于预测基因功能的实时多重关联网络整合算法。
Genome Biol. 2008;9 Suppl 1(Suppl 1):S4. doi: 10.1186/gb-2008-9-s1-s4. Epub 2008 Jun 27.
8
Predicting gene function in a hierarchical context with an ensemble of classifiers.使用分类器集成在分层背景下预测基因功能。
Genome Biol. 2008;9 Suppl 1(Suppl 1):S3. doi: 10.1186/gb-2008-9-s1-s3. Epub 2008 Jun 27.
9
A critical assessment of Mus musculus gene function prediction using integrated genomic evidence.利用整合基因组证据对小家鼠基因功能预测的批判性评估。
Genome Biol. 2008;9 Suppl 1(Suppl 1):S2. doi: 10.1186/gb-2008-9-s1-s2. Epub 2008 Jun 27.
10
A yeast exosome cofactor, Mpp6, functions in RNA surveillance and in the degradation of noncoding RNA transcripts.一种酵母外切体辅因子Mpp6在RNA监测和非编码RNA转录本的降解中发挥作用。
Mol Cell Biol. 2008 Sep;28(17):5446-57. doi: 10.1128/MCB.00463-08. Epub 2008 Jun 30.