• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质标注预测的度量标记和半度量嵌入。

Metric Labeling and Semimetric Embedding for Protein Annotation Prediction.

机构信息

Department of Computer Science, Ozyegin University, Istanbul, Turkey.

Computational Biology Department, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA.

出版信息

J Comput Biol. 2021 May;28(5):514-525. doi: 10.1089/cmb.2020.0425. Epub 2020 Dec 23.

DOI:10.1089/cmb.2020.0425
PMID:33370163
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8165475/
Abstract

Computational techniques have been successful at predicting protein function from relational data (functional or physical interactions). These techniques have been used to generate hypotheses and to direct experimental validation. With few exceptions, the task is modeled as multilabel classification problems where the labels (functions) are treated independently or semi-independently. However, databases such as the Gene Ontology provide information about the similarities between functions. We explore the use of the Metric Labeling combinatorial optimization problem to make use of heuristically computed distances between functions to make more accurate predictions of protein function in networks derived from both physical interactions and a combination of other data types. To do this, we give a new technique (based on convex optimization) for converting heuristic semimetric distances into a metric with minimum least-squared distortion (LSD). The Metric Labeling approach is shown to outperform five existing techniques for inferring function from networks. These results suggest that Metric Labeling is useful for protein function prediction, and that LSD minimization can help solve the problem of converting heuristic distances to a metric.

摘要

计算技术在从关系数据(功能或物理相互作用)预测蛋白质功能方面取得了成功。这些技术已被用于生成假设并指导实验验证。除了少数例外,任务被建模为多标签分类问题,其中标签(功能)被独立或半独立地对待。然而,像基因本体论这样的数据库提供了关于功能之间相似性的信息。我们探索使用度量标记组合优化问题来利用启发式计算的功能之间的距离,以便更准确地预测来自物理相互作用和其他数据类型组合的网络中的蛋白质功能。为此,我们提出了一种新的技术(基于凸优化),用于将启发式半度量距离转换为具有最小最小二乘失真(LSD)的度量。度量标记方法在从网络推断功能方面优于五种现有技术。这些结果表明,度量标记对于蛋白质功能预测很有用,并且 LSD 最小化可以帮助解决将启发式距离转换为度量的问题。

相似文献

1
Metric Labeling and Semimetric Embedding for Protein Annotation Prediction.蛋白质标注预测的度量标记和半度量嵌入。
J Comput Biol. 2021 May;28(5):514-525. doi: 10.1089/cmb.2020.0425. Epub 2020 Dec 23.
2
Protein complex prediction in large ontology attributed protein-protein interaction networks.大型本体属性蛋白质 - 蛋白质相互作用网络中的蛋白质复合物预测
IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):729-41. doi: 10.1109/TCBB.2013.86.
3
Integrating multiple networks for protein function prediction.整合多个网络用于蛋白质功能预测。
BMC Syst Biol. 2015;9 Suppl 1(Suppl 1):S3. doi: 10.1186/1752-0509-9-S1-S3. Epub 2015 Jan 21.
4
5
AVID: an integrative framework for discovering functional relationships among proteins.AVID:一个用于发现蛋白质间功能关系的综合框架。
BMC Bioinformatics. 2005 Jun 1;6:136. doi: 10.1186/1471-2105-6-136.
6
Information theory applied to the sparse gene ontology annotation network to predict novel gene function.信息论应用于稀疏基因本体注释网络以预测新的基因功能。
Bioinformatics. 2007 Jul 1;23(13):i529-38. doi: 10.1093/bioinformatics/btm195.
7
Identification of functional hubs and modules by converting interactome networks into hierarchical ordering of proteins.通过将互作网络转化为蛋白质的层级排序来鉴定功能枢纽和模块。
BMC Bioinformatics. 2010 Apr 29;11 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2105-11-S3-S3.
8
Semi-supervised multi-label collective classification ensemble for functional genomics.用于功能基因组学的半监督多标签集体分类集成方法
BMC Genomics. 2014;15 Suppl 9(Suppl 9):S17. doi: 10.1186/1471-2164-15-S9-S17. Epub 2014 Dec 8.
9
Exploratory analysis of protein translation regulatory networks using hierarchical random graphs.使用层次随机图探索性分析蛋白质翻译调控网络。
BMC Bioinformatics. 2010 Apr 29;11 Suppl 3(Suppl 3):S2. doi: 10.1186/1471-2105-11-S3-S2.
10
Comparison of topological clustering within protein networks using edge metrics that evaluate full sequence, full structure, and active site microenvironment similarity.使用评估全序列、全结构和活性位点微环境相似性的边度量对蛋白质网络内的拓扑聚类进行比较。
Protein Sci. 2015 Sep;24(9):1423-39. doi: 10.1002/pro.2724. Epub 2015 Aug 18.

引用本文的文献

1
Diffusion archeology for diffusion progression history reconstruction.用于扩散进程历史重建的扩散考古学。
Knowl Inf Syst. 2016 Nov;49(2):403-427. doi: 10.1007/s10115-015-0904-x. Epub 2015 Dec 11.
2
Exploiting ontology graph for predicting sparsely annotated gene function.利用本体图预测注释稀疏的基因功能。
Bioinformatics. 2015 Jun 15;31(12):i357-64. doi: 10.1093/bioinformatics/btv260.

本文引用的文献

1
The BioGRID interaction database: 2019 update.生物相互作用数据库(BioGRID):2019 年更新版。
Nucleic Acids Res. 2019 Jan 8;47(D1):D529-D541. doi: 10.1093/nar/gky1079.
2
Bayesian Markov Random Field analysis for protein function prediction based on network data.基于网络数据的蛋白质功能预测的贝叶斯马尔可夫随机场分析。
PLoS One. 2010 Feb 24;5(2):e9293. doi: 10.1371/journal.pone.0009293.
3
Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering.见林又见树:利用基因本体论重组层次聚类
Bioinformatics. 2009 Jul 15;25(14):1789-95. doi: 10.1093/bioinformatics/btp327. Epub 2009 Jun 3.
4
Approximate labeling via graph cuts based on linear programming.基于线性规划的通过图割进行近似标记
IEEE Trans Pattern Anal Mach Intell. 2007 Aug;29(8):1436-53. doi: 10.1109/TPAMI.2007.1061.
5
Network-based prediction of protein function.基于网络的蛋白质功能预测。
Mol Syst Biol. 2007;3:88. doi: 10.1038/msb4100129. Epub 2007 Mar 13.
6
A new measure for functional similarity of gene products based on Gene Ontology.一种基于基因本体论的基因产物功能相似性新度量方法。
BMC Bioinformatics. 2006 Jun 15;7:302. doi: 10.1186/1471-2105-7-302.
7
Diffusion kernel-based logistic regression models for protein function prediction.基于扩散核的蛋白质功能预测逻辑回归模型
OMICS. 2006 Spring;10(1):40-55. doi: 10.1089/omi.2006.10.40.
8
Hierarchical multi-label prediction of gene function.基因功能的分层多标签预测
Bioinformatics. 2006 Apr 1;22(7):830-6. doi: 10.1093/bioinformatics/btk048. Epub 2006 Jan 12.
9
Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps.通过相互作用图谱的图论分析对蛋白质功能进行全蛋白质组预测。
Bioinformatics. 2005 Jun;21 Suppl 1:i302-10. doi: 10.1093/bioinformatics/bti1054.
10
A knowledge-based clustering algorithm driven by Gene Ontology.
J Biopharm Stat. 2004 Aug;14(3):687-700. doi: 10.1081/bip-200025659.