• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过多标签相关半监督学习预测蛋白质功能。

Predicting protein function by multi-label correlated semi-supervised learning.

机构信息

Department of Information System, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1059-69. doi: 10.1109/TCBB.2011.156.

DOI:10.1109/TCBB.2011.156
PMID:22595236
Abstract

Assigning biological functions to uncharacterized proteins is a fundamental problem in the postgenomic era. The increasing availability of large amounts of data on protein-protein interactions (PPIs) has led to the emergence of a considerable number of computational methods for determining protein function in the context of a network. These algorithms, however, treat each functional class in isolation and thereby often suffer from the difficulty of the scarcity of labeled data. In reality, different functional classes are naturally dependent on one another. We propose a new algorithm, Multi-label Correlated Semi-supervised Learning (MCSL), to incorporate the intrinsic correlations among functional classes into protein function prediction by leveraging the relationships provided by the PPI network and the functional class network. The guiding intuition is that the classification function should be sufficiently smooth on subgraphs where the respective topologies of these two networks are a good match. We encode this intuition as regularized learning with intraclass and interclass consistency, which can be understood as an extension of the graph-based learning with local and global consistency (LGC) method. Cross validation on the yeast proteome illustrates that MCSL consistently outperforms several state-of-the-art methods. Most notably, it effectively overcomes the problem associated with scarcity of label data. The supplementary files are freely available at http://sites.google.com/site/csaijiang/MCSL.

摘要

将未被阐明的蛋白质赋予生物学功能是后基因组时代的一个基本问题。大量的蛋白质-蛋白质相互作用(PPIs)数据的不断增加,导致了大量的计算方法的出现,用于在网络背景下确定蛋白质的功能。然而,这些算法将每个功能类别孤立地处理,因此经常受到标记数据匮乏的困难的影响。实际上,不同的功能类别是自然相互依赖的。我们提出了一种新的算法,多标签相关半监督学习(MCSL),通过利用 PPI 网络和功能类别网络提供的关系,将功能类别之间的内在相关性纳入到蛋白质功能预测中。指导思想是,分类函数在子图上应该足够平滑,这些子图的拓扑结构与这两个网络的拓扑结构非常匹配。我们将这种直觉编码为具有类内和类间一致性的正则化学习,这可以理解为基于图的学习与局部和全局一致性(LGC)方法的扩展。在酵母蛋白质组上的交叉验证表明,MCSL 始终优于几种最先进的方法。最值得注意的是,它有效地克服了标签数据匮乏的问题。补充文件可在 http://sites.google.com/site/csaijiang/MCSL 上免费获取。

相似文献

1
Predicting protein function by multi-label correlated semi-supervised learning.通过多标签相关半监督学习预测蛋白质功能。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1059-69. doi: 10.1109/TCBB.2011.156.
2
Semi-supervised multi-label collective classification ensemble for functional genomics.用于功能基因组学的半监督多标签集体分类集成方法
BMC Genomics. 2014;15 Suppl 9(Suppl 9):S17. doi: 10.1186/1471-2164-15-S9-S17. Epub 2014 Dec 8.
3
Protein functional properties prediction in sparsely-label PPI networks through regularized non-negative matrix factorization.通过正则化非负矩阵分解在稀疏标记的蛋白质-蛋白质相互作用网络中预测蛋白质功能特性
BMC Syst Biol. 2015;9 Suppl 1(Suppl 1):S9. doi: 10.1186/1752-0509-9-S1-S9. Epub 2015 Jan 21.
4
AVID: an integrative framework for discovering functional relationships among proteins.AVID:一个用于发现蛋白质间功能关系的综合框架。
BMC Bioinformatics. 2005 Jun 1;6:136. doi: 10.1186/1471-2105-6-136.
5
Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins.半监督多任务学习预测 HIV-1 与人类蛋白质相互作用。
Bioinformatics. 2010 Sep 15;26(18):i645-52. doi: 10.1093/bioinformatics/btq394.
6
Protein-Protein Interactions Prediction via Multimodal Deep Polynomial Network and Regularized Extreme Learning Machine.基于多模态深度多项式网络和正则化极限学习机的蛋白质-蛋白质相互作用预测。
IEEE J Biomed Health Inform. 2019 May;23(3):1290-1303. doi: 10.1109/JBHI.2018.2845866. Epub 2018 Jun 12.
7
Function-function correlated multi-label protein function prediction over interaction networks.基于相互作用网络的功能-功能相关多标签蛋白质功能预测
J Comput Biol. 2013 Apr;20(4):322-43. doi: 10.1089/cmb.2012.0272.
8
Sparse Markov chain-based semi-supervised multi-instance multi-label method for protein function prediction.基于稀疏马尔可夫链的半监督多示例多标签蛋白质功能预测方法。
J Bioinform Comput Biol. 2015 Oct;13(5):1543001. doi: 10.1142/S0219720015430015. Epub 2015 Sep 16.
9
Sequential linear neighborhood propagation for semi-supervised protein function prediction.用于半监督蛋白质功能预测的序列线性邻域传播
J Bioinform Comput Biol. 2011 Dec;9(6):663-79. doi: 10.1142/s0219720011005550.
10
Discovering reliable protein interactions from high-throughput experimental data using network topology.利用网络拓扑结构从高通量实验数据中发现可靠的蛋白质相互作用。
Artif Intell Med. 2005 Sep-Oct;35(1-2):37-47. doi: 10.1016/j.artmed.2005.02.004.

引用本文的文献

1
Learning a generalized graph transformer for protein function prediction in dissimilar sequences.学习用于预测不同序列中蛋白质功能的广义图变换器。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae093.
2
PFP-GO: Integrating protein sequence, domain and protein-protein interaction information for protein function prediction using ranked GO terms.PFP-GO:利用排序后的基因本体(GO)术语整合蛋白质序列、结构域和蛋白质-蛋白质相互作用信息以进行蛋白质功能预测。
Front Genet. 2022 Sep 29;13:969915. doi: 10.3389/fgene.2022.969915. eCollection 2022.
3
Gene Ontology Capsule GAN: an improved architecture for protein function prediction.
基因本体胶囊生成对抗网络:一种用于蛋白质功能预测的改进架构。
PeerJ Comput Sci. 2022 Aug 15;8:e1014. doi: 10.7717/peerj-cs.1014. eCollection 2022.
4
DeepRTCP: Predicting ATP-Binding Cassette Transporters Based on 1-Dimensional Convolutional Network.深度实时定量聚合酶链反应:基于一维卷积网络预测ATP结合盒转运蛋白
Front Cell Dev Biol. 2021 Feb 1;8:614080. doi: 10.3389/fcell.2020.614080. eCollection 2020.
5
SDN2GO: An Integrated Deep Learning Model for Protein Function Prediction.SDN2GO:一种用于蛋白质功能预测的集成深度学习模型。
Front Bioeng Biotechnol. 2020 Apr 29;8:391. doi: 10.3389/fbioe.2020.00391. eCollection 2020.
6
FunPred 3.0: improved protein function prediction using protein interaction network.FunPred 3.0:利用蛋白质相互作用网络改进蛋白质功能预测
PeerJ. 2019 May 22;7:e6830. doi: 10.7717/peerj.6830. eCollection 2019.
7
Multiple kernels learning-based biological entity relationship extraction method.基于多核学习的生物实体关系提取方法。
J Biomed Semantics. 2017 Sep 20;8(Suppl 1):38. doi: 10.1186/s13326-017-0138-9.
8
DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.DeepGO:使用深度本体感知分类器从序列和相互作用预测蛋白质功能。
Bioinformatics. 2018 Feb 15;34(4):660-668. doi: 10.1093/bioinformatics/btx624.
9
Application of gap-constraints given sequential frequent pattern mining for protein function prediction.给定序列频繁模式挖掘的间隙约束在蛋白质功能预测中的应用。
Osong Public Health Res Perspect. 2015 Apr;6(2):112-20. doi: 10.1016/j.phrp.2015.01.006. Epub 2015 Feb 24.
10
Predicting protein functions using incomplete hierarchical labels.使用不完整的层次标签预测蛋白质功能。
BMC Bioinformatics. 2015 Jan 16;16:1. doi: 10.1186/s12859-014-0430-y.