Suppr超能文献

InterLabelGO+:揭示蛋白质功能预测中标签相关性。

InterLabelGO+: unraveling label correlations in protein function prediction.

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA.

Department of Biological Chemistry, University of Michigan, Ann Arbor, MI, 48109, USA.

出版信息

Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae655.

Abstract

MOTIVATION

Accurate protein function prediction is crucial for understanding biological processes and advancing biomedical research. However, the rapid growth of protein sequences far outpaces the experimental characterization of their functions, necessitating the development of automated computational methods.

RESULTS

We present InterLabelGO+, a hybrid approach that integrates a deep learning-based method with an alignment-based method for improved protein function prediction. InterLabelGO+ incorporates a novel loss function that addresses label dependency and imbalance and further enhances performance through dynamic weighting of the alignment-based component. A preliminary version of InterLabelGO+ achieved a strong performance in the CAFA5 challenge, ranking sixth out of 1625 participating teams. Comprehensive evaluations on large-scale protein function prediction tasks demonstrate InterLabelGO+'s ability to accurately predict Gene Ontology terms across various functional categories and evaluation metrics.

AVAILABILITY AND IMPLEMENTATION

The source code and datasets for InterLabelGO+ are freely available on GitHub at https://github.com/QuanEvans/InterLabelGO. A web-server is available at https://seq2fun.dcmb.med.umich.edu/InterLabelGO/. The software is implemented in Python and PyTorch, and is supported on Linux and macOS.

摘要

动机

准确的蛋白质功能预测对于理解生物过程和推进生物医学研究至关重要。然而,蛋白质序列的快速增长远远超过了其功能的实验表征,因此需要开发自动化的计算方法。

结果

我们提出了 InterLabelGO+,这是一种混合方法,它将基于深度学习的方法与基于比对的方法相结合,以提高蛋白质功能预测的准确性。InterLabelGO+采用了一种新颖的损失函数,解决了标签依赖性和不平衡问题,并通过动态加权比对方法进一步提高了性能。InterLabelGO+的初步版本在 CAFA5 挑战赛中表现出色,在 1625 个参赛团队中排名第六。在大规模蛋白质功能预测任务上的综合评估表明,InterLabelGO+能够准确预测各种功能类别和评估指标的基因本体术语。

可用性和实现

InterLabelGO+的源代码和数据集可在 GitHub 上免费获得,网址为 https://github.com/QuanEvans/InterLabelGO。一个网络服务器可在 https://seq2fun.dcmb.med.umich.edu/InterLabelGO/ 上访问。该软件是用 Python 和 PyTorch 实现的,支持 Linux 和 macOS。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验