• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于基因本体(GO)相似性的异质网络传播进行蛋白质功能预测

Protein function prediction using GO similarity-based heterogeneous network propagation.

作者信息

Hu Sai, Zhao Bihai

机构信息

School of Mathematics, Changsha University, Changsha, 410022, Hunan, China.

School of Computer Science and Engineering, Changsha University, Changsha, 410022, Hunan, China.

出版信息

Sci Rep. 2025 May 31;15(1):19131. doi: 10.1038/s41598-025-04933-1.

DOI:10.1038/s41598-025-04933-1
PMID:40450118
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12126536/
Abstract

Protein function prediction is a fundamental cornerstone in bioinformatics, providing critical insights into biological processes and disease mechanisms. Despite significant advances, challenges persist due to data sparsity and functional ambiguity. We introduce GOHPro (GO Similarity-based Heterogeneous Network Propagation), a novel method that constructs a heterogeneous network by integrating protein functional similarity (derived from domain profiles and modular complexes) with GO semantic relationships. This method applies a network propagation algorithm to prioritize annotations based on multi-omics context. When evaluated on yeast and human datasets, GOHPro outperformed six state-of-the-art methods. Specifically, it achieved F improvements ranging from 6.8 to 47.5% over methods like exp2GO across the Biological Process (BP), Molecular Function (MF), and Cellular Component (CC) ontologies in both yeast and human species. Rigorous case studies on proteins with shared domains, such as AAA + ATPases, demonstrated GOHPro's ability to resolve functional ambiguity by leveraging contextual interactions and modular complexes. Further validation on the CAFA3 benchmark confirmed its generalizability, with F gains exceeding 62% compared to baseline approaches in human species. Our analysis revealed that homology and network connectivity critically influence prediction robustness, with the modular similarity network compensating for evolutionary gaps in dark proteins. The framework's extensibility to de novo structural predictions highlights its potential to bridge the annotation gap in uncharacterized proteomes.

摘要

蛋白质功能预测是生物信息学的一个基本基石,为深入了解生物过程和疾病机制提供了关键见解。尽管取得了重大进展,但由于数据稀疏性和功能模糊性,挑战依然存在。我们引入了GOHPro(基于GO相似性的异质网络传播),这是一种通过将蛋白质功能相似性(源自结构域概况和模块化复合物)与GO语义关系相结合来构建异质网络的新方法。该方法应用网络传播算法,根据多组学背景对注释进行优先级排序。在酵母和人类数据集上进行评估时,GOHPro优于六种先进方法。具体而言,在酵母和人类物种的生物过程(BP)、分子功能(MF)和细胞成分(CC)本体中,它比exp2GO等方法在F值上有6.8%至47.5%的提升。对具有共享结构域的蛋白质(如AAA + ATP酶)进行的严格案例研究表明,GOHPro能够通过利用上下文相互作用和模块化复合物来解决功能模糊性问题。在CAFA3基准上的进一步验证证实了其通用性,在人类物种中,与基线方法相比,F值提升超过62%。我们的分析表明,同源性和网络连通性对预测稳健性有至关重要的影响,模块化相似性网络弥补了暗蛋白中的进化差距。该框架对从头结构预测的可扩展性突出了其弥合未表征蛋白质组注释差距的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/91ea38fce5df/41598_2025_4933_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/05859a623db6/41598_2025_4933_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/e527f7df7134/41598_2025_4933_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/856e2aa237d9/41598_2025_4933_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/7721ffb29fc4/41598_2025_4933_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/493ad2ee3ffd/41598_2025_4933_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/0f0517701233/41598_2025_4933_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/5d06c3e3db6c/41598_2025_4933_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/e7938e382632/41598_2025_4933_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/91ea38fce5df/41598_2025_4933_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/05859a623db6/41598_2025_4933_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/e527f7df7134/41598_2025_4933_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/856e2aa237d9/41598_2025_4933_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/7721ffb29fc4/41598_2025_4933_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/493ad2ee3ffd/41598_2025_4933_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/0f0517701233/41598_2025_4933_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/5d06c3e3db6c/41598_2025_4933_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/e7938e382632/41598_2025_4933_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61b2/12126536/91ea38fce5df/41598_2025_4933_Fig9_HTML.jpg

相似文献

1
Protein function prediction using GO similarity-based heterogeneous network propagation.基于基因本体(GO)相似性的异质网络传播进行蛋白质功能预测
Sci Rep. 2025 May 31;15(1):19131. doi: 10.1038/s41598-025-04933-1.
2
FunPredCATH: An ensemble method for predicting protein function using CATH.FunPredCATH:一种使用 CATH 预测蛋白质功能的集成方法。
Biochim Biophys Acta Proteins Proteom. 2024 Feb 1;1872(2):140985. doi: 10.1016/j.bbapap.2023.140985. Epub 2023 Dec 19.
3
Improving automatic GO annotation with semantic similarity.利用语义相似度提高 GO 自动注释的效果。
BMC Bioinformatics. 2022 Dec 12;23(Suppl 2):433. doi: 10.1186/s12859-022-04958-7.
4
Information theory applied to the sparse gene ontology annotation network to predict novel gene function.信息论应用于稀疏基因本体注释网络以预测新的基因功能。
Bioinformatics. 2007 Jul 1;23(13):i529-38. doi: 10.1093/bioinformatics/btm195.
5
Interspecies gene function prediction using semantic similarity.基于语义相似性的跨物种基因功能预测
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):121. doi: 10.1186/s12918-016-0361-5.
6
Mutual annotation-based prediction of protein domain functions with Domain2GO.基于互注释的蛋白质结构域功能预测与 Domain2GO。
Protein Sci. 2024 Jun;33(6):e4988. doi: 10.1002/pro.4988.
7
Protein function prediction from protein-protein interaction network using gene ontology based neighborhood analysis and physico-chemical features.基于基因本体的邻域分析和物理化学特征,从蛋白质-蛋白质相互作用网络预测蛋白质功能。
J Bioinform Comput Biol. 2018 Dec;16(6):1850025. doi: 10.1142/S0219720018500257. Epub 2018 Sep 19.
8
Consistent prediction of GO protein localization.GO 蛋白定位的一致性预测。
Sci Rep. 2018 May 17;8(1):7757. doi: 10.1038/s41598-018-26041-z.
9
A multi-objective evolutionary algorithm for detecting protein complexes in PPI networks using gene ontology.一种利用基因本体在蛋白质-蛋白质相互作用网络中检测蛋白质复合物的多目标进化算法。
Sci Rep. 2025 May 15;15(1):16855. doi: 10.1038/s41598-025-01667-y.
10
NPF:network propagation for protein function prediction.NPF:用于蛋白质功能预测的网络传播。
BMC Bioinformatics. 2020 Aug 12;21(1):355. doi: 10.1186/s12859-020-03663-7.

本文引用的文献

1
DeepGOMeta for functional insights into microbial communities using deep learning-based protein function prediction.使用基于深度学习的蛋白质功能预测对微生物群落进行功能洞察的DeepGOMeta
Sci Rep. 2024 Dec 30;14(1):31813. doi: 10.1038/s41598-024-82956-w.
2
Information propagation dynamics on heterogeneous-homogeneous coupling bi-layer networks.异质-同质耦合双层网络上的信息传播动力学
Sci Rep. 2024 Dec 28;14(1):30766. doi: 10.1038/s41598-024-80998-8.
3
Accurate prediction of protein function using statistics-informed graph networks.
使用统计信息驱动的图网络准确预测蛋白质功能。
Nat Commun. 2024 Aug 4;15(1):6601. doi: 10.1038/s41467-024-50955-0.
4
Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark.异构网络表示学习:一个包含综述与基准测试的统一框架
IEEE Trans Knowl Data Eng. 2022 Oct;34(10):4854-4873. doi: 10.1109/tkde.2020.3045924. Epub 2020 Dec 21.
5
Domain-PFP allows protein function prediction using function-aware domain embedding representations.域-PFP 使用感知功能的域嵌入表示来进行蛋白质功能预测。
Commun Biol. 2023 Oct 31;6(1):1103. doi: 10.1038/s42003-023-05476-9.
6
Sequence-structure-function relationships in the microbial protein universe.微生物蛋白质宇宙中的序列-结构-功能关系。
Nat Commun. 2023 Apr 26;14(1):2351. doi: 10.1038/s41467-023-37896-w.
7
Improving automatic GO annotation with semantic similarity.利用语义相似度提高 GO 自动注释的效果。
BMC Bioinformatics. 2022 Dec 12;23(Suppl 2):433. doi: 10.1186/s12859-022-04958-7.
8
Protein function annotation based on heterogeneous biological networks.基于异质生物网络的蛋白质功能注释。
BMC Bioinformatics. 2022 Nov 18;23(1):493. doi: 10.1186/s12859-022-05057-3.
9
exp2GO: Improving Prediction of Functions in the Gene Ontology With Expression Data.exp2GO:利用表达数据改进基因本体中功能的预测
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):999-1008. doi: 10.1109/TCBB.2022.3167245. Epub 2023 Apr 3.
10
The ubiquitin-dependent ATPase p97 removes cytotoxic trapped PARP1 from chromatin.泛素依赖性 ATP 酶 p97 将细胞毒性 PARP1 从染色质中移除。
Nat Cell Biol. 2022 Jan;24(1):62-73. doi: 10.1038/s41556-021-00807-6. Epub 2022 Jan 10.