• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ProtLoc-GRPO:使用基于图的模型和强化学习进行细胞系特异性亚细胞定位预测。

ProtLoc-GRPO: Cell line-specific subcellular localization prediction using a graph-based model and reinforcement learning.

作者信息

Zeng Shuai, Zhang Weinan, Li Chaohan, Jiang Yuexu, Wang Duolin, Shao Qing, Xu Dong

机构信息

Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA.

Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA.

出版信息

bioRxiv. 2025 Jul 22:2025.07.17.665451. doi: 10.1101/2025.07.17.665451.

DOI:10.1101/2025.07.17.665451
PMID:40777512
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12330487/
Abstract

Subcellular localization prediction is crucial for understanding protein functions and cellular processes. Subcellular localization is dependent on tissue and cell lines derived from different cell types. Predicting cell line-specific subcellular localization using the information of protein-protein interactions (PPIs) offers deeper insights into dynamic cellular organization and molecular mechanisms. However, many existing PPI networks contain systematic errors that limit prediction accuracy. In this study, we propose a reinforcement learning approach, ProtLoc-GRPO, to enhance subcellular localization prediction by optimizing the structure of the underlying PPI network. ProtLoc-GRPO learns to rank and retain the most informative PPI edges to maximize the macro-F1 score for cell line-specific subcellular localization. Our approach yields a 7% improvement in macro-F1 score over the baseline. We further evaluate its robustness across various edge pruning rates and benchmark it against conventional pruning strategies. Results show that our proposed method consistently outperforms existing approaches. To our knowledge, this work represents the first study to predict cell line-specific protein subcellular localization and the first application of the Group Relative Policy Optimization (GRPO) framework to a graph-based model for bioinformatics tasks.

摘要

亚细胞定位预测对于理解蛋白质功能和细胞过程至关重要。亚细胞定位取决于源自不同细胞类型的组织和细胞系。利用蛋白质-蛋白质相互作用(PPI)信息预测细胞系特异性亚细胞定位,能更深入地了解动态细胞组织和分子机制。然而,许多现有的PPI网络存在系统性错误,限制了预测准确性。在本研究中,我们提出一种强化学习方法ProtLoc-GRPO,通过优化潜在PPI网络结构来增强亚细胞定位预测。ProtLoc-GRPO学习对最具信息性的PPI边进行排序和保留,以最大化细胞系特异性亚细胞定位的宏F1分数。我们的方法在宏F1分数上比基线提高了7%。我们进一步评估了其在各种边剪枝率下的稳健性,并与传统剪枝策略进行了基准测试。结果表明,我们提出的方法始终优于现有方法。据我们所知,这项工作代表了首次预测细胞系特异性蛋白质亚细胞定位的研究,也是首次将组相对策略优化(GRPO)框架应用于基于图的生物信息学任务模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/a6a07e1d520f/nihpp-2025.07.17.665451v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/59008b66d9f8/nihpp-2025.07.17.665451v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/9a53301a2dc6/nihpp-2025.07.17.665451v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/47b57182b288/nihpp-2025.07.17.665451v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/306feffb7bae/nihpp-2025.07.17.665451v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/a6a07e1d520f/nihpp-2025.07.17.665451v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/59008b66d9f8/nihpp-2025.07.17.665451v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/9a53301a2dc6/nihpp-2025.07.17.665451v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/47b57182b288/nihpp-2025.07.17.665451v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/306feffb7bae/nihpp-2025.07.17.665451v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0332/12330487/a6a07e1d520f/nihpp-2025.07.17.665451v1-f0005.jpg

相似文献

1
ProtLoc-GRPO: Cell line-specific subcellular localization prediction using a graph-based model and reinforcement learning.ProtLoc-GRPO:使用基于图的模型和强化学习进行细胞系特异性亚细胞定位预测。
bioRxiv. 2025 Jul 22:2025.07.17.665451. doi: 10.1101/2025.07.17.665451.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
4
Systemic Inflammatory Response Syndrome全身炎症反应综合征
5
Predicting cognitive decline: Deep-learning reveals subtle brain changes in pre-MCI stage.预测认知衰退:深度学习揭示轻度认知障碍前阶段大脑的细微变化。
J Prev Alzheimers Dis. 2025 May;12(5):100079. doi: 10.1016/j.tjpad.2025.100079. Epub 2025 Feb 6.
6
Short-Term Memory Impairment短期记忆障碍
7
An End-to-End Knowledge Graph Fused Graph Neural Network for Accurate Protein-Protein Interactions Prediction.一种用于准确预测蛋白质-蛋白质相互作用的端到端知识图谱融合图神经网络
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2518-2530. doi: 10.1109/TCBB.2024.3486216. Epub 2024 Dec 10.
8
Interplay between tumor mutation burden and the tumor microenvironment predicts the prognosis of pan-cancer anti-PD-1/PD-L1 therapy.肿瘤突变负荷与肿瘤微环境之间的相互作用可预测泛癌抗PD-1/PD-L1治疗的预后。
Front Immunol. 2025 Jul 24;16:1557461. doi: 10.3389/fimmu.2025.1557461. eCollection 2025.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
10
Long-term care plan recommendation for older adults with disabilities: a bipartite graph transformer and self-supervised approach.针对残疾老年人的长期护理计划建议:一种二分图变压器和自监督方法。
J Am Med Inform Assoc. 2025 Apr 1;32(4):689-701. doi: 10.1093/jamia/ocae327.

本文引用的文献

1
ProtNote: a multimodal method for protein-function annotation.ProtNote:一种用于蛋白质功能注释的多模态方法。
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf170.
2
Protein codes promote selective subcellular compartmentalization.蛋白质编码促进选择性亚细胞区室化。
Science. 2025 Mar 7;387(6738):1095-1101. doi: 10.1126/science.adq2634. Epub 2025 Feb 6.
3
The STRING database in 2025: protein networks with directionality of regulation.2025年的STRING数据库:具有调控方向性的蛋白质网络。
Nucleic Acids Res. 2025 Jan 6;53(D1):D730-D737. doi: 10.1093/nar/gkae1113.
4
Parameter-efficient fine-tuning on large protein language models improves signal peptide prediction.大型蛋白质语言模型的参数高效微调可提高信号肽预测的效果。
Genome Res. 2024 Oct 11;34(9):1445-1454. doi: 10.1101/gr.279132.124.
5
Ensembl 2024.Ensembl 2024.
Nucleic Acids Res. 2024 Jan 5;52(D1):D891-D899. doi: 10.1093/nar/gkad1049.
6
MULocDeep web service for protein localization prediction and visualization at subcellular and suborganellar levels.MULocDeep 网络服务,用于在亚细胞和亚细胞器水平上进行蛋白质定位预测和可视化。
Nucleic Acids Res. 2023 Jul 5;51(W1):W343-W349. doi: 10.1093/nar/gkad374.
7
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
8
The Gene Ontology knowledgebase in 2023.2023 版基因本体论知识库。
Genetics. 2023 May 4;224(1). doi: 10.1093/genetics/iyad031.
9
Networks as Biomarkers: Uses and Purposes.网络作为生物标志物:用途和目的。
Genes (Basel). 2023 Feb 8;14(2):429. doi: 10.3390/genes14020429.
10
DeepCellEss: cell line-specific essential protein prediction with attention-based interpretable deep learning.DeepCellEss:基于注意力机制的可解释深度学习的细胞系特异性必需蛋白预测。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac779.