• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过强化学习在蛋白质相互作用网络中检测分子复合物。

Molecular complex detection in protein interaction networks through reinforcement learning.

机构信息

Department of Molecular Biosciences, Center for Systems and Synthetic Biology, University of Texas, Austin, TX, 78712, USA.

Oden Institute for Computational Engineering and Sciences, University of Texas, Austin, TX, 78712, USA.

出版信息

BMC Bioinformatics. 2023 Aug 2;24(1):306. doi: 10.1186/s12859-023-05425-7.

DOI:10.1186/s12859-023-05425-7
PMID:37532987
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10394916/
Abstract

BACKGROUND

Proteins often assemble into higher-order complexes to perform their biological functions. Such protein-protein interactions (PPI) are often experimentally measured for pairs of proteins and summarized in a weighted PPI network, to which community detection algorithms can be applied to define the various higher-order protein complexes. Current methods include unsupervised and supervised approaches, often assuming that protein complexes manifest only as dense subgraphs. Utilizing supervised approaches, the focus is not on how to find them in a network, but only on learning which subgraphs correspond to complexes, currently solved using heuristics. However, learning to walk trajectories on a network to identify protein complexes leads naturally to a reinforcement learning (RL) approach, a strategy not extensively explored for community detection. Here, we develop and evaluate a reinforcement learning pipeline for community detection on weighted protein-protein interaction networks to detect new protein complexes. The algorithm is trained to calculate the value of different subgraphs encountered while walking on the network to reconstruct known complexes. A distributed prediction algorithm then scales the RL pipeline to search for novel protein complexes on large PPI networks.

RESULTS

The reinforcement learning pipeline is applied to a human PPI network consisting of 8k proteins and 60k PPI, which results in 1,157 protein complexes. The method demonstrated competitive accuracy with improved speed compared to previous algorithms. We highlight protein complexes such as C4orf19, C18orf21, and KIAA1522 which are currently minimally characterized. Additionally, the results suggest TMC04 be a putative additional subunit of the KICSTOR complex and confirm the involvement of C15orf41 in a higher-order complex with HIRA, CDAN1, ASF1A, and by 3D structural modeling.

CONCLUSIONS

Reinforcement learning offers several distinct advantages for community detection, including scalability and knowledge of the walk trajectories defining those communities. Applied to currently available human protein interaction networks, this method had comparable accuracy with other algorithms and notable savings in computational time, and in turn, led to clear predictions of protein function and interactions for several uncharacterized human proteins.

摘要

背景

蛋白质经常组装成更高阶的复合物来执行其生物功能。这些蛋白质-蛋白质相互作用(PPI)通常是通过实验测量成对蛋白质之间的相互作用,并总结在加权 PPI 网络中,然后可以应用社区检测算法来定义各种高阶蛋白质复合物。当前的方法包括无监督和监督方法,通常假设蛋白质复合物仅表现为密集子图。利用监督方法,重点不是如何在网络中找到它们,而是仅学习哪些子图对应于复合物,目前使用启发式方法解决。然而,学习在网络上行走轨迹以识别蛋白质复合物自然会导致强化学习(RL)方法,这是一种尚未广泛探索用于社区检测的策略。在这里,我们开发并评估了一种用于加权蛋白质-蛋白质相互作用网络上社区检测的强化学习管道,以检测新的蛋白质复合物。该算法经过训练,可以计算在网络上行走时遇到的不同子图的价值,以重建已知的复合物。然后,分布式预测算法将 RL 管道扩展到大型 PPI 网络上搜索新的蛋白质复合物。

结果

强化学习管道应用于由 8k 种蛋白质和 60k 个 PPI 组成的人类 PPI 网络,结果得到 1157 个蛋白质复合物。该方法与之前的算法相比,具有竞争力的准确性和更快的速度。我们突出了一些蛋白质复合物,如 C4orf19、C18orf21 和 KIAA1522,它们目前的特征最少。此外,结果表明 TMC04 可能是 KICSTOR 复合物的另一个假定亚基,并通过 3D 结构建模证实了 C15orf41 与 HIRA、CDAN1、ASF1A 更高阶复合物的参与。

结论

强化学习为社区检测提供了几个明显的优势,包括可扩展性和定义这些社区的行走轨迹的知识。将其应用于当前可用的人类蛋白质相互作用网络,该方法与其他算法具有相当的准确性,并且在计算时间上有显著的节省,进而对几个未表征的人类蛋白质的蛋白质功能和相互作用做出了明确的预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/9f8588cbc7dd/12859_2023_5425_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/d28061611cfb/12859_2023_5425_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/1f21144a6fcb/12859_2023_5425_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/9c8cc186459a/12859_2023_5425_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/86a1eb6680fe/12859_2023_5425_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/8089189a8936/12859_2023_5425_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/cda6ab214dce/12859_2023_5425_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/3cb84cdb24d3/12859_2023_5425_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/13a410cfb259/12859_2023_5425_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/aa93ae4b214d/12859_2023_5425_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/cb8815cd4d2f/12859_2023_5425_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/9f8588cbc7dd/12859_2023_5425_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/d28061611cfb/12859_2023_5425_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/1f21144a6fcb/12859_2023_5425_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/9c8cc186459a/12859_2023_5425_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/86a1eb6680fe/12859_2023_5425_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/8089189a8936/12859_2023_5425_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/cda6ab214dce/12859_2023_5425_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/3cb84cdb24d3/12859_2023_5425_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/13a410cfb259/12859_2023_5425_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/aa93ae4b214d/12859_2023_5425_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/cb8815cd4d2f/12859_2023_5425_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3def/10394916/9f8588cbc7dd/12859_2023_5425_Fig11_HTML.jpg

相似文献

1
Molecular complex detection in protein interaction networks through reinforcement learning.通过强化学习在蛋白质相互作用网络中检测分子复合物。
BMC Bioinformatics. 2023 Aug 2;24(1):306. doi: 10.1186/s12859-023-05425-7.
2
Short-Term Memory Impairment短期记忆障碍
3
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
4
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
5
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
6
Sexual Harassment and Prevention Training性骚扰与预防培训
7
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.
8
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
9
Patient buy-in to social prescribing through link workers as part of person-centred care: a realist evaluation.患者通过联络人员接受社会处方作为以患者为中心的护理的一部分:一项现实主义评价。
Health Soc Care Deliv Res. 2024 Sep 25:1-17. doi: 10.3310/ETND8254.
10
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.

引用本文的文献

1
Sensitivity analysis on protein-protein interaction networks through deep graph networks.通过深度图网络对蛋白质-蛋白质相互作用网络进行敏感性分析。
BMC Bioinformatics. 2025 May 8;26(1):124. doi: 10.1186/s12859-025-06140-1.
2
FBN2 promotes the proliferation, mineralization, and differentiation of osteoblasts to accelerate fracture healing.FBN2促进成骨细胞的增殖、矿化和分化,以加速骨折愈合。
Sci Rep. 2025 Feb 9;15(1):4843. doi: 10.1038/s41598-025-89215-6.
3
NHSL3 controls single and collective cell migration through two distinct mechanisms.

本文引用的文献

1
Towards a structurally resolved human protein interaction network.构建具有结构解析的人类蛋白质相互作用网络
Nat Struct Mol Biol. 2023 Feb;30(2):216-225. doi: 10.1038/s41594-022-00910-8. Epub 2023 Jan 23.
2
Predicting the structure of large protein complexes using AlphaFold and Monte Carlo tree search.使用 AlphaFold 和蒙特卡罗树搜索预测大型蛋白质复合物的结构。
Nat Commun. 2022 Oct 12;13(1):6028. doi: 10.1038/s41467-022-33729-4.
3
Detecting protein complexes with multiple properties by an adaptive harmony search algorithm.
NHSL3 通过两种不同机制控制单个细胞和集体细胞迁移。
Nat Commun. 2025 Jan 2;16(1):205. doi: 10.1038/s41467-024-55647-3.
4
Revolutionizing Molecular Design for Innovative Therapeutic Applications through Artificial Intelligence.通过人工智能为创新治疗应用彻底改变分子设计。
Molecules. 2024 Sep 29;29(19):4626. doi: 10.3390/molecules29194626.
采用自适应和声搜索算法探测具有多种特性的蛋白质复合物。
BMC Bioinformatics. 2022 Oct 7;23(1):414. doi: 10.1186/s12859-022-04923-4.
4
ColabFold: making protein folding accessible to all.ColabFold:让蛋白质折叠变得人人可用。
Nat Methods. 2022 Jun;19(6):679-682. doi: 10.1038/s41592-022-01488-1. Epub 2022 May 30.
5
Understudied proteins: opportunities and challenges for functional proteomics.研究不足的蛋白质:功能蛋白质组学面临的机遇与挑战
Nat Methods. 2022 Jul;19(7):774-779. doi: 10.1038/s41592-022-01454-x.
6
Down-regulated C4orf19 confers poor prognosis in colon adenocarcinoma identified by gene co-expression network.通过基因共表达网络鉴定,下调的C4orf19预示着结肠腺癌的不良预后。
J Cancer. 2022 Jan 9;13(4):1145-1159. doi: 10.7150/jca.63635. eCollection 2022.
7
OpenCell: Endogenous tagging for the cartography of human cellular organization.OpenCell:用于人类细胞组织图谱绘制的内源性标记。
Science. 2022 Mar 11;375(6585):eabi6983. doi: 10.1126/science.abi6983.
8
Super.Complex: A supervised machine learning pipeline for molecular complex detection in protein-interaction networks.超级复合物:用于蛋白质相互作用网络中分子复合物检测的有监督机器学习管道。
PLoS One. 2021 Dec 31;16(12):e0262056. doi: 10.1371/journal.pone.0262056. eCollection 2021.
9
Pdcd10-Stk24/25 complex controls kidney water reabsorption by regulating Aqp2 membrane targeting.Pdcd10-Stk24/25 复合物通过调节 Aqp2 膜靶向控制肾脏水重吸收。
JCI Insight. 2021 Jun 22;6(12):e142838. doi: 10.1172/jci.insight.142838.
10
hu.MAP 2.0: integration of over 15,000 proteomic experiments builds a global compendium of human multiprotein assemblies.hu.MAP 2.0:整合超过 15000 项蛋白质组学实验,构建了人类多蛋白复合物的全球总览。
Mol Syst Biol. 2021 May;17(5):e10016. doi: 10.15252/msb.202010016.