• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多尺度概率建模:一种利用结合亲和力的机器学习预测增强细胞信号传导机制模型的贝叶斯方法。

Multiscale Probabilistic Modeling: A Bayesian Approach to Augment Mechanistic Models of Cell Signaling with Machine-Learning Predictions of Binding Affinity.

作者信息

Huber Holly A, Finley Stacey D

出版信息

bioRxiv. 2025 Jul 9:2025.05.23.655795. doi: 10.1101/2025.05.23.655795.

DOI:10.1101/2025.05.23.655795
PMID:40672255
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12265541/
Abstract

UNLABELLED

Computational models in systems biology are often underdetermined-that is, there is little data relative to the complexity and size of the model. The lack of data is primarily due to limits in our ability to observe specific biological systems and restricts the utility of computational models. However, there are a growing number of experimental databases in biology. While these databases provide more observations, they often do not have observations that match the system of interest exactly. For example, database measurements might be collected at different experimental conditions or on a different scale compared to the system of interest. Here, we investigate what information can be gleaned from generalizing databases across these differences in the context of modeling a specific system - cell signaling. Ultimately, our goal is to better determine models of specific systems, thereby increasing their utility. To do this, we propose a novel, multiscale, probabilistic framework. We use this framework to integrate measurements of protein structure from the Protein Data Bank and measurements of amino acid sequence from the Universal Protein Resource into the parameter inference of cell signaling models. Then, we quantify exactly what information is gained from these measurements when modeling cell signaling. We choose to investigate the utility of these databases in the context of dynamic cell signaling models because experimental measurements of the variables of interest, protein dynamics, are still quite limited. We find that we can successfully integrate measurements from these databases to significantly improve parameter estimation of signaling models. The impact of sequence and structure measurements on model predictions depends on the sensitivity of the prediction to perturbations in the parameter values. Overall, this study demonstrates that measurements of protein structure and amino acid sequence can be leveraged to better inform parameters in models of cell signaling.

AUTHOR SUMMARY

Computational models of cell signaling have provided mechanistic insights into complex biological systems, including in physiological and disease settings. Accurate and predictive modeling critically depends on the precise estimation of model parameters, which is often hindered by the limited availability of experimental data. In this study, we present a novel multiscale probabilistic inference framework that broadens the scope of data types that can be leveraged for parameter estimation for models of cell signaling. The framework integrates a machine learning pipeline with a generalizable parameter inference approach, enabling the use of experimental data across scales. Specifically, we demonstrate that incorporating protein amino acid sequence and 3D structural data enhances parameter estimation compared to traditional measurements such as protein concentrations over time. Improving parameter estimation increases the robustness and applicability of cell signaling models. Ultimately, our framework facilitates use of a broader range of data and supports the development of predictive computational models that increase our understanding of cell signaling.

摘要

未标注

系统生物学中的计算模型常常是欠定的,也就是说,相对于模型的复杂性和规模而言,数据很少。数据的缺乏主要是由于我们观察特定生物系统的能力有限,这限制了计算模型的实用性。然而,生物学中的实验数据库数量在不断增加。虽然这些数据库提供了更多的观测数据,但它们通常没有与感兴趣的系统完全匹配的观测数据。例如,与感兴趣的系统相比,数据库测量可能是在不同的实验条件下或不同的尺度上收集的。在这里,我们研究在对特定系统——细胞信号传导进行建模的背景下,从跨越这些差异的数据库泛化中可以收集到哪些信息。最终,我们的目标是更好地确定特定系统的模型,从而提高其效用。为此,我们提出了一个新颖的、多尺度的概率框架。我们使用这个框架将来自蛋白质数据库的蛋白质结构测量数据和来自通用蛋白质资源的氨基酸序列测量数据整合到细胞信号传导模型的参数推断中。然后,我们精确量化在对细胞信号传导进行建模时从这些测量中获得了哪些信息。我们选择在动态细胞信号传导模型的背景下研究这些数据库的效用,因为对感兴趣的变量——蛋白质动力学的实验测量仍然非常有限。我们发现我们可以成功地整合来自这些数据库的测量数据,以显著改善信号传导模型的参数估计。序列和结构测量对模型预测的影响取决于预测对参数值扰动的敏感性。总体而言,这项研究表明蛋白质结构和氨基酸序列的测量可以用来更好地为细胞信号传导模型中的参数提供信息。

作者总结

细胞信号传导的计算模型为包括生理和疾病环境在内的复杂生物系统提供了机制性见解。准确且具有预测性的建模关键取决于模型参数的精确估计,而这常常受到实验数据有限可用性的阻碍。在这项研究中,我们提出了一个新颖的多尺度概率推断框架,该框架拓宽了可用于细胞信号传导模型参数估计的数据类型范围。该框架将机器学习管道与一种可泛化的参数推断方法相结合,能够跨尺度使用实验数据。具体而言,我们证明与传统测量(如随时间变化的蛋白质浓度)相比,纳入蛋白质氨基酸序列和三维结构数据可增强参数估计。改进参数估计可提高细胞信号传导模型的稳健性和适用性。最终,我们的框架有助于使用更广泛的数据范围,并支持开发能够增进我们对细胞信号传导理解的预测性计算模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/4cd0e9bab07a/nihpp-2025.05.23.655795v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/3e01e36f1abd/nihpp-2025.05.23.655795v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/e8f21bdc2d50/nihpp-2025.05.23.655795v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/46bf19e6e637/nihpp-2025.05.23.655795v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/aaac34db9adf/nihpp-2025.05.23.655795v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/23c52e596819/nihpp-2025.05.23.655795v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/1149a6c9be93/nihpp-2025.05.23.655795v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/4cd0e9bab07a/nihpp-2025.05.23.655795v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/3e01e36f1abd/nihpp-2025.05.23.655795v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/e8f21bdc2d50/nihpp-2025.05.23.655795v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/46bf19e6e637/nihpp-2025.05.23.655795v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/aaac34db9adf/nihpp-2025.05.23.655795v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/23c52e596819/nihpp-2025.05.23.655795v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/1149a6c9be93/nihpp-2025.05.23.655795v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b2d/12265541/4cd0e9bab07a/nihpp-2025.05.23.655795v2-f0007.jpg

相似文献

1
Multiscale Probabilistic Modeling: A Bayesian Approach to Augment Mechanistic Models of Cell Signaling with Machine-Learning Predictions of Binding Affinity.多尺度概率建模:一种利用结合亲和力的机器学习预测增强细胞信号传导机制模型的贝叶斯方法。
bioRxiv. 2025 Jul 9:2025.05.23.655795. doi: 10.1101/2025.05.23.655795.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.
4
Adapting Safety Plans for Autistic Adults with Involvement from the Autism Community.在自闭症群体的参与下为成年自闭症患者调整安全计划。
Autism Adulthood. 2025 May 28;7(3):293-302. doi: 10.1089/aut.2023.0124. eCollection 2025 Jun.
5
Autistic Students' Experiences of Employment and Employability Support while Studying at a UK University.自闭症学生在英国大学学习期间的就业经历及就业支持情况
Autism Adulthood. 2025 Apr 3;7(2):212-222. doi: 10.1089/aut.2024.0112. eCollection 2025 Apr.
6
Sexual Harassment and Prevention Training性骚扰与预防培训
7
"In a State of Flow": A Qualitative Examination of Autistic Adults' Phenomenological Experiences of Task Immersion.“心流状态”:对自闭症成年人任务沉浸现象学体验的质性研究
Autism Adulthood. 2024 Sep 16;6(3):362-373. doi: 10.1089/aut.2023.0032. eCollection 2024 Sep.
8
Short-Term Memory Impairment短期记忆障碍
9
Stigma Management Strategies of Autistic Social Media Users.自闭症社交媒体用户的污名管理策略
Autism Adulthood. 2025 May 28;7(3):273-282. doi: 10.1089/aut.2023.0095. eCollection 2025 Jun.
10
Interventions to reduce harm from continued tobacco use.减少持续吸烟危害的干预措施。
Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.

本文引用的文献

1
ProAffinity-GNN: A Novel Approach to Structure-Based Protein-Protein Binding Affinity Prediction via a Curated Data Set and Graph Neural Networks.ProAffinity-GNN:一种通过精心策划的数据集和图神经网络进行基于结构的蛋白质-蛋白质结合亲和力预测的新方法。
J Chem Inf Model. 2024 Dec 9;64(23):8796-8808. doi: 10.1021/acs.jcim.4c01850. Epub 2024 Nov 18.
2
DLKcat cannot predict meaningful values for mutants and unfamiliar enzymes.DLKcat无法预测突变体和不熟悉的酶的有意义的值。
Biol Methods Protoc. 2024 Aug 24;9(1):bpae061. doi: 10.1093/biomethods/bpae061. eCollection 2024.
3
SHC1 serves as a prognostic and immunological biomarker in clear cell renal cell carcinoma: a comprehensive bioinformatics and experimental analysis.
SHC1 作为透明细胞肾细胞癌的预后和免疫生物标志物:全面的生物信息学和实验分析。
Sci Rep. 2024 Aug 30;14(1):20150. doi: 10.1038/s41598-024-70897-3.
4
Perspectives on computational modeling of biological systems and the significance of the SysMod community.生物系统计算建模的观点及系统建模社区的重要性。
Bioinform Adv. 2024 Jun 26;4(1):vbae090. doi: 10.1093/bioadv/vbae090. eCollection 2024.
5
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
6
Computational Approaches to Predict Protein-Protein Interactions in Crowded Cellular Environments.计算方法在拥挤细胞环境中预测蛋白质-蛋白质相互作用。
Chem Rev. 2024 Apr 10;124(7):3932-3977. doi: 10.1021/acs.chemrev.3c00550. Epub 2024 Mar 27.
7
From Average Transient Transporter Currents to Microscopic Mechanism─A Bayesian Analysis.从平均瞬时转运体电流到微观机制——贝叶斯分析。
J Phys Chem B. 2024 Feb 29;128(8):1830-1842. doi: 10.1021/acs.jpcb.3c07025. Epub 2024 Feb 19.
8
Systematic investigation of machine learning on limited data: A study on predicting protein-protein binding strength.对有限数据上机器学习的系统研究:一项预测蛋白质-蛋白质结合强度的研究。
Comput Struct Biotechnol J. 2023 Dec 20;23:460-472. doi: 10.1016/j.csbj.2023.12.018. eCollection 2024 Dec.
9
UniKP: a unified framework for the prediction of enzyme kinetic parameters.UniKP:一种用于预测酶动力学参数的统一框架。
Nat Commun. 2023 Dec 11;14(1):8211. doi: 10.1038/s41467-023-44113-1.
10
Systematic Bayesian posterior analysis guided by Kullback-Leibler divergence facilitates hypothesis formation.基于 KL 散度的系统贝叶斯后验分析有助于假说形成。
J Theor Biol. 2023 Feb 7;558:111341. doi: 10.1016/j.jtbi.2022.111341. Epub 2022 Nov 3.