• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大语言模型生成的信号网络基准测试。

Benchmarking of signaling networks generated by large language models.

作者信息

Tewari Jeevan, Dahl Benjamin W, Saucerman Jeffrey J

机构信息

co-equal contributors.

Department of Biomedical Engineering, University of Virginia.

出版信息

bioRxiv. 2025 Jul 29:2025.07.28.667217. doi: 10.1101/2025.07.28.667217.

DOI:10.1101/2025.07.28.667217
PMID:40766413
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12324320/
Abstract

Computational models of signaling networks provide frameworks for predicting how molecular cues guide cell decisions. But they are typically limited by manual curation from incomplete literature. Here, we test whether general-purpose large language models (LLMs) generate accurate models of signaling networks. We find that general purpose LLMs generate 24-58% of the reactions of literature-curated networks for cardiomyocyte hypertrophy, myofibroblast activation, and mechano-signaling, and predicting network responses to perturbations with accuracies of 5-26%. While current general-purpose LLMs generate signaling networks with limited accuracy, this study provides a pipeline and benchmarks to guide future improvements.

摘要

信号网络的计算模型为预测分子信号如何引导细胞决策提供了框架。但它们通常受到来自不完整文献的人工整理的限制。在这里,我们测试通用大语言模型(LLMs)是否能生成准确的信号网络模型。我们发现,通用大语言模型生成了文献整理网络中24%-58%的心肌细胞肥大、肌成纤维细胞激活和机械信号反应,并以5%-26%的准确率预测网络对扰动的反应。虽然目前的通用大语言模型生成信号网络的准确性有限,但这项研究提供了一个流程和基准,以指导未来的改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a65a/12324320/959edb39f5e8/nihpp-2025.07.28.667217v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a65a/12324320/e18831d74842/nihpp-2025.07.28.667217v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a65a/12324320/959edb39f5e8/nihpp-2025.07.28.667217v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a65a/12324320/e18831d74842/nihpp-2025.07.28.667217v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a65a/12324320/959edb39f5e8/nihpp-2025.07.28.667217v1-f0002.jpg

相似文献

1
Benchmarking of signaling networks generated by large language models.大语言模型生成的信号网络基准测试。
bioRxiv. 2025 Jul 29:2025.07.28.667217. doi: 10.1101/2025.07.28.667217.
2
Performance of Large Language Models in Numerical Versus Semantic Medical Knowledge: Cross-Sectional Benchmarking Study on Evidence-Based Questions and Answers.大型语言模型在数值与语义医学知识方面的表现:基于循证问答的横断面基准研究
J Med Internet Res. 2025 Jul 14;27:e64452. doi: 10.2196/64452.
3
Generative AI/LLMs for Plain Language Medical Information for Patients, Caregivers and General Public: Opportunities, Risks and Ethics.用于为患者、护理人员和普通公众提供通俗易懂的医学信息的生成式人工智能/大型语言模型:机遇、风险与伦理
Patient Prefer Adherence. 2025 Jul 31;19:2227-2249. doi: 10.2147/PPA.S527922. eCollection 2025.
4
Implementing Large Language Models in Health Care: Clinician-Focused Review With Interactive Guideline.在医疗保健中应用大语言模型:以临床医生为重点的回顾与交互式指南
J Med Internet Res. 2025 Jul 11;27:e71916. doi: 10.2196/71916.
5
Use of Large Language Models to Classify Epidemiological Characteristics in Synthetic and Real-World Social Media Posts About Conjunctivitis Outbreaks: Infodemiology Study.利用大语言模型对合成及真实世界社交媒体上有关结膜炎爆发的帖子中的流行病学特征进行分类:信息流行病学研究
J Med Internet Res. 2025 Jul 2;27:e65226. doi: 10.2196/65226.
6
Stench of Errors or the Shine of Potential: The Challenge of (Ir)Responsible Use of ChatGPT in Speech-Language Pathology.错误的恶臭还是潜力的光辉:言语病理学中(不)负责任地使用ChatGPT的挑战。
Int J Lang Commun Disord. 2025 Jul-Aug;60(4):e70088. doi: 10.1111/1460-6984.70088.
7
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
8
Improving Large Language Models' Summarization Accuracy by Adding Highlights to Discharge Notes: Comparative Evaluation.通过在出院小结中添加重点内容提高大语言模型的总结准确性:比较评估
JMIR Med Inform. 2025 Jul 24;13:e66476. doi: 10.2196/66476.
9
A Weighted Voting Approach for Traditional Chinese Medicine Formula Classification Using Large Language Models: Algorithm Development and Validation Study.一种使用大语言模型的中医方剂分类加权投票方法:算法开发与验证研究
JMIR Med Inform. 2025 Jul 24;13:e69286. doi: 10.2196/69286.
10
The first step is the hardest: pitfalls of representing and tokenizing temporal data for large language models.第一步是最困难的:为大型语言模型表示和标记时间数据的陷阱。
J Am Med Inform Assoc. 2024 Sep 1;31(9):2151-2158. doi: 10.1093/jamia/ocae090.

本文引用的文献

1
Virtual Cell Challenge: Toward a Turing test for the virtual cell.虚拟细胞挑战:迈向虚拟细胞的图灵测试。
Cell. 2025 Jun 26;188(13):3370-3374. doi: 10.1016/j.cell.2025.06.008.
2
Logic-based modeling of biological networks with Netflux.使用Netflux对生物网络进行基于逻辑的建模。
PLoS Comput Biol. 2025 Apr 4;21(4):e1012864. doi: 10.1371/journal.pcbi.1012864. eCollection 2025 Apr.
3
The Evolution of Systems Biology and Systems Medicine: From Mechanistic Models to Uncertainty Quantification.系统生物学与系统医学的演进:从机理模型到不确定性量化
Annu Rev Biomed Eng. 2025 May;27(1):425-447. doi: 10.1146/annurev-bioeng-102723-065309. Epub 2025 Feb 19.
4
Toward expert-level medical question answering with large language models.迈向使用大语言模型实现专家级医学问答
Nat Med. 2025 Mar;31(3):943-950. doi: 10.1038/s41591-024-03423-7. Epub 2025 Jan 8.
5
How to build the virtual cell with artificial intelligence: Priorities and opportunities.如何利用人工智能构建虚拟细胞:优先事项与机遇
Cell. 2024 Dec 12;187(25):7045-7063. doi: 10.1016/j.cell.2024.11.015.
6
Toward a foundation model of causal cell and tissue biology with a Perturbation Cell and Tissue Atlas.用扰动细胞和组织图谱构建因果细胞和组织生物学的基础模型。
Cell. 2024 Aug 22;187(17):4520-4545. doi: 10.1016/j.cell.2024.07.035.
7
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
8
scGPT: toward building a foundation model for single-cell multi-omics using generative AI.scGPT:迈向使用生成式人工智能构建单细胞多组学基础模型
Nat Methods. 2024 Aug;21(8):1470-1480. doi: 10.1038/s41592-024-02201-0. Epub 2024 Feb 26.
9
Transfer learning enables predictions in network biology.迁移学习可实现网络生物学预测。
Nature. 2023 Jun;618(7965):616-624. doi: 10.1038/s41586-023-06139-9. Epub 2023 May 31.
10
Benchmarking of protein interaction databases for integration with manually reconstructed signalling network models.蛋白质相互作用数据库的基准测试,用于与人工重建的信号网络模型集成。
J Physiol. 2024 Sep;602(18):4529-4542. doi: 10.1113/JP284616. Epub 2023 May 30.