Department of Pharmaceutical Sciences, University at Buffalo, The State University of New York, Buffalo, NY, 14214-8033, USA.
J Pharmacokinet Pharmacodyn. 2024 Jun;51(3):187-197. doi: 10.1007/s10928-024-09921-y. Epub 2024 Apr 24.
To assess ChatGPT 4.0 (ChatGPT) and Gemini Ultra 1.0 (Gemini) large language models on NONMEM coding tasks relevant to pharmacometrics and clinical pharmacology. ChatGPT and Gemini were assessed on tasks mimicking real-world applications of NONMEM. The tasks ranged from providing a curriculum for learning NONMEM and an overview of NONMEM code structure to generating code. Lay-language prompts were used to elicit NONMEM code for a linear pharmacokinetic (PK) model with oral administration and for a more complex model with two parallel first-order absorption mechanisms. Reproducibility and the impact of "temperature" hyperparameter settings were assessed. The code was reviewed by two NONMEM experts. ChatGPT and Gemini provided NONMEM curriculum structures combining foundational knowledge with advanced concepts (e.g., covariate modeling and Bayesian approaches) and practical skills, including NONMEM code structure and syntax. ChatGPT provided an informative summary of the NONMEM control stream structure and outlined the key NONMEM Translator (NM-TRAN) records needed. ChatGPT and Gemini were able to generate code blocks for the NONMEM control stream from the lay-language prompts for the two coding tasks. The control streams contained focal structural and syntax errors that required revision before they could be executed without errors and warnings. The code output from ChatGPT and Gemini was not reproducible, and varying the temperature hyperparameter did not reduce the errors and omissions substantively. Large language models may be useful in pharmacometrics for efficiently generating an initial coding template for modeling projects. However, the output can contain errors and omissions that require correction.
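For context, the first coding task (a linear PK model with first-order oral absorption) corresponds to the kind of minimal control stream sketched below. This is an illustrative sketch, not the reference code used in the study; the data file name, column layout, and initial estimates are assumptions.

```nonmem
$PROBLEM One-compartment linear PK model with first-order oral absorption
$DATA data.csv IGNORE=@          ; hypothetical dataset
$INPUT ID TIME AMT DV MDV EVID   ; assumed column order
$SUBROUTINES ADVAN2 TRANS2       ; built-in 1-cmt model with depot
$PK
  KA = THETA(1)*EXP(ETA(1))      ; absorption rate constant
  CL = THETA(2)*EXP(ETA(2))      ; clearance
  V  = THETA(3)*EXP(ETA(3))      ; central volume
  S2 = V                         ; scale central compartment
$ERROR
  IPRED = F
  Y = IPRED*(1 + EPS(1))         ; proportional residual error
$THETA
  (0, 1)   ; KA (1/h)
  (0, 5)   ; CL (L/h)
  (0, 50)  ; V (L)
$OMEGA 0.1 0.1 0.1
$SIGMA 0.04
$ESTIMATION METHOD=1 INTER MAXEVAL=9999 PRINT=5
$TABLE ID TIME IPRED DV NOPRINT FILE=sdtab
```

A control stream of this shape, with correct NM-TRAN records and ADVAN/TRANS selection, is what the lay-language prompts were intended to elicit.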
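The second, more complex task (two parallel first-order absorption mechanisms) can be coded in several ways; one common approach uses the general linear model with two depot compartments, a bioavailability split, and a lag on the second pathway. The sketch below is an assumption about how such a model might be written, not the study's reference solution; it presumes a dataset with duplicated dose records assigned to CMT=1 and CMT=2.

```nonmem
$PROBLEM Linear PK with two parallel first-order absorption pathways
$DATA data.csv IGNORE=@                ; hypothetical dataset, doses duplicated to CMT 1 and 2
$INPUT ID TIME AMT CMT DV MDV EVID
$SUBROUTINES ADVAN5                    ; general linear model
$MODEL COMP=(DEPOT1,DEFDOSE) COMP=(DEPOT2) COMP=(CENTRAL,DEFOBS)
$PK
  KA1   = THETA(1)*EXP(ETA(1))         ; fast absorption pathway
  KA2   = THETA(2)*EXP(ETA(2))         ; slow absorption pathway
  CL    = THETA(3)*EXP(ETA(3))
  V     = THETA(4)*EXP(ETA(4))
  FR    = THETA(5)                     ; fraction absorbed via pathway 1
  F1    = FR                           ; split the dose between depots
  F2    = 1 - FR
  ALAG2 = THETA(6)                     ; lag time for the second pathway
  K13   = KA1                          ; depot 1 -> central
  K23   = KA2                          ; depot 2 -> central
  K30   = CL/V                         ; elimination from central
  S3    = V
$ERROR
  IPRED = F
  Y = IPRED*(1 + EPS(1))
$THETA
  (0, 1)        ; KA1 (1/h)
  (0, 0.2)      ; KA2 (1/h)
  (0, 5)        ; CL (L/h)
  (0, 50)       ; V (L)
  (0, 0.5, 1)   ; FR (bounded fraction)
  (0, 1)        ; ALAG2 (h)
$OMEGA 0.1 0.1 0.1 0.1
$SIGMA 0.04
$ESTIMATION METHOD=1 INTER MAXEVAL=9999 PRINT=5
```

Getting the $MODEL record, the bioavailability split, and the rate-constant naming (K13, K23, K30) consistent is exactly the kind of focal structural detail the abstract reports the LLM-generated control streams got wrong.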