• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

增强型和通过编程优化的大语言模型提示可减少化学幻觉。

Augmented and Programmatically Optimized LLM Prompts Reduce Chemical Hallucinations.

作者信息

Reed Scott M

机构信息

Department of Chemistry, University of Colorado Denver, 1151 Arapahoe St., Denver, Colorado 80217-3364, United States.

出版信息

J Chem Inf Model. 2025 May 12;65(9):4274-4280. doi: 10.1021/acs.jcim.4c02322. Epub 2025 Apr 22.

DOI:10.1021/acs.jcim.4c02322
PMID:40262168
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12076503/
Abstract

Utilizing large Language models (LLMs) for handling scientific information comes with risk of the outputs not matching expectations, commonly called hallucinations. To fully utilize LLMs in research requires improving their accuracy, avoiding hallucinations, and extending their scope to research topics outside their direct training. There is also a benefit to getting the most accurate information from an LLM at the time of inference without having to create and train custom new models for each application. Here, augmented generation and machine learning-driven prompt optimization are combined to extract performance improvements over base LLM function on a common chemical research task. Specifically, an LLM was used to predict the topological polar surface area (TPSA) of molecules. By using augmented generation and machine learning-optimized prompts, the error in the prediction was reduced to 11.76 root-mean-squared error (RMSE) from 62.34 RMSE with direct calls to the same LLM.

摘要

利用大语言模型(LLMs)处理科学信息存在输出与预期不符的风险,通常称为幻觉。要在研究中充分利用大语言模型,需要提高其准确性、避免幻觉,并将其范围扩展到直接训练之外的研究主题。在推理时从大语言模型获取最准确的信息也有好处,而不必为每个应用创建和训练定制的新模型。在这里,增强生成和机器学习驱动的提示优化相结合,以在一个常见的化学研究任务上提取比基础大语言模型功能更高的性能提升。具体来说,使用一个大语言模型来预测分子的拓扑极性表面积(TPSA)。通过使用增强生成和机器学习优化的提示,预测误差从直接调用同一大语言模型时的62.34均方根误差(RMSE)降至11.76 RMSE。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95d5/12076503/196bac9647fa/ci4c02322_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95d5/12076503/979ef132d33e/ci4c02322_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95d5/12076503/72c66ef1dbf6/ci4c02322_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95d5/12076503/cbbd6969d449/ci4c02322_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95d5/12076503/196bac9647fa/ci4c02322_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95d5/12076503/979ef132d33e/ci4c02322_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95d5/12076503/72c66ef1dbf6/ci4c02322_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95d5/12076503/cbbd6969d449/ci4c02322_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95d5/12076503/196bac9647fa/ci4c02322_0004.jpg

相似文献

1
Augmented and Programmatically Optimized LLM Prompts Reduce Chemical Hallucinations.增强型和通过编程优化的大语言模型提示可减少化学幻觉。
J Chem Inf Model. 2025 May 12;65(9):4274-4280. doi: 10.1021/acs.jcim.4c02322. Epub 2025 Apr 22.
2
Information extraction of UV-NIR spectral data in waste water based on Large Language Model.基于大语言模型的废水中紫外-近红外光谱数据信息提取
Spectrochim Acta A Mol Biomol Spectrosc. 2024 Oct 5;318:124475. doi: 10.1016/j.saa.2024.124475. Epub 2024 May 17.
3
Empowering large language models for automated clinical assessment with generation-augmented retrieval and hierarchical chain-of-thought.通过生成增强检索和分层思维链赋能大型语言模型进行自动化临床评估。
Artif Intell Med. 2025 Apr;162:103078. doi: 10.1016/j.artmed.2025.103078. Epub 2025 Feb 12.
4
Optimizing biomedical information retrieval with a keyword frequency-driven prompt enhancement strategy.基于关键词频率驱动的提示增强策略优化生物医学信息检索
BMC Bioinformatics. 2024 Aug 27;25(1):281. doi: 10.1186/s12859-024-05902-7.
5
Improving the use of LLMs in radiology through prompt engineering: from precision prompts to zero-shot learning.通过提示工程提高放射科对大语言模型的使用:从精准提示到零样本学习。
Rofo. 2024 Nov;196(11):1166-1170. doi: 10.1055/a-2264-5631. Epub 2024 Feb 26.
6
Distilling the knowledge from large-language model for health event prediction.从大语言模型中提取知识用于健康事件预测。
Sci Rep. 2024 Dec 28;14(1):30675. doi: 10.1038/s41598-024-75331-2.
7
Leveraging Large Language Models (LLM) for the Plastic Surgery Resident Training: Do They Have a Role?利用大语言模型进行整形外科住院医师培训:它们能发挥作用吗?
Indian J Plast Surg. 2023 Aug 28;56(5):413-420. doi: 10.1055/s-0043-1772704. eCollection 2023 Oct.
8
Application of NotebookLM, a large language model with retrieval-augmented generation, for lung cancer staging.具有检索增强生成功能的大型语言模型NotebookLM在肺癌分期中的应用。
Jpn J Radiol. 2025 Apr;43(4):706-712. doi: 10.1007/s11604-024-01705-1. Epub 2024 Nov 25.
9
Large language models for conducting systematic reviews: on the rise, but not yet ready for use-a scoping review.用于进行系统评价的大型语言模型:正在兴起,但尚未准备好投入使用——一项范围综述
J Clin Epidemiol. 2025 May;181:111746. doi: 10.1016/j.jclinepi.2025.111746. Epub 2025 Feb 26.
10
Steering veridical large language model analyses by correcting and enriching generated database queries: first steps toward ChatGPT bioinformatics.通过纠正和丰富生成的数据库查询来引导真实的大语言模型分析:迈向ChatGPT生物信息学的第一步。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf045.

本文引用的文献

1
DrugAssist: a large language model for molecule optimization.DrugAssist:用于分子优化的大型语言模型。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae693.
2
Extracting structured data from organic synthesis procedures using a fine-tuned large language model.使用微调的大语言模型从有机合成程序中提取结构化数据。
Digit Discov. 2024 Jul 31;3(9):1822-1831. doi: 10.1039/d4dd00091a. eCollection 2024 Sep 11.
3
Fine-tuning large language models for chemical text mining.针对化学文本挖掘对大语言模型进行微调。
Chem Sci. 2024 Jun 7;15(27):10600-10611. doi: 10.1039/d4sc00924j. eCollection 2024 Jul 10.
4
Augmenting large language models with chemistry tools.用化学工具增强大语言模型。
Nat Mach Intell. 2024;6(5):525-535. doi: 10.1038/s42256-024-00832-8. Epub 2024 May 8.
5
Assessment of chemistry knowledge in large language models that generate code.对生成代码的大语言模型中的化学知识评估。
Digit Discov. 2023 Jan 26;2(2):368-376. doi: 10.1039/d2dd00087c. eCollection 2023 Apr 11.
6
Prediction of placental barrier permeability: a model based on partial least squares variable selection procedure.胎盘屏障通透性的预测:基于偏最小二乘变量选择程序的模型
Molecules. 2015 May 7;20(5):8270-86. doi: 10.3390/molecules20058270.
7
Medicinal chemical properties of successful central nervous system drugs.成功的中枢神经系统药物的药用化学性质。
NeuroRx. 2005 Oct;2(4):541-53. doi: 10.1602/neurorx.2.4.541.
8
Fast calculation of molecular polar surface area as a sum of fragment-based contributions and its application to the prediction of drug transport properties.基于片段贡献之和的分子极性表面积快速计算及其在药物转运性质预测中的应用。
J Med Chem. 2000 Oct 5;43(20):3714-7. doi: 10.1021/jm000942e.