Suppr超能文献

探索苦与甜:大型语言模型在分子味觉预测中的应用。

Exploring Bitter and Sweet: The Application of Large Language Models in Molecular Taste Prediction.

机构信息

Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Science, Jilin University, 2699 Qianjin Street, Changchun 130012, China.

Department of Electrical Engineering and Computer Science, Bond Life Sciences Center, University of Missouri, Columbia, Missouri 65211, United States.

出版信息

J Chem Inf Model. 2024 May 27;64(10):4102-4111. doi: 10.1021/acs.jcim.4c00681. Epub 2024 May 7.

Abstract

The perception of bitter and sweet tastes is a crucial aspect of human sensory experience. Concerns over the long-term use of aspartame, a widely used sweetener suspected of carcinogenic risks, highlight the importance of developing new taste modifiers. This study utilizes Large Language Models (LLMs) such as GPT-3.5 and GPT-4 for predicting molecular taste characteristics, with a focus on the bitter-sweet dichotomy. Employing random and scaffold data splitting strategies, GPT-4 demonstrated superior performance, achieving an impressive 86% accuracy under scaffold partitioning. Additionally, ChatGPT was employed to extract specific molecular features associated with bitter and sweet tastes. Utilizing these insights, novel molecular compounds with distinct taste profiles were successfully generated. These compounds were validated for their bitter and sweet properties through molecular docking and molecular dynamics simulation, and their practicality was further confirmed by ADMET toxicity testing and DeepSA synthesis feasibility. This research highlights the potential of LLMs in predicting molecular properties and their implications in health and chemical science.

摘要

苦味和甜味的感知是人类感官体验的一个关键方面。由于担心长期使用阿斯巴甜(一种广泛使用的甜味剂,被怀疑有致癌风险),因此开发新的味觉修饰剂显得尤为重要。本研究利用 GPT-3.5 和 GPT-4 等大型语言模型(LLMs)来预测分子味觉特征,重点关注苦甜二分法。采用随机和支架数据拆分策略,GPT-4 表现出卓越的性能,在支架拆分下达到令人印象深刻的 86%的准确率。此外,ChatGPT 还被用于提取与苦甜味道相关的特定分子特征。利用这些见解,成功生成了具有独特味道特征的新型分子化合物。通过分子对接和分子动力学模拟对这些化合物的苦味和甜味特性进行了验证,并通过 ADMET 毒性测试和 DeepSA 合成可行性进一步确认了它们的实用性。这项研究强调了 LLM 在预测分子性质方面的潜力及其在健康和化学科学中的应用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验