Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Science, Jilin University, 2699 Qianjin Street, Changchun 130012, China.
Department of Electrical Engineering and Computer Science, Bond Life Sciences Center, University of Missouri, Columbia, Missouri 65211, United States.
J Chem Inf Model. 2024 May 27;64(10):4102-4111. doi: 10.1021/acs.jcim.4c00681. Epub 2024 May 7.
The perception of bitter and sweet tastes is a crucial aspect of human sensory experience. Concerns over the long-term use of aspartame, a widely used sweetener suspected of carcinogenic risks, highlight the importance of developing new taste modifiers. This study utilizes Large Language Models (LLMs) such as GPT-3.5 and GPT-4 for predicting molecular taste characteristics, with a focus on the bitter-sweet dichotomy. Employing random and scaffold data splitting strategies, GPT-4 demonstrated superior performance, achieving an impressive 86% accuracy under scaffold partitioning. Additionally, ChatGPT was employed to extract specific molecular features associated with bitter and sweet tastes. Utilizing these insights, novel molecular compounds with distinct taste profiles were successfully generated. These compounds were validated for their bitter and sweet properties through molecular docking and molecular dynamics simulation, and their practicality was further confirmed by ADMET toxicity testing and DeepSA synthesis feasibility. This research highlights the potential of LLMs in predicting molecular properties and their implications in health and chemical science.
苦味和甜味的感知是人类感官体验的一个关键方面。由于担心长期使用阿斯巴甜(一种广泛使用的甜味剂,被怀疑有致癌风险),因此开发新的味觉修饰剂显得尤为重要。本研究利用 GPT-3.5 和 GPT-4 等大型语言模型(LLMs)来预测分子味觉特征,重点关注苦甜二分法。采用随机和支架数据拆分策略,GPT-4 表现出卓越的性能,在支架拆分下达到令人印象深刻的 86%的准确率。此外,ChatGPT 还被用于提取与苦甜味道相关的特定分子特征。利用这些见解,成功生成了具有独特味道特征的新型分子化合物。通过分子对接和分子动力学模拟对这些化合物的苦味和甜味特性进行了验证,并通过 ADMET 毒性测试和 DeepSA 合成可行性进一步确认了它们的实用性。这项研究强调了 LLM 在预测分子性质方面的潜力及其在健康和化学科学中的应用。