Suppr超能文献

一种用于分子味觉预测的化学语言模型。

A chemical language model for molecular taste prediction.

作者信息

Zimmermann Yoel, Sieben Leif, Seng Henrik, Pestlin Philipp, Görlich Franz

机构信息

Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich, Switzerland.

Kvant AI Labs, Zurich, Switzerland.

出版信息

NPJ Sci Food. 2025 Jul 5;9(1):122. doi: 10.1038/s41538-025-00474-z.

Abstract

Determining molecular taste remains a significant challenge in food science. Here, we present FART (Flavor Analysis and Recognition Transformer), a chemical language model capable of predicting molecular taste from chemical structure. Trained on the largest public dataset (15,025 compounds) of molecular tastants to date, FART is the first model capable of parallel predictions across four taste categories: sweet, bitter, sour, and umami. FART achieves an accuracy above 91% for parallel taste prediction and outperforms previous state-of-the-art binary classifier models that specialize on predicting one taste class. Its transformer architecture allows for interpretability through gradient-based visualization of molecular features. The model identifies key structural elements driving taste properties and demonstrates utility in analyzing known tastants as well as novel compounds. By releasing both the model and dataset, we equip the food science community with tools for rapid taste prediction, accelerating flavor compound development and enabling systematic exploration of taste chemistry.

摘要

确定分子味觉仍然是食品科学中的一项重大挑战。在此,我们展示了FART(风味分析与识别变压器模型),这是一种能够从化学结构预测分子味觉的化学语言模型。在迄今为止最大的分子味觉剂公共数据集(15025种化合物)上进行训练后,FART是首个能够对甜、苦、酸、鲜四种味觉类别进行并行预测的模型。FART在并行味觉预测方面的准确率超过91%,优于之前专门预测单一味觉类别的最先进二元分类器模型。其变压器架构允许通过基于梯度的分子特征可视化来实现可解释性。该模型识别出驱动味觉特性的关键结构元素,并在分析已知味觉剂和新型化合物方面展现出实用性。通过发布模型和数据集,我们为食品科学界提供了快速味觉预测工具,加速风味化合物开发,并实现对味觉化学的系统探索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a92/12228699/345c386f7443/41538_2025_474_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验