Zimmermann Yoel, Sieben Leif, Seng Henrik, Pestlin Philipp, Görlich Franz
Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich, Switzerland.
Kvant AI Labs, Zurich, Switzerland.
NPJ Sci Food. 2025 Jul 5;9(1):122. doi: 10.1038/s41538-025-00474-z.
Determining molecular taste remains a significant challenge in food science. Here, we present FART (Flavor Analysis and Recognition Transformer), a chemical language model capable of predicting molecular taste from chemical structure. Trained on the largest public dataset (15,025 compounds) of molecular tastants to date, FART is the first model capable of parallel predictions across four taste categories: sweet, bitter, sour, and umami. FART achieves an accuracy above 91% for parallel taste prediction and outperforms previous state-of-the-art binary classifier models that specialize on predicting one taste class. Its transformer architecture allows for interpretability through gradient-based visualization of molecular features. The model identifies key structural elements driving taste properties and demonstrates utility in analyzing known tastants as well as novel compounds. By releasing both the model and dataset, we equip the food science community with tools for rapid taste prediction, accelerating flavor compound development and enabling systematic exploration of taste chemistry.
确定分子味觉仍然是食品科学中的一项重大挑战。在此,我们展示了FART(风味分析与识别变压器模型),这是一种能够从化学结构预测分子味觉的化学语言模型。在迄今为止最大的分子味觉剂公共数据集(15025种化合物)上进行训练后,FART是首个能够对甜、苦、酸、鲜四种味觉类别进行并行预测的模型。FART在并行味觉预测方面的准确率超过91%,优于之前专门预测单一味觉类别的最先进二元分类器模型。其变压器架构允许通过基于梯度的分子特征可视化来实现可解释性。该模型识别出驱动味觉特性的关键结构元素,并在分析已知味觉剂和新型化合物方面展现出实用性。通过发布模型和数据集,我们为食品科学界提供了快速味觉预测工具,加速风味化合物开发,并实现对味觉化学的系统探索。