Keller Andreas, Gerkin Richard C, Guan Yuanfang, Dhurandhar Amit, Turu Gabor, Szalai Bence, Mainland Joel D, Ihara Yusuke, Yu Chung Wen, Wolfinger Russ, Vens Celine, Schietgat Leander, De Grave Kurt, Norel Raquel, Stolovitzky Gustavo, Cecchi Guillermo A, Vosshall Leslie B, Meyer Pablo
Laboratory of Neurogenetics and Behavior, The Rockefeller University, New York, NY 10065, USA.
School of Life Sciences, Arizona State University, Tempe, AZ 85281, USA.
Science. 2017 Feb 24;355(6327):820-826. doi: 10.1126/science.aal2014. Epub 2017 Feb 20.
It is still not possible to predict whether a given molecule will have a perceived odor or what olfactory percept it will produce. We therefore organized the crowd-sourced DREAM Olfaction Prediction Challenge. Using a large olfactory psychophysical data set, teams developed machine-learning algorithms to predict sensory attributes of molecules based on their chemoinformatic features. The resulting models accurately predicted odor intensity and pleasantness and also successfully predicted 8 among 19 rated semantic descriptors ("garlic," "fish," "sweet," "fruit," "burnt," "spices," "flower," and "sour"). Regularized linear models performed nearly as well as random forest-based ones, with a predictive accuracy that closely approaches a key theoretical limit. These models help to predict the perceptual qualities of virtually any molecule with high accuracy and also reverse-engineer the smell of a molecule.
目前仍无法预测一个给定的分子是否会有可感知的气味,或者它会产生何种嗅觉感知。因此,我们组织了众包的DREAM嗅觉预测挑战赛。利用一个大型嗅觉心理物理学数据集,各团队开发了机器学习算法,以根据分子的化学信息特征预测其感官属性。所得模型准确地预测了气味强度和愉悦度,还成功预测了19个评级语义描述符中的8个(“大蒜味”“鱼腥味”“甜味”“水果味”“焦糊味”“香料味”“花香味”和“酸味”)。正则化线性模型的表现与基于随机森林的模型相近,其预测准确率接近一个关键的理论极限。这些模型有助于高精度地预测几乎任何分子的感知特性,还能逆向设计分子的气味。