Tokyo Institute of Technology, Institute of Innovative Research, Yokohama, Kanagawa, Japan.
PLoS One. 2018 Jun 14;13(6):e0198475. doi: 10.1371/journal.pone.0198475. eCollection 2018.
Recent studies on machine learning technology have reported successful performances in some visual and auditory recognition tasks, while little has been reported in the field of olfaction. In this paper we report computational methods to predict the odor impression of a chemical from its physicochemical properties. Our predictive model utilizes nonlinear dimensionality reduction on mass spectra data and performs the clustering of descriptors by natural language processing. Sensory evaluation is widely used to measure human impressions to smell or taste by using verbal descriptors, such as "spicy" and "sweet". However, as it requires significant amounts of time and human resources, a large-scale sensory evaluation test is difficult to perform. Our model successfully predicts a group of descriptors for a target chemical through a series of computer simulations. Although the training text data used in the language modeling is not specialized for olfaction, the experimental results show that our method is useful for analyzing sensory datasets. This is the first report to combine machine olfaction with natural language processing for odor character prediction.
最近的机器学习技术研究在一些视觉和听觉识别任务中取得了成功的表现,而在嗅觉领域则鲜有报道。本文报告了从化学物质的物理化学性质预测其气味印象的计算方法。我们的预测模型利用质谱数据的非线性降维和自然语言处理对描述符进行聚类。感官评价广泛用于通过使用口头描述符(如“辣”和“甜”)来测量人类对气味或味道的印象。然而,由于它需要大量的时间和人力资源,因此很难进行大规模的感官评价测试。我们的模型通过一系列计算机模拟成功地预测了目标化学物质的一组描述符。尽管语言模型中使用的训练文本数据不是专门针对嗅觉的,但实验结果表明,我们的方法对于分析感官数据集是有用的。这是首次将机器嗅觉与自然语言处理相结合用于气味特征预测的报告。