Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures.

Affiliations

Department of Information and Communications Engineering, Tokyo Institute of Technology, Yokohama, Kanagawa, Japan.

Laboratory for Future Interdisciplinary Research in Science and Technology, Tokyo Institute of Technology, Yokohama, Kanagawa, Japan.

Publication information

PLoS One. 2020 Jun 19;15(6):e0234688. doi: 10.1371/journal.pone.0234688. eCollection 2020.

DOI:10.1371/journal.pone.0234688

PMID:32559255

Full text link: https://pmc.ncbi.nlm.nih.gov/articles/PMC7304616/

Abstract

Recent work has advanced the prediction of odor characteristics from the molecular structure parameters of chemicals. Such parameters are available for individual chemicals, but they cannot be used for chemical mixtures. This study presents a computational method for predicting human odor perception from the mass spectra of chemical mixtures such as essential oils. It also proposes a method for obtaining the similarity among odor descriptors even though the dataset contains only binary values. When a database lists a set of odor descriptors for each sample, only binary data are available, and the correlation between similar descriptors is lost. Prediction performance therefore degrades when the similarity among odor descriptors is not considered. Because the mass spectra dataset is high-dimensional, we use an auto-encoder to learn a compressed representation of the mass spectra of essential oils in its bottleneck hidden layer. We then perform hierarchical clustering to create groups of odor descriptors with similar odor impressions, using a matrix of continuous value-based correlation coefficients as well as natural language processing. This work explains how to overcome the binary value problem and how to find the similarity among odor descriptors using machine learning with natural language semantic representations of words. To address the imbalanced ratio of positive to negative classes in both the continuous value-based correlation coefficient model and the word similarity-based model, we use the Synthetic Minority Oversampling Technique (SMOTE). The model allows us to predict human odor perception through computer simulation by forming odor descriptor groups. Accordingly, this study demonstrates the feasibility of combining machine learning with natural language processing and SMOTE to predict odor descriptor groups from the mass spectra of essential oils.
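The oversampling step named in the abstract can be illustrated with a minimal sketch of the core SMOTE idea: a synthetic minority sample is placed at a random point on the segment between a minority sample and one of its nearest minority-class neighbours. This is not the authors' implementation. The function name `smote`, its parameters, and the two-dimensional feature values are hypothetical and chosen only for illustration; in practice the inputs would be the compressed representations of the mass spectra.

```python
import math
import random


def smote(minority, n_synthetic, k=3, seed=0):
    """Create n_synthetic points by interpolating between minority samples
    and their k nearest minority-class neighbours (the core SMOTE idea)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base point.
        others = [j for j in range(len(minority)) if j != i]
        others.sort(key=lambda j: math.dist(base, minority[j]))
        neighbour = minority[rng.choice(others[:k])]
        # Pick a random point on the segment between base and neighbour.
        lam = rng.random()
        synthetic.append([b + lam * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical 2-D compressed features for samples that carry a rare
# odor descriptor group (the positive, minority class).
minority_features = [[0.10, 0.90], [0.20, 0.80], [0.15, 0.70], [0.30, 0.95]]
new_samples = smote(minority_features, n_synthetic=6, k=2)
print(len(new_samples))
```

Because each synthetic point is a convex combination of two real minority samples, it stays inside the region already occupied by the minority class. The synthetic points would be added to the training set only, so that evaluation still reflects the original class ratio.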

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a69c/7304616/9d067f22d426/pone.0234688.g001.jpg
