School of Electrical and Information Engineering, North Minzu University, No. 204 North Wenchang Street, Yinchuan, 750021, Ningxia, China.
School of Medical Technology, North Minzu University, No. 204 North Wenchang Street, Yinchuan, 750021, Ningxia, China.
Sci Rep. 2024 Jun 1;14(1):12598. doi: 10.1038/s41598-024-63280-9.
To tackle the difficulty of extracting features from one-dimensional spectral signals using traditional spectral analysis, a metabolomics analysis method is proposed to locate two-dimensional correlated spectral feature bands and combine it with deep learning classification for wine origin traceability. Metabolomics analysis was performed on 180 wine samples from 6 different wine regions using UPLC-Q-TOF-MS. Indole, Sulfacetamide, and caffeine were selected as the main differential components. By analyzing the molecular structure of these components and referring to the main functional groups on the infrared spectrum, characteristic band regions with wavelengths in the range of 1000-1400 nm and 1500-1800 nm were selected. Draw two-dimensional correlation spectra (2D-COS) separately, generate synchronous correlation spectra and asynchronous correlation spectra, establish convolutional neural network (CNN) classification models, and achieve the purpose of wine origin traceability. The experimental results demonstrate that combining two segments of two-dimensional characteristic spectra determined by metabolomics screening with convolutional neural networks yields optimal classification results. This validates the effectiveness of using metabolomics screening to determine spectral feature regions in tracing wine origin. This approach effectively removes irrelevant variables while retaining crucial chemical information, enhancing spectral resolution. This integrated approach strengthens the classification model's understanding of samples, significantly increasing accuracy.
为了解决传统光谱分析中从一维光谱信号中提取特征的困难,提出了一种代谢组学分析方法,用于定位二维相关光谱特征带,并结合深度学习分类进行葡萄酒产地溯源。使用 UPLC-Q-TOF-MS 对来自 6 个不同葡萄酒产区的 180 个葡萄酒样本进行代谢组学分析。选择吲哚、磺胺醋酰和咖啡因作为主要差异成分。通过分析这些成分的分子结构,并参考红外光谱上的主要功能团,选择波长范围在 1000-1400nm 和 1500-1800nm 的特征波段区域。分别绘制二维相关光谱(2D-COS),生成同步相关光谱和异步相关光谱,建立卷积神经网络(CNN)分类模型,实现葡萄酒产地溯源的目的。实验结果表明,将代谢组学筛选确定的两段二维特征光谱与卷积神经网络相结合,可获得最佳的分类结果。这验证了使用代谢组学筛选确定光谱特征区域进行葡萄酒产地溯源的有效性。该方法有效地去除了不相关的变量,同时保留了关键的化学信息,增强了光谱分辨率。这种集成方法增强了分类模型对样本的理解,显著提高了准确性。