Understanding the learning mechanism of convolutional neural networks in spectral analysis.

Affiliations

College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, Zhejiang, 310058, China; Key Laboratory of on Site Processing Equipment for Agricultural Products, Ministry of Agriculture and Rural Affairs, China.

School of Geosciences and Info-Physics, Central South University, South Lushan Road, Changsha, 410000, China.

Publication information

Anal Chim Acta. 2020 Jul 4;1119:41-51. doi: 10.1016/j.aca.2020.03.055. Epub 2020 Apr 8.

DOI:10.1016/j.aca.2020.03.055

PMID:32439053

Abstract

Deep learning approaches, especially convolutional neural network (CNN) models, have achieved excellent performance in vibrational spectral analysis. The critical drawback of the CNN approach is its lack of interpretability: the model is regarded as a black box. Interpreting the learning mechanism of chemometric models is critical for intuitive understanding and further application. In this study, an interpretable CNN model with a global average pooling layer is presented for Raman and mid-infrared spectral data analysis. A class activation mapping (CAM)-based approach is leveraged to visualize the active variables across the whole spectrum. The visualization of active variables shows a discriminative pattern in which the variables that contribute most peak around theoretical chemical characteristic bands. The visualization of the feature maps from the three convolutional layers demonstrates the data transformation pipeline and how the CNN model hierarchically extracts informative spectral features. The first layer acts as a Savitzky-Golay filter and learns spectral shape characteristics, while the second layer learns enhanced patterns from typical spectral peaks on a few correlated variables. The third layer shows stable activations on critical spectral peaks. A partial least squares-linear discriminant analysis (PLS-LDA) model is presented for comparison of classification accuracy and model interpretation. The CNN model yields mean classification accuracies of 99.01% and 100% on the test sets of the E. coli and meat datasets, respectively, while the PLS-LDA models obtain accuracies of 98.83% and 100%. Both the CNN and PLS-LDA models demonstrate stable patterns of active variables, while the CNN models are more stable than the PLS-LDA models in classification performance across various dataset partitions under Monte-Carlo cross-validation.
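The CAM idea mentioned in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the array shapes, the random stand-in feature maps, the bias-free dense layer, and the helper names `class_activation_map` and `gap_logits` are all assumptions made for illustration. The sketch assumes the last convolutional layer outputs K feature maps over N spectral positions and that global average pooling feeds a single linear classification layer.

```python
import numpy as np


def class_activation_map(feature_maps, fc_weights, class_idx):
    """Weight each feature map by the class weight and sum over channels.

    feature_maps: (K, N) activations of the last convolutional layer.
    fc_weights:   (C, K) weights of the dense layer after global average pooling.
    Returns an (N,) array: the contribution of each spectral position.
    """
    return fc_weights[class_idx] @ feature_maps


def gap_logits(feature_maps, fc_weights):
    """Class scores from global average pooling plus a bias-free dense layer."""
    pooled = feature_maps.mean(axis=1)  # (K,) one average per channel
    return fc_weights @ pooled          # (C,) one score per class


# Hypothetical stand-in data: 8 channels, 200 spectral variables, 3 classes.
rng = np.random.default_rng(0)
K, N, C = 8, 200, 3
feature_maps = np.maximum(rng.normal(size=(K, N)), 0.0)  # ReLU-like activations
fc_weights = rng.normal(size=(C, K))

logits = gap_logits(feature_maps, fc_weights)
predicted = int(np.argmax(logits))
cam = class_activation_map(feature_maps, fc_weights, predicted)

# Indices of the five spectral variables that contribute most to the prediction.
top_variables = np.argsort(cam)[::-1][:5]
print(predicted, top_variables)
```

Under these assumptions the mean of the map equals the class score, which is why the map can be read as a per-variable decomposition of the prediction. If pooling or strided convolutions shorten the feature maps, the map would first need to be interpolated back to the length of the input spectrum.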

Similar articles

Understanding the learning mechanism of convolutional neural networks in spectral analysis.

Anal Chim Acta. 2020 Jul 4;1119:41-51. doi: 10.1016/j.aca.2020.03.055. Epub 2020 Apr 8.

Feature selection of infrared spectra analysis with convolutional neural network.

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Feb 5;266:120361. doi: 10.1016/j.saa.2021.120361. Epub 2021 Sep 4.

A deep dive into understanding tumor foci classification using multiparametric MRI based on convolutional neural network.

Med Phys. 2020 Sep;47(9):4077-4086. doi: 10.1002/mp.14255. Epub 2020 Jun 12.

Raman spectroscopy combined with convolutional neural network for the sub-types classification of breast cancer and critical feature visualization.

Comput Methods Programs Biomed. 2024 Oct;255:108361. doi: 10.1016/j.cmpb.2024.108361. Epub 2024 Aug 3.

Improving skin cancer detection by Raman spectroscopy using convolutional neural networks and data augmentation.

Front Oncol. 2024 Jun 19;14:1320220. doi: 10.3389/fonc.2024.1320220. eCollection 2024.

Convolutional neural networks for decoding of covert attention focus and saliency maps for EEG feature visualization.

J Neural Eng. 2019 Oct 23;16(6):066010. doi: 10.1088/1741-2552/ab3bb4.

Deep learning for liver tumor diagnosis part I: development of a convolutional neural network classifier for multi-phasic MRI.

Eur Radiol. 2019 Jul;29(7):3338-3347. doi: 10.1007/s00330-019-06205-9. Epub 2019 Apr 23.

GP-CNN-DTEL: Global-Part CNN Model With Data-Transformed Ensemble Learning for Skin Lesion Classification.

IEEE J Biomed Health Inform. 2020 Oct;24(10):2870-2882. doi: 10.1109/JBHI.2020.2977013. Epub 2020 Feb 28.

CEFEs: A CNN Explainable Framework for ECG Signals.

Artif Intell Med. 2021 May;115:102059. doi: 10.1016/j.artmed.2021.102059. Epub 2021 Mar 26.

Learning hidden patterns from patient multivariate time series data using convolutional neural networks: A case study of healthcare cost prediction.

J Biomed Inform. 2020 Nov;111:103565. doi: 10.1016/j.jbi.2020.103565. Epub 2020 Sep 25.

Cited by

Unlocking chickpea flour potential: AI-powered prediction for quality assessment and compositional characterisation.

Curr Res Food Sci. 2025 Mar 21;10:101030. doi: 10.1016/j.crfs.2025.101030. eCollection 2025.

Quantitative modelling of Plato and total flavonoids in Qingke wort at mashing and boiling stages based on FT-IR combined with deep learning and chemometrics.

Food Chem X. 2024 Jul 18;23:101673. doi: 10.1016/j.fochx.2024.101673. eCollection 2024 Oct 30.

Classification of osteoarthritic and healthy cartilage using deep learning with Raman spectra.

Sci Rep. 2024 Jul 10;14(1):15902. doi: 10.1038/s41598-024-66857-6.

Multi-branch attention Raman network and surface-enhanced Raman spectroscopy for the classification of neurological disorders.

Biomed Opt Express. 2024 May 1;15(6):3523-3540. doi: 10.1364/BOE.514196. eCollection 2024 Jun 1.

Advances in microscopy characterization techniques for lipid nanocarriers in drug delivery: a comprehensive review.

Naunyn Schmiedebergs Arch Pharmacol. 2024 Aug;397(8):5463-5481. doi: 10.1007/s00210-024-03033-7. Epub 2024 Mar 9.

Detection of protein, starch, oil, and moisture content of corn kernels using one-dimensional convolutional autoencoder and near-infrared spectroscopy.

PeerJ Comput Sci. 2023 Mar 9;9:e1266. doi: 10.7717/peerj-cs.1266. eCollection 2023.

Novel prediction models for hyperketonemia using bovine milk Fourier-transform infrared spectroscopy.

Prev Vet Med. 2023 Apr;213:105860. doi: 10.1016/j.prevetmed.2023.105860. Epub 2023 Jan 25.

Raman spectroscopy and convolutional neural networks for monitoring biochemical radiation response in breast tumour xenografts.

Sci Rep. 2023 Jan 27;13(1):1530. doi: 10.1038/s41598-023-28479-2.

RamanNet: a lightweight convolutional neural network for bacterial identification based on Raman spectra.

RSC Adv. 2022 Sep 16;12(40):26463-26469. doi: 10.1039/d2ra03722j. eCollection 2022 Sep 12.

Estimation of Soluble Solids for Stone Fruit Varieties Based on Near-Infrared Spectra Using Machine Learning Techniques.

Sensors (Basel). 2022 Aug 14;22(16):6081. doi: 10.3390/s22166081.
