Yang Jie, Xu Jinfan, Zhang Xiaolei, Wu Chiyu, Lin Tao, Ying Yibin
College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, Zhejiang, 310058, China; Key Laboratory of On Site Processing Equipment for Agricultural Products, Ministry of Agriculture and Rural Affairs, China.
College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, Zhejiang, 310058, China.
Anal Chim Acta. 2019 Nov 12;1081:6-17. doi: 10.1016/j.aca.2019.06.012. Epub 2019 Jun 8.
The development of chemometrics aims to provide an effective analysis approach for data generated by advanced analytical instruments. The success of existing analytical approaches in spectral analysis still relies on preprocessing and feature selection techniques to remove signal artifacts based on prior experiences. Data-driven deep learning analysis has been developed and successfully applied in many domains in the last few years. How to integrate deep learning with spectral analysis received increased attention for chemometrics. Approximately 20 recently published studies demonstrate that deep neural networks can learn critical patterns from raw spectra, which significantly reduces the demand for feature engineering. The composition of multiple processing layers improves the fitting and feature extraction capability and makes them applicable to various analytical tasks. This advance offers a new solution for chemometrics toward resolving challenges related to spectral data with rapidly increased sample numbers from various sources. We further provide a practical guide to the development of a deep convolutional neural network-based analytical workflow. The design of the network structure, tuning the hyperparameters in the training process, and repeatability of results is mainly discussed. Future studies are needed on interpretability and repeatability of the deep learning approach in spectral analysis.
化学计量学的发展旨在为先进分析仪器生成的数据提供一种有效的分析方法。现有光谱分析方法的成功仍依赖于基于先验经验的预处理和特征选择技术来去除信号伪影。在过去几年中,数据驱动的深度学习分析得到了发展并成功应用于许多领域。如何将深度学习与光谱分析相结合在化学计量学中受到了越来越多的关注。最近大约20项已发表的研究表明,深度神经网络可以从原始光谱中学习关键模式,这显著减少了对特征工程的需求。多个处理层的组成提高了拟合和特征提取能力,并使其适用于各种分析任务。这一进展为化学计量学提供了一种新的解决方案,以应对来自各种来源的样本数量迅速增加的情况下与光谱数据相关的挑战。我们进一步提供了基于深度卷积神经网络的分析工作流程开发的实用指南。主要讨论了网络结构的设计、训练过程中超参数的调整以及结果的可重复性。未来需要对深度学习方法在光谱分析中的可解释性和可重复性进行研究。