Chowdhury M Arshad Zahangir, Oehlschlaeger Matthew A
Department of Mechanical, Aerospace, and Nuclear Engineering, Rensselaer Polytechnic Institute, 110 Eighth St., Troy, NY 12180, USA.
Sensors (Basel). 2024 Mar 14;24(6):1873. doi: 10.3390/s24061873.
Deep learning methods, a powerful form of artificial intelligence, have been applied in a number of spectroscopy and gas sensing applications. However, the speciation of multi-component gas mixtures from infrared (IR) absorption spectra using deep learning remains to be explored. Here, we propose a one-dimensional deep convolutional neural network gas classification model for the identification of small molecules of interest based on IR absorption spectra in flexible user-defined frequency ranges. The molecules considered include ten that are of interest in the atmosphere or in industrial and environmental processes: water vapor, carbon dioxide, ozone, nitrous oxide, carbon monoxide, methane, nitric oxide, sulfur dioxide, nitrogen dioxide, and ammonia. A simulated dataset of IR absorption spectra for mixtures of these molecules diluted in air was generated and used to train a deep learning model. The model was tested against simulated spectra containing noise and was found to provide speciation predictions with accuracy from 82 to 97%. The internal operation of the model was investigated using class activation maps that illustrate how the model prioritizes spectral information for classification. Finally, the model was demonstrated for the prediction of speciation for two synthetic experimental mixture spectra. The proposed model and the dataset generation strategies are generalized and can be implemented for other gases, different frequency ranges, and spectroscopy types. The multi-component speciation method developed herein is the first application of a convolutional neural network model, trained on HITRAN-based simulations, for spectral identification.
深度学习方法作为人工智能的一种强大形式,已被应用于许多光谱学和气体传感应用中。然而,利用深度学习从红外(IR)吸收光谱中对多组分气体混合物进行物种鉴定仍有待探索。在此,我们提出一种一维深度卷积神经网络气体分类模型,用于在灵活的用户定义频率范围内基于红外吸收光谱识别感兴趣的小分子。所考虑的分子包括在大气或工业及环境过程中感兴趣的十种分子:水蒸气、二氧化碳、臭氧、一氧化二氮、一氧化碳、甲烷、一氧化氮、二氧化硫、二氧化氮和氨。生成了这些分子在空气中稀释后的混合物的红外吸收光谱模拟数据集,并用于训练深度学习模型。该模型针对包含噪声的模拟光谱进行了测试,发现其物种鉴定预测准确率为82%至97%。使用类激活映射研究了模型的内部操作,该映射说明了模型如何为分类对光谱信息进行优先级排序。最后,对两个合成实验混合光谱的物种鉴定预测进行了演示。所提出的模型和数据集生成策略具有通用性,可用于其他气体、不同频率范围和光谱类型。本文开发的多组分物种鉴定方法是首个基于HITRAN模拟训练的卷积神经网络模型用于光谱识别的应用。