Suppr超能文献

基于双波段变换和竞争自适应重加权采样的近红外光谱数据定量分析。

Quantitative analysis of near infrared spectroscopic data based on dual-band transformation and competitive adaptive reweighted sampling.

机构信息

Faculty of Information Technology, Beijing University of Technology, Beijing, China.

Faculty of Information Technology, Beijing University of Technology, Beijing, China.

出版信息

Spectrochim Acta A Mol Biomol Spectrosc. 2023 Jan 15;285:121924. doi: 10.1016/j.saa.2022.121924. Epub 2022 Oct 2.

Abstract

Near infrared (NIR) spectroscopy has the characteristics of rapid processing, nondestructive analysis and on-line detection. This technique has been widely used in the fields of quantitative determination and substance content analysis. However, for complex NIR spectral data, most traditional machine learning models cannot carry out effective quantitative analyses (manifested as underfitting; that is, the training effect of the model is not good). Small amounts of available data limit the performance of deep learning-based infrared spectroscopy methods, while the traditional threshold-based feature selection methods require more prior knowledge. To address the above problems, this paper proposes a competitive adaptive reweighted sampling method based on dual band transformation (DWT-CARS). DWT-CARS includes four types in total: CARS based on integrated two-dimensional correlation spectrum (i2DCOS-CARS), CARS based on difference coefficient (DI-CARS), CARS based on ratio coefficient (RI-CARS) and CARS based on normalized difference coefficient (NDI-CARS). We conducted comparative experiments on three datasets; compared to traditional machine learning methods, our method achieved good results, demonstrating that this method has considerable prospects for the quantitative analysis of near-infrared spectroscopic data. To further improve the performance and stability of this method, we combined the idea of integrated modeling and constructed a partial least squares model based on Monte Carlo sampling for the samples obtained by CARS (DWT-CARS-MC-PLS). Through comparative experiments, we verified that the integrated model could further enhance the accuracy and stability of the results.

摘要

近红外(NIR)光谱具有快速处理、无损分析和在线检测的特点。该技术已广泛应用于定量测定和物质含量分析领域。然而,对于复杂的 NIR 光谱数据,大多数传统的机器学习模型无法进行有效的定量分析(表现为欠拟合,即模型的训练效果不好)。可用数据量少限制了基于深度学习的红外光谱方法的性能,而传统的基于阈值的特征选择方法需要更多的先验知识。针对上述问题,本文提出了一种基于双带变换(DWT-CARS)的竞争自适应重加权采样方法。DWT-CARS 总共包括四种类型:基于二维相关光谱的集成(i2DCOS-CARS)、基于差分系数的 CARS(DI-CARS)、基于比系数的 CARS(RI-CARS)和基于归一化差分系数的 CARS(NDI-CARS)。我们在三个数据集上进行了对比实验,与传统的机器学习方法相比,我们的方法取得了良好的效果,表明该方法在近红外光谱数据的定量分析中具有相当大的前景。为了进一步提高该方法的性能和稳定性,我们结合集成建模的思想,对 CARS(DWT-CARS-MC-PLS)获得的样本构建了基于蒙特卡罗抽样的偏最小二乘模型。通过对比实验,验证了集成模型可以进一步提高结果的准确性和稳定性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验