Li Hanyang, Mazzei Leonardo, Wallis Christopher D, Wexler Anthony S
Air Quality Research Center, University of California Davis, Davis, CA, 95616, USA.
Mechanical and Aerospace Engineering, University of California, Davis, CA, 95616, USA.
J Aerosol Sci. 2022 Jan;159. doi: 10.1016/j.jaerosci.2021.105874. Epub 2021 Sep 7.
We have recently developed a low-cost spark-induced breakdown spectroscopy (SIBS) instrument for analysis of toxic metal aerosol particles that we call TARTA (toxic-metal aerosol real time analyzer). In this work, we applied machine learning methods to improve the quantitative analysis of elemental mass concentrations measured by this instrument. Specifically, we applied least absolute shrinkage and selection operator (LASSO), partial least squares (PLS) regression, principal component regression (PCR), and support vector regression (SVR) to develop multivariate calibration models for 13 metals (e.g., Cr, Cu, Mn, Fe, Zn, Co, Al, K, Be, Hg, Cd, Pb, and Ni), some of which are included on the US EPA hazardous air pollutants (HAPS) list. The calibration performance, adjusted coefficient of determination (R) and normalized root mean square error (RMSE), and limit of detection (LOD) of the proposed models were compared to those of univariate calibration models for each analyte. Our results suggest that machine learning models tend to have better prediction accuracy and lower LODs than conventional univariate calibration, of which the LASSO approach performs the best with R > 0.8 and LODs of 40-170 ng m at a sampling time of 30 min and a flow rate of 15 l min . We then assessed the applicability of the LASSO model for quantifying elemental concentrations in mixtures of these metals, serving as independent validation datasets. Ultimately, the LASSO model developed in this work is a very promising machine learning approach for quantifying mass concentration of metals in aerosol particles using TARTA.
我们最近开发了一种低成本的火花诱导击穿光谱仪(SIBS),用于分析有毒金属气溶胶颗粒,我们将其称为TARTA(有毒金属气溶胶实时分析仪)。在这项工作中,我们应用机器学习方法来改进该仪器测量的元素质量浓度的定量分析。具体而言,我们应用最小绝对收缩和选择算子(LASSO)、偏最小二乘(PLS)回归、主成分回归(PCR)和支持向量回归(SVR)来开发针对13种金属(例如铬、铜、锰、铁、锌、钴、铝、钾、铍、汞、镉、铅和镍)的多元校准模型,其中一些金属包含在美国环境保护局(EPA)的有害空气污染物(HAPS)清单中。将所提出模型的校准性能、调整后的决定系数(R)和归一化均方根误差(RMSE)以及检测限(LOD)与每种分析物的单变量校准模型进行了比较。我们的结果表明,机器学习模型往往比传统的单变量校准具有更好的预测准确性和更低的检测限,其中LASSO方法表现最佳,在采样时间为30分钟、流速为15升/分钟时,R>0.8,检测限为40 - 170纳克/立方米。然后,我们评估了LASSO模型在量化这些金属混合物中元素浓度方面的适用性,将其用作独立验证数据集。最终,这项工作中开发的LASSO模型是一种非常有前景的机器学习方法,可用于使用TARTA量化气溶胶颗粒中金属的质量浓度。