Cai Yunfei, Che Xiang, Duan Yusen
Shanghai Environment Monitoring Center, Shanghai 200235, China.
NHC Key Laboratory of Health Technology Assessment, Key Laboratory of Public Health Safety of the Ministry of Education, Department of Environmental Health, School of Public Health, Fudan University, Shanghai 200032, China.
Sensors (Basel). 2025 Aug 27;25(17):5314. doi: 10.3390/s25175314.
(1) Objective: Volatile organic compounds (VOCs) monitoring in industrial parks is crucial for environmental regulation and public health protection. However, current techniques face challenges related to cost and real-time performance. This study aims to develop a dynamic calibration framework for accurate real-time conversion of VOCs volume fractions (nmol mol) to mass concentrations (μg m) in industrial environments, addressing the limitations of conventional monitoring methods such as high costs and delayed response times. (2) Methods: By innovatively integrating photoionization detector (PID) with machine learning, we developed a robust conversion model utilizing PID signals, meteorological data, and a random forest's (RF) algorithm. The system's performance was rigorously evaluated against standard gas chromatography-flame ionization detectors (GC-FID) measurements. (3) Results: The proposed framework demonstrated superior performance, achieving a coefficient of determination (R) of 0.81, root mean squared error (RMSE) of 48.23 μg m, symmetric mean absolute percentage error (SMAPE) of 62.47%, and a normalized RMSE (RMSE) of 2.07%, outperforming conventional methods. This framework not only achieved minute-level response times but also reduced costs to just 10% of those associated with GC-FID methods. Additionally, the model exhibited strong cross-site robustness with R values ranging from 0.68 to 0.69, although its accuracy was somewhat reduced for high-concentration samples (>1500 μg m), where the mean absolute percentage error (MAPE) was 17.8%. The inclusion of SMAPE and RMSE provides a more nuanced understanding of the model's performance, particularly in the context of skewed or heteroscedastic data distributions, thereby offering a more comprehensive assessment of the framework's effectiveness. (4) Conclusions: The framework's innovative combination of PID's real-time capability and RF's nonlinear modeling achieves accurate mass concentration conversion (R = 0.81) while maintaining a 95% faster response and 90% cost reduction compared to GC-FID systems. Compared with traditional single-coefficient PID calibration, this framework significantly improves accuracy and adaptability under dynamic industrial conditions. Future work will apply transfer learning to improve high-concentration detection for pollution tracing and environmental governance in industrial parks.
(1) 目的:工业园区挥发性有机化合物(VOCs)监测对于环境监管和公众健康保护至关重要。然而,当前技术面临成本和实时性能方面的挑战。本研究旨在开发一种动态校准框架,用于在工业环境中准确实时地将VOCs体积分数(nmol/mol)转换为质量浓度(μg/m),解决传统监测方法成本高和响应时间延迟等局限性。(2) 方法:通过创新性地将光离子化检测器(PID)与机器学习相结合,我们利用PID信号、气象数据和随机森林(RF)算法开发了一个强大的转换模型。该系统的性能与标准气相色谱 - 火焰离子化检测器(GC - FID)测量结果进行了严格评估。(3) 结果:所提出的框架表现出卓越的性能,决定系数(R)为0.81,均方根误差(RMSE)为48.23μg/m,对称平均绝对百分比误差(SMAPE)为62.47%,归一化均方根误差(NRMSE)为2.07%,优于传统方法。该框架不仅实现了分钟级响应时间,还将成本降低至GC - FID方法相关成本的仅10%。此外,该模型表现出很强的跨站点稳健性,R值范围为0.68至0.69,尽管对于高浓度样品(>1500μg/m)其准确性有所降低,此时平均绝对百分比误差(MAPE)为17.8%。包含SMAPE和RMSE能更细致地理解模型性能,特别是在数据分布偏态或异方差的情况下,从而对框架的有效性提供更全面的评估。(4) 结论:该框架将PID的实时能力与RF的非线性建模进行创新性结合,实现了准确的质量浓度转换(R = 0.81),同时与GC - FID系统相比,响应速度快95%,成本降低90%。与传统单系数PID校准相比,该框架在动态工业条件下显著提高了准确性和适应性。未来工作将应用迁移学习来改进高浓度检测,以用于工业园区的污染追踪和环境治理。