Sy Mohamed, Ibrahim Emad Al, Farooq Aamir
Physical Science and Engineering Division (PSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.
ACS Omega. 2024 Sep 4;9(37):39033-39042. doi: 10.1021/acsomega.4c05757. eCollection 2024 Sep 17.
Spectroscopic methods are advantageous for gas detection with applications ranging from safety to operational efficiency. Despite the potential of laser-based sensors, real-world challenges, such as noise, interference and unseen conditions, hinder the accurate identification of species. The use of conventional machine learning (ML) models is constrained by extensive data requirements and their limited adaptability to new conditions. Although augmentation-based strategies have proven to improve the robustness of machine learning models, they still do not offer a complete defense. To address these challenges, this study focuses on three primary goals: first, to detect pressure-induced spectral broadening using simple yet effective augmentations; second, to bypass the need for extensive data sets by deploying a one-shot learning approach that can identify up to 12 volatile organic compounds (VOCs); and third, to provide a provable certification for the one-shot learning model predictions via randomized smoothing. To assess the effectiveness of our proposed augmentations and randomized smoothing, we perform a comparative study with four distinct models: VOC-net, VOC-lite, VOC-plus, and VOC-certifire. Remarkably, the one-shot learning model proposed herein, VOC-certifire, delivers predictions that match the baseline model VOC-net. The VOC-certifire predictions not only exhibit robustness and reliability but are also certified within a predefined norm radius. Such a certification is particularly useful for gas detection, where the robustness, precision and consistency are key to well-informed decision-making.
光谱方法在气体检测方面具有优势,其应用范围涵盖从安全到运营效率等多个领域。尽管基于激光的传感器具有潜力,但诸如噪声、干扰和未知条件等现实世界的挑战阻碍了对气体种类的准确识别。传统机器学习(ML)模型的使用受到大量数据需求及其对新条件有限适应性的限制。尽管基于增强的策略已被证明可提高机器学习模型的鲁棒性,但它们仍无法提供全面的防护。为应对这些挑战,本研究聚焦于三个主要目标:其一,使用简单而有效的增强方法检测压力诱导的光谱展宽;其二,通过部署一种可识别多达12种挥发性有机化合物(VOC)的一次性学习方法,绕过对大量数据集的需求;其三,通过随机平滑为一次性学习模型的预测提供可证明的认证。为评估我们提出的增强方法和随机平滑的有效性,我们与四种不同的模型进行了对比研究:VOC-net、VOC-lite、VOC-plus和VOC-certifire。值得注意的是,本文提出的一次性学习模型VOC-certifire所提供的预测与基线模型VOC-net相匹配。VOC-certifire的预测不仅展现出鲁棒性和可靠性,而且在预定义的范数半径内得到了认证。这种认证对于气体检测尤为有用,因为鲁棒性、精度和一致性是做出明智决策的关键。