IEEE J Biomed Health Inform. 2024 Feb;28(2):598-608. doi: 10.1109/JBHI.2023.3314282. Epub 2024 Feb 5.
Camera-based photoplethysmography (cbP PG) is a non-contact technique that measures cardiac-related blood volume alterations in skin surface vessels through the analysis of facial videos. While traditional approaches can estimate heart rate (HR) under different illuminations, their accuracy can be affected by motion artifacts, leading to poor waveform fidelity and hindering further analysis of heart rate variability (HRV); deep learning-based approaches reconstruct high-quality pulse waveform, yet their performance significantly degrades under illumination variations. In this work, we aim to leverage the strength of these two methods and propose a framework that possesses favorable generalization capabilities while maintaining waveform fidelity. For this purpose, we propose the cbPPGGAN, an enhancement framework for cbPPG that enables the flexible incorporation of both unpaired and paired data sources in the training process. Based on the waveforms extracted by traditional approaches, the cbPPGGAN reconstructs high-quality waveforms that enable accurate HR estimation and HRV analysis. In addition, to address the lack of paired training data problems in real-world applications, we propose a cycle consistency loss that guarantees the time-frequency consistency before/after mapping. The method enhances the waveform quality of traditional POS approaches in different illumination tests (BH-rPPG) and cross-datasets (UBFC-rPPG) with mean absolute error (MAE) values of 1.34 bpm and 1.65 bpm, and average beat-to-beat (AVBB) values of 27.46 ms and 45.28 ms, respectively. Experimental results demonstrate that the cbPPGGAN enhances cbPPG signal quality and outperforms the state-of-the-art approaches in HR estimation and HRV analysis. The proposed framework opens a new pathway toward accurate HR estimation in an unconstrained environment.
基于摄像机的光电容积脉搏波描记术 (cbP PG) 是一种非接触技术,通过分析面部视频来测量皮肤表面血管中与心脏相关的血液体积变化。虽然传统方法可以在不同光照下估计心率 (HR),但其准确性可能会受到运动伪影的影响,导致波形保真度差,进一步分析心率变异性 (HRV) 受到阻碍;基于深度学习的方法可以重建高质量的脉搏波形,但在光照变化下其性能会显著下降。在这项工作中,我们旨在利用这两种方法的优势,并提出一个具有良好泛化能力且保持波形保真度的框架。为此,我们提出了 cbPPGGAN,这是一种 cbPPG 的增强框架,能够在训练过程中灵活地结合未配对和配对的数据源。基于传统方法提取的波形,cbPPGGAN 可以重建高质量的波形,从而实现准确的 HR 估计和 HRV 分析。此外,为了解决实际应用中配对训练数据不足的问题,我们提出了一种循环一致性损失,以保证映射前后的时频一致性。该方法通过不同光照测试 (BH-rPPG) 和跨数据集 (UBFC-rPPG) 增强了传统 POS 方法的波形质量,平均绝对误差 (MAE) 值分别为 1.34 bpm 和 1.65 bpm,平均逐拍 (AVBB) 值分别为 27.46 ms 和 45.28 ms。实验结果表明,cbPPGGAN 可以增强 cbPPG 信号质量,并在 HR 估计和 HRV 分析方面优于最新方法。该框架为在非约束环境下进行准确的 HR 估计开辟了新途径。