Wang Jiyao, Wei Ximeng, Lu Hao, Chen Yingcong, He Dengbo
IEEE J Biomed Health Inform. 2024 Dec;28(12):7090-7102. doi: 10.1109/JBHI.2024.3433461. Epub 2024 Dec 5.
Remote photoplethysmography (rPPG) is a contactless technique that facilitates the measurement of physiological signals and cardiac activities through facial video recordings. This approach holds tremendous potential for various applications. However, existing rPPG methods often did not account for different types of occlusions that commonly occur in real-world scenarios, such as temporary movement or actions of humans in videos or dust on camera. The failure to address these occlusions can compromise the accuracy of rPPG algorithms. To address this issue, we proposed a novel Condiff-rPPG to improve the robustness of rPPG measurement facing various occlusions. First, we compressed the damaged face video into a spatio-temporal representation with several types of masks. Second, the diffusion model was designed to recover the missing information with observed values as a condition. Moreover, a novel low-rank decomposition regularization was proposed to eliminate background noise and maximize informative features. ConDiff-rPPG ensured consistency in optimization goals during the training process. Through extensive experiments, including intra- and cross-dataset evaluations, as well as ablation tests, we demonstrated the robustness and generalization ability of our proposed model.
远程光电容积脉搏波描记术(rPPG)是一种非接触式技术,可通过面部视频记录来测量生理信号和心脏活动。这种方法在各种应用中具有巨大潜力。然而,现有的rPPG方法通常没有考虑到现实场景中常见的不同类型遮挡,例如视频中人物的临时移动或动作,或者相机上的灰尘。未能解决这些遮挡问题会影响rPPG算法的准确性。为了解决这个问题,我们提出了一种新颖的Condiff-rPPG,以提高rPPG测量面对各种遮挡时的鲁棒性。首先,我们使用几种类型的掩码将受损的面部视频压缩成时空表示。其次,设计扩散模型以观察值为条件恢复缺失信息。此外,还提出了一种新颖的低秩分解正则化方法来消除背景噪声并最大化信息特征。ConDiff-rPPG在训练过程中确保了优化目标的一致性。通过广泛的实验,包括数据集内和跨数据集评估以及消融测试,我们证明了所提出模型的鲁棒性和泛化能力。