Zhu Qiang, Wong Chau-Wai, Lazri Zachary McBride, Chen Mingliang, Fu Chang-Hong, Wu Min
IEEE Trans Biomed Eng. 2025 Jan;72(1):152-165. doi: 10.1109/TBME.2024.3442785. Epub 2025 Jan 15.
Performance improvements obtained by recent principled approaches for pulse rate (PR) estimation from face videos have typically been achieved by adding or modifying certain modules within a reconfigurable system. Yet, evaluations of such remote photoplethysmography (rPPG) are usually performed only at the system level. To better understand each module's contribution and facilitate future research in explainable learning and artificial intelligence for physiological monitoring, this paper conducts a comparative study of video-based, principled PR tracking algorithms, with a focus on challenging fitness scenarios. A review of the progress achieved over the last decade and a half in this field is utilized to construct the major processing modules of a reconfigurable remote pulse rate sensing system. Experiments are conducted on two challenging datasets-an internal collection of 25 videos of two Asian males exercising on stationary-bike, elliptical, and treadmill machines and 34 videos from a public ECG fitness database of 14 men and 3 women exercising on elliptical and stationary-bike machines. The signal-to-noise ratio (SNR), Pearson's correlation coefficient, error count ratio, error rate, and root mean squared error are used for performance evaluation. The top-performing configuration produces respective values of 0.8 dB, 0.86, 9%, 1.7%, and 3.3 beats per minute (bpm) for the internal dataset and 1.3 dB, 0.77, 28.6%, 6.0%, and 8.1 bpm for the ECG Fitness dataset, achieving significant improvements over alternative configurations. Our results suggest a synergistic effect between pulse color mapping and adaptive motion filtering, as well as the importance of a robust frequency tracking algorithm for PR estimation in low SNR settings.
近期通过基于原理的方法从面部视频中估计脉搏率(PR)所取得的性能提升,通常是通过在可重构系统中添加或修改某些模块来实现的。然而,此类远程光电容积脉搏波描记法(rPPG)的评估通常仅在系统层面进行。为了更好地理解每个模块的贡献,并促进未来在用于生理监测的可解释学习和人工智能方面的研究,本文对基于视频的、原理性的PR跟踪算法进行了比较研究,重点关注具有挑战性的健身场景。利用对该领域过去十五年所取得进展的回顾,构建了一个可重构远程脉搏率传感系统的主要处理模块。在两个具有挑战性的数据集上进行了实验——一个内部收集的25个视频,内容是两名亚洲男性在固定自行车、椭圆机和跑步机上锻炼,以及来自一个公共心电图健身数据库的34个视频,内容是14名男性和3名女性在椭圆机和固定自行车上锻炼。使用信噪比(SNR)、皮尔逊相关系数、误差计数率、错误率和均方根误差进行性能评估。对于内部数据集,表现最佳的配置分别产生0.8分贝、0.86、9%、1.7%和每分钟3.3次心跳(bpm)的值,对于心电图健身数据集,分别产生1.3分贝、0.77、28.6%、6.0%和每分钟8.1次心跳的值,与其他配置相比有显著改进。我们的结果表明脉搏颜色映射和自适应运动滤波之间存在协同效应,以及在低信噪比设置下用于PR估计的强大频率跟踪算法的重要性。