School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China.
Department of Mechanical Engineering, Hong Kong City University, Hong Kong, 999077, China.
Sci Rep. 2024 Aug 19;14(1):19144. doi: 10.1038/s41598-024-70192-1.
Peripheral Capillary Oxygen Saturation (SpO) has received increasing attention during the COVID-19 pandemic. Clinical investigations have demonstrated that individuals afflicted with COVID-19 exhibit notably reduced levels of SpO before the deterioration of their health status. To cost-effectively enable individuals to monitor their SpO, this paper proposes a novel neural network model named "ITSCAN" based on Temporal Shift Module. Benefiting from the widespread use of smartphones, this model can assess an individual's SpO in real time, utilizing standard facial video footage, with a temporal granularity of seconds. The model is interweaved by two distinct branches: the motion branch, responsible for extracting spatiotemporal data features and the appearance branch, focusing on the correlation between feature channels and the location information of feature map using coordinate attention mechanisms. Accordingly, the SpO estimator generates the corresponding SpO value. This paper summarizes for the first time 5 loss functions commonly used in the SpO estimation model. Subsequently, a novel loss function has been contributed through the examination of various combinations and careful selection of hyperparameters. Comprehensive ablation experiments analyze the independent impact of each module on the overall model performance. Finally, the experimental results based on the public dataset (VIPL-HR) show that our model has obvious advantages in MAE (1.10%) and RMSE (1.19%) compared with related work, which implies more accuracy of the proposed method to contribute to public health.
外周毛细血管血氧饱和度 (SpO) 在 COVID-19 大流行期间受到越来越多的关注。临床研究表明,COVID-19 患者在健康状况恶化之前,SpO 明显降低。为了以具有成本效益的方式使个人能够监测其 SpO,本文提出了一种基于时间移位模块的新型神经网络模型“ITSCAN”。受益于智能手机的广泛使用,该模型可以利用标准面部视频片段,以秒为时间粒度,实时评估个体的 SpO。该模型由两个不同的分支交织而成:运动分支,负责提取时空数据特征,外观分支,使用坐标注意力机制关注特征通道之间的相关性和特征图的位置信息。因此,SpO 估计器生成相应的 SpO 值。本文首次总结了 SpO 估计模型中常用的 5 种损失函数。随后,通过检查各种组合并仔细选择超参数,提出了一种新的损失函数。全面的消融实验分析了每个模块对整体模型性能的独立影响。最后,基于公共数据集 (VIPL-HR) 的实验结果表明,与相关工作相比,我们的模型在 MAE(1.10%)和 RMSE(1.19%)方面具有明显优势,这意味着该方法的准确性更高,有助于公共健康。