Weber-Boisvert Guillaume, Gosselin Benoit, Sandberg Frida
Department of Electrical and Computer Engineering, Université Laval, Quebec, QC, Canada.
Department of Biomedical Engineering, Lund University, Lund, Sweden.
Front Physiol. 2023 Mar 2;14:1126957. doi: 10.3389/fphys.2023.1126957. eCollection 2023.
The large MIMIC waveform dataset, sourced from intensive care units, has been used extensively for the development of Photoplethysmography (PPG) based blood pressure (BP) estimation algorithms. Yet, because the data comes from patients in severe conditions-often under the effect of drugs-it is regularly noted that the relationship between BP and PPG signal characteristics may be anomalous, a claim that we investigate here. A sample of 12,000 records from the MIMIC waveform dataset was stacked up against the 219 records of the PPG-BP dataset, an alternative public dataset obtained under controlled experimental conditions. The distribution of systolic and diastolic BP data and 31 PPG pulse morphological features was first compared between datasets. Then, the correlation between features and BP, as well as between the features themselves, was analysed. Finally, regression models were trained for each dataset and validated against the other. Statistical analysis showed significant differences between the datasets in diastolic BP and in 20 out of 31 features when adjusting for heart rate differences. The eight features showing the highest rank correlation to systolic BP in PPG-BP all displayed muted correlation levels in MIMIC. Regression tests showed twice higher baseline predictive power with PPG-BP than with MIMIC. Cross-dataset regression displayed a practically complete loss of predictive power for all models. The differences between the MIMIC and PPG-BP dataset exposed in this study suggest that BP estimation models based on the MIMIC dataset have reduced predictive power on the general population.
源自重症监护病房的大型MIMIC波形数据集已被广泛用于基于光电容积脉搏波描记法(PPG)的血压(BP)估计算法的开发。然而,由于数据来自病情严重的患者——通常处于药物影响之下——人们经常指出血压与PPG信号特征之间的关系可能异常,我们在此对这一说法进行研究。将来自MIMIC波形数据集的12000条记录样本与PPG-BP数据集的219条记录进行对比,后者是在受控实验条件下获得的另一个公共数据集。首先比较了两个数据集之间收缩压和舒张压数据的分布以及31个PPG脉搏形态特征。然后,分析了特征与血压之间以及特征本身之间的相关性。最后,针对每个数据集训练回归模型,并与另一个数据集进行验证。统计分析表明,在调整心率差异后,两个数据集在舒张压和31个特征中的20个特征上存在显著差异。在PPG-BP数据集中与收缩压显示出最高等级相关性的八个特征在MIMIC数据集中的相关性水平均不明显。回归测试表明,PPG-BP的基线预测能力是MIMIC的两倍。跨数据集回归显示所有模型的预测能力几乎完全丧失。本研究中揭示的MIMIC和PPG-BP数据集之间的差异表明,基于MIMIC数据集的血压估计模型对普通人群的预测能力有所降低。