Mazumder Oishee, Banerjee Rohan, Roy Dibyendu, Bhattacharya Sakyajit, Ghose Avik, Sinha Aniruddha
IEEE J Biomed Health Inform. 2022 May;26(5):2136-2146. doi: 10.1109/JBHI.2022.3147383. Epub 2022 May 5.
This paper presents a novel approach of generating synthetic Photoplethysmogram (PPG) data using a physical model of the cardiovascular system to improve classifier performance with a combination of synthetic and real data. The physical model is an in-silico cardiac computational model, consisting of a four-chambered heart with electrophysiology, hemodynamic, and blood pressure auto-regulation functionality. Starting with a small number of measured PPG data, the cardiac model is used to synthesize healthy as well as PPG time-series pertaining to coronary artery disease (CAD) by varying pathophysiological parameters. A Variational Autoencoder (VAE) structure is proposed to derive a statistical feature space for CAD classification. Results are presented in two perspectives namely, (i) using artificially reduced real disease data and (ii) using all the real disease data. In both cases, by augmenting with the synthetic data for training, the performance (sensitivity, specificity) of the classifier changes from (i) (0.65, 1) to (1, 0.9) and (ii) (1, 0.95) to (1, 1). The proposed hybrid approach of combining physical modelling and statistical feature space selection generates realistic PPG data with pathophysiological interpretation and can outperform a baseline Generative Adversarial Network (GAN) architecture with a relatively small amount of real data for training. This proposed method could aid as a substitution technique for handling the problem of bulk data required for training machine learning algorithms for cardiac health-care applications.
本文提出了一种新颖的方法,即使用心血管系统的物理模型生成合成光电容积脉搏波图(PPG)数据,以结合合成数据和真实数据来提高分类器性能。该物理模型是一个计算机心脏计算模型,由具有电生理、血流动力学和血压自动调节功能的四腔心脏组成。从少量测量的PPG数据开始,通过改变病理生理参数,利用心脏模型合成健康的以及与冠状动脉疾病(CAD)相关的PPG时间序列。提出了一种变分自编码器(VAE)结构来推导用于CAD分类的统计特征空间。结果从两个角度呈现,即(i)使用人工减少的真实疾病数据和(ii)使用所有真实疾病数据。在这两种情况下,通过用合成数据增强训练,分类器的性能(敏感性、特异性)从(i)(0.65,1)变为(1,0.9),以及从(ii)(1,0.95)变为(1,1)。所提出的结合物理建模和统计特征空间选择的混合方法生成具有病理生理解释的逼真PPG数据,并且在训练所需的真实数据量相对较少的情况下,可以优于基线生成对抗网络(GAN)架构。该方法可以作为一种替代技术,用于处理心脏保健应用中训练机器学习算法所需的大量数据问题。