Javidi Hamed, Mariam Arshiya, Khademi Gholamreza, Zabor Emily C, Zhao Ran, Radivoyevitch Tomas, Rotroff Daniel M
Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA.
Department of Electrical Engineering and Computer Science, Cleveland State University, Cleveland, OH, USA.
NPJ Digit Med. 2022 Jul 27;5(1):106. doi: 10.1038/s41746-022-00651-4.
Deep learning (DL) from electronic health records holds promise for disease prediction, but systematic methods for learning from simulated longitudinal clinical measurements have yet to be reported. We compared nine DL frameworks using simulated body mass index (BMI), glucose, and systolic blood pressure trajectories, independently isolated shape and magnitude changes, and evaluated model performance across various parameters (e.g., irregularity, missingness). Overall, discrimination based on variation in shape was more challenging than magnitude. Time-series forest-convolutional neural networks (TSF-CNN) and Gramian angular field(GAF)-CNN outperformed other approaches (P < 0.05) with overall area-under-the-curve (AUCs) of 0.93 for both models, and 0.92 and 0.89 for variation in magnitude and shape with up to 50% missing data. Furthermore, in a real-world assessment, the TSF-CNN model predicted T2D with AUCs reaching 0.72 using only BMI trajectories. In conclusion, we performed an extensive evaluation of DL approaches and identified robust modeling frameworks for disease prediction based on longitudinal clinical measurements.
从电子健康记录中进行深度学习有望实现疾病预测,但从模拟纵向临床测量中学习的系统方法尚未见报道。我们使用模拟的体重指数(BMI)、血糖和收缩压轨迹比较了9种深度学习框架,独立分离形状和幅度变化,并评估了模型在各种参数(如不规则性、缺失值)下的性能。总体而言,基于形状变化的判别比基于幅度变化的判别更具挑战性。时间序列森林-卷积神经网络(TSF-CNN)和格拉姆角场(GAF)-CNN的表现优于其他方法(P < 0.05),两个模型的总体曲线下面积(AUC)均为0.93,对于幅度变化和形状变化,在缺失数据高达50%的情况下,AUC分别为0.92和0.89。此外,在一项实际评估中,TSF-CNN模型仅使用BMI轨迹预测2型糖尿病,AUC达到0.72。总之,我们对深度学习方法进行了广泛评估,并确定了基于纵向临床测量进行疾病预测的稳健建模框架。