Biton Shany, Gendelman Sheina, Ribeiro Antônio H, Miana Gabriela, Moreira Carla, Ribeiro Antonio Luiz P, Behar Joachim A
Faculty of Biomedical Engineering, Technion-IIT, Haifa, Israel.
Department of Information Technology, Uppsala University, Uppsala, Sweden.
Eur Heart J Digit Health. 2021 Aug 5;2(4):576-585. doi: 10.1093/ehjdh/ztab071. eCollection 2021 Dec.
This study aims to assess whether information derived from the raw 12-lead electrocardiogram (ECG) combined with clinical information is predictive of atrial fibrillation (AF) development.
We use a subset of the Telehealth Network of Minas Gerais (TNMG) database consisting of patients that had repeated 12-lead ECG measurements between 2010 and 2017 that is 1 130 404 recordings from 415 389 unique patients. Median and interquartile of age for the recordings were 58 (46-69) and 38% of the patients were males. Recordings were assigned to train-validation and test sets in an 80:20% split which was stratified by class, age and gender. A random forest classifier was trained to predict, for a given recording, the risk of AF development within 5 years. We use features obtained from different modalities, namely demographics, clinical information, engineered features, and features from deep representation learning. The best model performance on the test set was obtained for the model combining features from all modalities with an area under the receiver operating characteristic curve (AUROC) = 0.909 against the best single modality model which had an AUROC = 0.839.
Our study has important clinical implications for AF management. It is the first study integrating feature engineering, deep learning, and Electronic medical record system (EMR) metadata to create a risk prediction tool for the management of patients at risk of AF. The best model that includes features from all modalities demonstrates that human knowledge in electrophysiology combined with deep learning outperforms any single modality approach. The high performance obtained suggest that structural changes in the 12-lead ECG are associated with existing or impending AF.
本研究旨在评估从原始12导联心电图(ECG)结合临床信息中获取的信息是否能预测房颤(AF)的发生。
我们使用了米纳斯吉拉斯远程医疗网络(TNMG)数据库的一个子集,该子集包含2010年至2017年间重复进行12导联心电图测量的患者,即来自415389名独特患者的1130404份记录。记录的年龄中位数和四分位数间距分别为58(46 - 69),38%的患者为男性。记录按80:20%的比例分配到训练验证集和测试集,并按类别、年龄和性别进行分层。训练了一个随机森林分类器,以预测给定记录在5年内发生房颤的风险。我们使用了从不同模式获得的特征,即人口统计学、临床信息、工程特征以及深度表征学习的特征。对于结合了所有模式特征的模型,在测试集上获得了最佳性能,其受试者操作特征曲线下面积(AUROC)= 0.909,而最佳单模式模型的AUROC = 0.839。
我们的研究对房颤管理具有重要的临床意义。这是第一项整合特征工程、深度学习和电子病历系统(EMR)元数据以创建房颤风险预测工具来管理房颤风险患者的研究。包含所有模式特征的最佳模型表明,电生理学方面的人类知识与深度学习相结合优于任何单模式方法。所获得的高性能表明12导联心电图的结构变化与现有的或即将发生的房颤相关。