Brisk Rob, Bond Raymond, Finlay Dewar, McLaughlin James, Piadlo Alicja, Leslie Stephen J, Gossman David E, Menown Ian B, McEneaney D J, Warren S
Cardiovascular Research Unit, Craigavon Hospital, 68 Lurgan Road, Portadown BT63 5QQ, UK.
School of Computer Science, Ulster University, Shore Road, Jordanstown BT37 0QB, UK.
Eur Heart J Digit Health. 2021 Feb 20;2(1):127-134. doi: 10.1093/ehjdh/ztab002. eCollection 2021 Mar.
Deep learning (DL) has emerged in recent years as an effective technique in automated ECG analysis.
A retrospective, observational study was designed to assess the feasibility of detecting induced coronary artery occlusion in human subjects earlier than experienced cardiologists using a DL algorithm. A deep convolutional neural network was trained using data from the STAFF III database. The task was to classify ECG samples as showing acute coronary artery occlusion, or no occlusion. Occluded samples were recorded after 60 s of balloon occlusion of a single coronary artery. For the first iteration of the experiment, non-occluded samples were taken from ECGs recorded in a restroom prior to entering theatres. For the second iteration of the experiment, non-occluded samples were taken in the theatre prior to balloon inflation. Results were obtained using a cross-validation approach. In the first iteration of the experiment, the DL model achieved an F1 score of 0.814, which was higher than any of three reviewing cardiologists or STEMI criteria. In the second iteration of the experiment, the DL model achieved an F1 score of 0.533, which is akin to the performance of a random chance classifier.
The dataset was too small for the second model to achieve meaningful performance, despite the use of transfer learning. However, 'data leakage' during the first iteration of the experiment led to falsely high results. This study highlights the risk of DL models leveraging data leaks to produce spurious results.
近年来,深度学习(DL)已成为自动心电图分析中的一种有效技术。
设计了一项回顾性观察研究,以评估使用DL算法比经验丰富的心脏病专家更早检测人类受试者诱导冠状动脉闭塞的可行性。使用来自STAFF III数据库的数据训练深度卷积神经网络。任务是将心电图样本分类为显示急性冠状动脉闭塞或无闭塞。在单个冠状动脉球囊闭塞60秒后记录闭塞样本。在实验的第一次迭代中,非闭塞样本取自进入手术室前在卫生间记录的心电图。在实验的第二次迭代中,非闭塞样本在球囊充气前在手术室采集。使用交叉验证方法获得结果。在实验的第一次迭代中,DL模型的F1分数达到0.814,高于三位参与评审的心脏病专家中的任何一位或STEMI标准。在实验的第二次迭代中,DL模型的F1分数达到0.533,这与随机机会分类器的性能相当。
尽管使用了迁移学习,但第二个模型的数据集太小,无法获得有意义的性能。然而,实验第一次迭代期间的“数据泄露”导致结果虚高。本研究强调了DL模型利用数据泄露产生虚假结果的风险。