Bergquist Jake A, Zenger Brian, Brundage James, MacLeod Rob S, Bunch T Jared, Shah Rashmee, Ye Xiangyang, Lyons Ann, Torre Michael, Ranjan Ravi, Tasdizen Tolga, Steinberg Benjamin A
Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, Utah.
Nora Eccles Treadwell Cardiovascular Research and Training Institute, University of Utah, Salt Lake City, Utah.
Heart Rhythm O2. 2024 Jul 17;5(9):644-654. doi: 10.1016/j.hroo.2024.07.009. eCollection 2024 Sep.
Artificial intelligence-machine learning (AI-ML) has demonstrated the ability to extract clinically useful information from electrocardiograms (ECGs) not available using traditional interpretation methods. There exists an extensive body of AI-ML research in fields outside of cardiology including several open-source AI-ML architectures that can be translated to new problems in an "off-the-shelf" manner.
We sought to address the limited investigation of which if any of these off-the-shelf architectures could be useful in ECG analysis as well as how and when these AI-ML approaches fail.
We applied 6 off-the-shelf AI-ML architectures to detect low left ventricular ejection fraction (LVEF) in a cohort of ECGs from 24,868 patients. We assessed LVEF classification and explored patient characteristics associated with inaccurate (false positive or false negative) LVEF prediction.
We found that all of these network architectures produced LVEF detection area under the receiver-operating characteristic curve values above 0.9 (averaged over 5 instances per network), with the ResNet 18 network performing the highest (average area under the receiver-operating characteristic curve of 0.917). We also observed that some patient-specific characteristics such as race, sex, and presence of several comorbidities were associated with lower LVEF prediction performance.
This demonstrates the ability of off-the-shelf AI-ML architectures to detect clinically useful information from ECGs with performance matching contemporary custom-build AI-ML architectures. We also highlighted the presence of possible biases in these AI-ML approaches in the context of patient characteristics. These findings should be considered in the pursuit of efficient and equitable deployment of AI-ML technologies moving forward.
人工智能-机器学习(AI-ML)已展现出从心电图(ECG)中提取传统解读方法无法获取的临床有用信息的能力。在心脏病学以外的领域存在大量AI-ML研究,包括几种可“现成”用于新问题的开源AI-ML架构。
我们试图解决以下有限的研究问题:这些现成的架构中是否有任何一种可用于ECG分析,以及这些AI-ML方法如何以及何时会失败。
我们应用6种现成的AI-ML架构,对来自24868名患者的ECG队列进行左心室射血分数(LVEF)降低的检测。我们评估了LVEF分类,并探索与LVEF预测不准确(假阳性或假阴性)相关的患者特征。
我们发现所有这些网络架构在接受者操作特征曲线下的LVEF检测面积值均高于0.9(每个网络平均5个实例),其中ResNet 18网络表现最佳(接受者操作特征曲线下平均面积为0.917)。我们还观察到一些患者特定特征,如种族、性别和几种合并症的存在,与较低的LVEF预测性能相关。
这证明了现成的AI-ML架构能够从ECG中检测临床有用信息,其性能与当代定制的AI-ML架构相当。我们还强调了在患者特征背景下这些AI-ML方法中可能存在的偏差。在未来追求AI-ML技术的高效和公平部署时,应考虑这些发现。