Al-Mousa Amjed, Baniissa Joud, Hashem Tala, Ibraheem Tala
Computer Engineering Department, Princess Sumaya University for Technology, Amman, Jordan.
Digit Health. 2023 Jul 16;9:20552076231187608. doi: 10.1177/20552076231187608. eCollection 2023 Jan-Dec.
Building an electrocardiogram (ECG) heartbeat classification model is essential for early arrhythmia detection. This research aims to build a reliable model that can classify heartbeats into five heartbeat types: normal beat (N), supraventricular ectopic beat (SVEB), ventricular ectopic beat (VEB), fusion beat (F), and unknown beat (Q), with a focus on enhancing the predictions of the uncommon Q and F heartbeats. The base dataset used is the MIT-BIH SupraVentricular Database, which was used to train and compare the performance of five machine learning models: logistic regression, Random Forest (RF), K-nearest neighbor, linear support vector machine, and linear discriminant analysis. In addition to using the synthetic minority oversampling technique, data extracted from multiple databases for the F and Q classes were combined with the original base dataset. These methods resulted in significant improvement in the recall for the rare F and Q classes when compared to the literature. The RF algorithm produced the best performance with an accuracy of 97% and recall values equal to 97%, 93%, 95%, 95%, and 30% for N, SVEB, VEB, F, and Q, respectively.
构建心电图(ECG)心跳分类模型对于早期心律失常检测至关重要。本研究旨在构建一个可靠的模型,该模型可以将心跳分为五种类型:正常心跳(N)、室上性异位搏动(SVEB)、室性异位搏动(VEB)、融合搏动(F)和未知搏动(Q),重点是提高对不常见的Q和F心跳的预测。所使用的基础数据集是麻省理工学院 - 贝斯以色列女执事医疗中心室上性数据库,该数据集用于训练和比较五种机器学习模型的性能:逻辑回归、随机森林(RF)、K近邻、线性支持向量机和线性判别分析。除了使用合成少数过采样技术外,还将从多个数据库中提取的F和Q类数据与原始基础数据集相结合。与文献相比,这些方法显著提高了罕见F和Q类的召回率。RF算法表现最佳,准确率为97%,N、SVEB、VEB、F和Q的召回值分别为97%、93%、95%、95%和30%。