Electrical Engineering Department of University of Leuven, Belgium; Connected Health Solutions Group at Imec-Leuven, Belgium.
Electrical Engineering Department of University of Delft, the Netherlands.
Comput Biol Med. 2021 Mar;130:104164. doi: 10.1016/j.compbiomed.2020.104164. Epub 2020 Dec 13.
Long-term electrocardiogram monitoring comes at the expense of signal quality. During unconstrained movements, the electrocardiogram is often corrupted by motion artefacts, which can lead to inaccurate physiological information. In this situation, automated quality assessment methods are useful to increase the reliability of the measurements. A generic machine learning pipeline that generates classification models for electrocardiogram quality assessment is presented in this article. The presented pipeline is tested on signals from varied acquisition sources, towards selecting segments that can be used for heart rate analysis in lifestyle applications.
Electrocardiogram recordings from traditional, wearable and ubiquitous devices, are segmented in 10 s windows and manually labeled by experienced researchers into two quality classes. To capture the electrocardiogram dynamics, a comprehensive set of 43 features is extracted from each segment, based on the time-domain signal, its Fast Fourier Transform, the Autocorrelation function and the Stationary Wavelet Transform. To select the most relevant features for each acquisition source we employ both a customized hybrid approach and the state-of-the-art Neighborhood Component Analysis method and compare them. Support Vector Machines (SVM), Decision Trees, K-Nearest-Neighbors and supervised ensemble methods are tested as possible binary classifiers.
The results for the best performing models on traditional, wearable and ubiquitous electrocardiogram datasets are, respectively: balanced-accuracy: 89%, F1-score: 93% with the Fine Gaussian SVM model and 10 features; balanced-accuracy: 93%, F1-score: 93% with the Fine Gaussian SVM model and 11 features; balanced-accuracy: 95%, F1-score: 86%, with the Fine Gaussian SVM model and 8 features.
According to the results, our generic pipeline can generate classification models tailored to individual acquisition sources, provided that a standard Lead I or Lead II is available. Such models accurately establish whether the electrocardiogram quality is good or bad for heart rate analysis. Furthermore, removing bad quality segments decreases errors in heart rate calculation.
长期的心电图监测会影响信号质量。在不受约束的运动中,心电图经常会受到运动伪影的干扰,从而导致生理信息不准确。在这种情况下,自动化质量评估方法有助于提高测量的可靠性。本文提出了一种通用的机器学习管道,用于生成心电图质量评估的分类模型。该管道针对来自不同采集源的信号进行了测试,旨在选择可用于生活方式应用中心率分析的片段。
传统、可穿戴和无处不在的设备的心电图记录被分段为 10 秒的窗口,并由经验丰富的研究人员手动标记为两个质量等级。为了捕捉心电图动态,从每个片段中提取了一组全面的 43 个特征,基于时域信号、快速傅里叶变换、自相关函数和平稳小波变换。为了为每个采集源选择最相关的特征,我们采用了定制的混合方法和最先进的邻域成分分析方法,并对它们进行了比较。支持向量机 (SVM)、决策树、K-最近邻和监督集成方法被测试为可能的二分类器。
在传统、可穿戴和无处不在的心电图数据集上表现最佳的模型的结果分别为:平衡准确率:89%,Fine Gaussian SVM 模型和 10 个特征的 F1 分数:93%;平衡准确率:93%,Fine Gaussian SVM 模型和 11 个特征的 F1 分数:93%;平衡准确率:95%,Fine Gaussian SVM 模型和 8 个特征的 F1 分数:86%。
根据结果,我们的通用管道可以为各个采集源生成定制的分类模型,前提是有标准的 I 导联或 II 导联心电图。这些模型可以准确地确定心电图质量是否适合心率分析。此外,去除质量较差的片段可以减少心率计算中的错误。