IEEE J Biomed Health Inform. 2024 Apr;28(4):1993-2004. doi: 10.1109/JBHI.2024.3355960. Epub 2024 Apr 4.
Electrocardiogram (ECG) signals frequently encounter diverse types of noise, such as baseline wander (BW), electrode motion (EM) artifacts, muscle artifact (MA), and others. These noises often occur in combination during the actual data acquisition process, resulting in erroneous or perplexing interpretations for cardiologists. To suppress random mixed noise (RMN) in ECG with less distortion, we propose a Transformer-based Convolutional Denoising AutoEncoder model (TCDAE) in this study. The encoder of TCDAE is composed of three stacked gated convolutional layers and a Transformer encoder block with a point-wise multi-head self-attention module. To obtain minimal distortion in both time and frequency domains, we also propose a frequency weighted Huber loss function in training phase to better approximate the original signals. The TCDAE model is trained and tested on the QT Database (QTDB) and MIT-BIH Noise Stress Test Database (NSTDB), with the training data and testing data coming from different records. All the metrics perform the most robust in overall noise and separate noise intervals for RMN removal compared with the baseline methods. We also conduct generalization tests on the Icentia11k database where the TCDAE outperforms the state-of-the-art models, with a 55% reduction of the false positives in R peak detection after denoising. The TCDAE model approximates the short-term and long-term characteristics of ECG signals and has higher stability even under extreme RMN corruption. The memory consumption and inference speed of TCDAE are also feasible for its deployment in clinical applications.
心电图(ECG)信号经常会遇到各种类型的噪声,如基线漂移(BW)、电极运动(EM)伪影、肌肉伪影(MA)等。这些噪声在实际数据采集过程中经常同时出现,导致心脏病专家的错误或困惑的解释。为了在不产生失真的情况下抑制心电图中的随机混合噪声(RMN),我们在本研究中提出了一种基于 Transformer 的卷积去噪自动编码器模型(TCDAE)。TCDAE 的编码器由三个堆叠的门控卷积层和一个具有逐点多头自注意力模块的 Transformer 编码器块组成。为了在时域和频域中都获得最小的失真,我们还在训练阶段提出了一种频域加权 Huber 损失函数,以更好地逼近原始信号。TCDAE 模型在 QT 数据库(QTDB)和麻省理工学院生物医学工程研究所噪声应激测试数据库(NSTDB)上进行了训练和测试,训练数据和测试数据来自不同的记录。与基线方法相比,所有指标在总体噪声和单独的噪声间隔中对 RMN 去除的性能最为稳健。我们还在 Icentia11k 数据库上进行了泛化测试,TCDAE 在去噪后 R 波检测的假阳性率降低了 55%,优于最先进的模型。TCDAE 模型近似 ECG 信号的短期和长期特征,即使在极端 RMN 污染下也具有更高的稳定性。TCDAE 的内存消耗和推理速度也适合在临床应用中部署。