Medical School of Chinese PLA, Beijing, China.
Department of Hyperbaric Oxygen, The First Medical Center, Chinese PLA General Hospital, Beijing, China.
JMIR Mhealth Uhealth. 2021 Aug 12;9(8):e25415. doi: 10.2196/25415.
With the development and promotion of wearable devices and their mobile health (mHealth) apps, physiological signals have become a research hotspot. However, noise is complex in signals obtained from daily lives, making it difficult to analyze the signals automatically and resulting in a high false alarm rate. At present, screening out the high-quality segments of the signals from huge-volume data with few labels remains a problem. Signal quality assessment (SQA) is essential and is able to advance the valuable information mining of signals.
The aims of this study were to design an SQA algorithm based on the unsupervised isolation forest model to classify the signal quality into 3 grades: good, acceptable, and unacceptable; validate the algorithm on labeled data sets; and apply the algorithm on real-world data to evaluate its efficacy.
Data used in this study were collected by a wearable device (SensEcho) from healthy individuals and patients. The observation windows for electrocardiogram (ECG) and respiratory signals were 10 and 30 seconds, respectively. In the experimental procedure, the unlabeled training set was used to train the models. The validation and test sets were labeled according to preset criteria and used to evaluate the classification performance quantitatively. The validation set consisted of 3460 and 2086 windows of ECG and respiratory signals, respectively, whereas the test set was made up of 4686 and 3341 windows of signals, respectively. The algorithm was also compared with self-organizing maps (SOMs) and 4 classic supervised models (logistic regression, random forest, support vector machine, and extreme gradient boosting). One case validation was illustrated to show the application effect. The algorithm was then applied to 1144 cases of ECG signals collected from patients and the detected arrhythmia false alarms were calculated.
The quantitative results showed that the ECG SQA model achieved 94.97% and 95.58% accuracy on the validation and test sets, respectively, whereas the respiratory SQA model achieved 81.06% and 86.20% accuracy on the validation and test sets, respectively. The algorithm was superior to SOM and achieved moderate performance when compared with the supervised models. The example case showed that the algorithm was able to correctly classify the signal quality even when there were complex pathological changes in the signals. The algorithm application results indicated that some specific types of arrhythmia false alarms such as tachycardia, atrial premature beat, and ventricular premature beat could be significantly reduced with the help of the algorithm.
This study verified the feasibility of applying the anomaly detection unsupervised model to SQA. The application scenarios include reducing the false alarm rate of the device and selecting signal segments that can be used for further research.
随着可穿戴设备及其移动健康(mHealth)应用的发展和推广,生理信号已成为研究热点。然而,日常生活中获得的信号噪声复杂,使得信号自动分析变得困难,从而导致高误报率。目前,从大量数据中筛选出具有少量标签的高质量信号段仍然是一个问题。信号质量评估(SQA)是必不可少的,它能够促进信号的有价值信息挖掘。
本研究旨在设计一种基于无监督隔离森林模型的 SQA 算法,将信号质量分为 3 个等级:良好、可接受和不可接受;在有标签数据集上验证算法;并将算法应用于真实数据,以评估其效果。
本研究使用可穿戴设备(SensEcho)从健康个体和患者中收集数据。心电图(ECG)和呼吸信号的观察窗口分别为 10 秒和 30 秒。在实验过程中,使用未标记的训练集对模型进行训练。验证集和测试集根据预设标准进行标记,并用于定量评估分类性能。验证集包含 3460 个 ECG 信号窗口和 2086 个呼吸信号窗口,测试集包含 4686 个 ECG 信号窗口和 3341 个呼吸信号窗口。该算法还与自组织映射(SOM)和 4 种经典监督模型(逻辑回归、随机森林、支持向量机和极端梯度提升)进行了比较。通过一个案例验证说明了应用效果。然后将该算法应用于从患者中采集的 1144 例 ECG 信号,并计算检测到的心律失常误报。
定量结果表明,ECG SQA 模型在验证集和测试集上的准确率分别为 94.97%和 95.58%,而呼吸 SQA 模型在验证集和测试集上的准确率分别为 81.06%和 86.20%。该算法优于 SOM,与监督模型相比具有中等性能。案例表明,即使信号中存在复杂的病理变化,该算法也能够正确地对信号质量进行分类。算法应用结果表明,借助该算法,可以显著减少某些特定类型的心律失常误报,如心动过速、房性早搏和室性早搏。
本研究验证了将异常检测无监督模型应用于 SQA 的可行性。应用场景包括降低设备的误报率和选择可用于进一步研究的信号段。