Pahar Madhurananda, Miranda Igor, Diacon Andreas, Niesler Thomas
Department of Electrical and Electronic Engineering, Stellenbosch University, Stellenbosch, 7600 Western Cape South Africa.
Federal University of Recôncavo da Bahia, Cruz das Almas, 44.380-000 Bahia Brazil.
J Signal Process Syst. 2022;94(8):821-835. doi: 10.1007/s11265-022-01748-5. Epub 2022 Mar 19.
We present an automatic non-invasive way of detecting cough events based on both accelerometer and audio signals. The acceleration signals are captured by a smartphone firmly attached to the patient's bed, using its integrated accelerometer. The audio signals are captured simultaneously by the same smartphone using an external microphone. We have compiled a manually-annotated dataset containing such simultaneously-captured acceleration and audio signals for approximately 6000 cough and 68000 non-cough events from 14 adult male patients. Logistic regression (LR), support vector machine (SVM) and multilayer perceptron (MLP) classifiers provide a baseline and are compared with three deep architectures, convolutional neural network (CNN), long short-term memory (LSTM) network, and residual-based architecture (Resnet50) using a leave-one-out cross-validation scheme. We find that it is possible to use either acceleration or audio signals to distinguish between coughing and other activities including sneezing, throat-clearing, and movement on the bed with high accuracy. However, in all cases, the deep neural networks outperform the shallow classifiers by a clear margin and the Resnet50 offers the best performance, achieving an area under the ROC curve (AUC) exceeding 0.98 and 0.99 for acceleration and audio signals respectively. While audio-based classification consistently offers better performance than acceleration-based classification, we observe that the difference is very small for the best systems. Since the acceleration signal requires less processing power, and since the need to record audio is sidestepped and thus privacy is inherently secured, and since the recording device is attached to the bed and not worn, an accelerometer-based highly accurate non-invasive cough detector may represent a more convenient and readily accepted method in long-term cough monitoring.
我们提出了一种基于加速度计和音频信号自动检测咳嗽事件的非侵入性方法。加速度信号由牢固附着在患者床上的智能手机利用其集成加速度计进行采集。音频信号由同一部智能手机使用外部麦克风同时采集。我们已编制了一个人工标注的数据集,其中包含来自14名成年男性患者的约6000次咳嗽和68000次非咳嗽事件的此类同时采集的加速度和音频信号。逻辑回归(LR)、支持向量机(SVM)和多层感知器(MLP)分类器提供了一个基线,并使用留一法交叉验证方案与三种深度架构,即卷积神经网络(CNN)、长短期记忆(LSTM)网络和基于残差的架构(Resnet50)进行比较。我们发现,使用加速度信号或音频信号都能够高精度地区分咳嗽与包括打喷嚏、清嗓子和在床上移动在内的其他活动。然而,在所有情况下,深度神经网络都明显优于浅层分类器,并且Resnet50表现最佳,加速度和音频信号的ROC曲线下面积(AUC)分别超过了0.98和0.99。虽然基于音频的分类始终比基于加速度的分类表现更好,但我们观察到,对于最佳系统而言,差异非常小。由于加速度信号所需的处理能力较低,且无需录制音频,从而固有地保障了隐私,并且由于记录设备附着在床上而非佩戴在身上,基于加速度计的高精度非侵入性咳嗽检测器可能是长期咳嗽监测中一种更方便且更容易被接受的方法。