Department of Electrical, Electronic and Computer Engineering, University of Pretoria, Pretoria 0001, South Africa.
Council for Scientific and Industrial Research (CSIR), Pretoria 0184, South Africa.
Sensors (Basel). 2023 Mar 8;23(6):2948. doi: 10.3390/s23062948.
Internet of things (IoT)-enabled wireless body area network (WBAN) is an emerging technology that combines medical devices, wireless devices, and non-medical devices for healthcare management applications. Speech emotion recognition (SER) is an active research field in the healthcare domain and machine learning. It is a technique that can be used to automatically identify speakers' emotions from their speech. However, the SER system, especially in the healthcare domain, is confronted with a few challenges. For example, low prediction accuracy, high computational complexity, delay in real-time prediction, and how to identify appropriate features from speech. Motivated by these research gaps, we proposed an emotion-aware IoT-enabled WBAN system within the healthcare framework where data processing and long-range data transmissions are performed by an edge AI system for real-time prediction of patients' speech emotions as well as to capture the changes in emotions before and after treatment. Additionally, we investigated the effectiveness of different machine learning and deep learning algorithms in terms of performance classification, feature extraction methods, and normalization methods. We developed a hybrid deep learning model, i.e., convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM), and a regularized CNN model. We combined the models with different optimization strategies and regularization techniques to improve the prediction accuracy, reduce generalization error, and reduce the computational complexity of the neural networks in terms of their computational time, power, and space. Different experiments were performed to check the efficiency and effectiveness of the proposed machine learning and deep learning algorithms. The proposed models are compared with a related existing model for evaluation and validation using standard performance metrics such as prediction accuracy, precision, recall, F1 score, confusion matrix, and the differences between the actual and predicted values. The experimental results proved that one of the proposed models outperformed the existing model with an accuracy of about 98%.
物联网 (IoT) 支持的无线体域网 (WBAN) 是一种新兴技术,它将医疗设备、无线设备和非医疗设备结合在一起,用于医疗保健管理应用。语音情感识别 (SER) 是医疗保健领域和机器学习中的一个活跃研究领域。它是一种可以从语音中自动识别说话者情感的技术。然而,SER 系统,特别是在医疗保健领域,面临着一些挑战。例如,预测精度低、计算复杂度高、实时预测延迟以及如何从语音中识别合适的特征。受这些研究差距的启发,我们在医疗保健框架内提出了一种情感感知的物联网支持的 WBAN 系统,其中数据处理和远程数据传输由边缘人工智能系统执行,以便实时预测患者的语音情感,并捕捉治疗前后情绪的变化。此外,我们研究了不同机器学习和深度学习算法在性能分类、特征提取方法和归一化方法方面的有效性。我们开发了一种混合深度学习模型,即卷积神经网络 (CNN) 和双向长短期记忆 (BiLSTM) 以及正则化 CNN 模型。我们将这些模型与不同的优化策略和正则化技术相结合,以提高预测精度、降低泛化误差,并降低神经网络在计算时间、功耗和空间方面的计算复杂度。进行了不同的实验以检查所提出的机器学习和深度学习算法的效率和有效性。使用标准性能指标(如预测精度、精度、召回率、F1 分数、混淆矩阵以及实际值和预测值之间的差异)对所提出的模型进行了与现有相关模型的比较和评估。实验结果证明,所提出的模型之一的准确率约为 98%,优于现有模型。