Altibbi, King Hussein Business Park, Amman 11831, Jordan.
Faculty of Information Technology, Philadelphia University, Amman 19392, Jordan.
Sensors (Basel). 2021 May 10;21(9):3279. doi: 10.3390/s21093279.
Maintaining a high quality of conversation between doctors and patients is essential in telehealth services, where efficient and competent communication is important to promote patient health. Assessing the quality of medical conversations is often handled based on a human auditory-perceptual evaluation. Typically, trained experts are needed for such tasks, as they follow systematic evaluation criteria. However, the daily rapid increase of consultations makes the evaluation process inefficient and impractical. This paper investigates the automation of the quality assessment process of patient-doctor voice-based conversations in a telehealth service using a deep-learning-based classification model. For this, the data consist of audio recordings obtained from Altibbi. Altibbi is a digital health platform that provides telemedicine and telehealth services in the Middle East and North Africa (MENA). The objective is to assist Altibbi's operations team in the evaluation of the provided consultations in an automated manner. The proposed model is developed using three sets of features: features extracted from the signal level, the transcript level, and the signal and transcript levels. At the signal level, various statistical and spectral information is calculated to characterize the spectral envelope of the speech recordings. At the transcript level, a pre-trained embedding model is utilized to encompass the semantic and contextual features of the textual information. Additionally, the hybrid of the signal and transcript levels is explored and analyzed. The designed classification model relies on stacked layers of deep neural networks and convolutional neural networks. Evaluation results show that the model achieved a higher level of precision when compared with the manual evaluation approach followed by Altibbi's operations team.
在远程医疗服务中,保持医生和患者之间高质量的对话至关重要,因为高效和胜任的沟通对于促进患者健康非常重要。评估医疗对话的质量通常基于人类听觉感知评估。通常,此类任务需要经过培训的专家,因为他们遵循系统的评估标准。然而,咨询量的日常快速增长使得评估过程效率低下且不切实际。本文研究了使用基于深度学习的分类模型自动评估远程医疗服务中患者-医生基于语音的对话质量的过程。为此,数据包括从 Altibbi 获得的音频记录。Altibbi 是一个提供远程医疗和远程医疗服务的数字健康平台,服务于中东和北非(MENA)地区。其目的是协助 Altibbi 的运营团队以自动化方式评估提供的咨询。所提出的模型是使用三组特征开发的:从信号级、转录本级和信号与转录本级提取的特征。在信号级,计算各种统计和频谱信息以描述语音记录的频谱包络。在转录本级别,使用预先训练的嵌入模型来包含文本信息的语义和上下文特征。此外,还探索和分析了信号和转录本级别的混合。设计的分类模型依赖于堆叠的深度神经网络和卷积神经网络层。评估结果表明,与 Altibbi 运营团队采用的手动评估方法相比,该模型在精度方面达到了更高的水平。