Jahanshahi Hadi, Kazmi Syed, Cevik Mucahit
Data Science Lab, Ryerson University, 44 Gerrard St E, Toronto, M5B 1G3 Ontario Canada.
J Healthc Inform Res. 2022 Jul 15;6(3):344-374. doi: 10.1007/s41666-022-00118-x. eCollection 2022 Sep.
Telehealth helps to facilitate access to medical professionals by enabling remote medical services for the patients. These services have become gradually popular over the years with the advent of necessary technological infrastructure. The benefits of telehealth have been even more apparent since the beginning of the COVID-19 crisis, as people have become less inclined to visit doctors in person during the pandemic. In this paper, we focus on facilitating chat sessions between a doctor and a patient. We note that the quality and efficiency of the chat experience can be critical as the demand for telehealth services increases. Accordingly, we develop a smart auto-response generation mechanism for medical conversations that helps doctors respond to consultation requests efficiently, particularly during busy sessions. We explore over 900,000 anonymous, historical online messages between doctors and patients collected over 9 months. We implement clustering algorithms to identify the most frequent responses by doctors and manually label the data accordingly. We then train machine learning algorithms using this preprocessed data to generate the responses. The considered algorithm has two steps: a filtering (i.e., triggering) model to filter out infeasible patient messages and a response generator to suggest the top-3 doctor responses for the ones that successfully pass the triggering phase. Among the models utilized, BERT provides an accuracy of 85.41% for precision@3 and shows robustness to its parameters.
远程医疗通过为患者提供远程医疗服务,有助于促进患者获得医疗专业人员的服务。随着必要技术基础设施的出现,这些服务在过去几年中逐渐流行起来。自新冠疫情危机开始以来,远程医疗的好处更加明显,因为在疫情期间人们不太愿意亲自去看医生。在本文中,我们专注于促进医生和患者之间的聊天会话。我们注意到,随着远程医疗服务需求的增加,聊天体验的质量和效率可能至关重要。因此,我们开发了一种用于医疗对话的智能自动回复生成机制,以帮助医生有效地回复咨询请求,特别是在繁忙时段。我们研究了在9个月内收集的90多万条医生和患者之间匿名历史在线消息。我们实施聚类算法来识别医生最常给出的回复,并据此手动标记数据。然后,我们使用这些预处理数据训练机器学习算法来生成回复。所考虑的算法有两个步骤:一个过滤(即触发)模型,用于过滤掉不可行的患者消息;一个回复生成器,为成功通过触发阶段的消息建议医生的前三条回复。在所使用的模型中,BERT在precision@3方面的准确率为85.41%,并且对其参数具有鲁棒性。