Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.
Vanderbilt University Medical Center, Department of HealthIT, Nashville, TN, USA.
Int J Med Inform. 2020 May;137:104108. doi: 10.1016/j.ijmedinf.2020.104108. Epub 2020 Mar 6.
Healthcare consumers are increasingly turning to the online health Q&A communities to seek answers for their questions because current general search engines are unable to digest complex health-related questions. Q&A communities are platforms where users ask unstructured questions from different healthcare topics.
This study aimed to provide a concept-based approach to automatically assign health questions to the appropriate domain experts.
We developed three processes for (1) expert profiling, (2) question analysis and (3) similarity calculation and assignment. Semantic weight of concepts combined with TF-IDF weighting comprised vectors of concepts as expert profiles. Subsequently, the similarity between submitted questions and expert profiles was calculated to find a relevant expert.
We randomly selected 345 questions posted by consumers for 38 experts in 13 health topics from NetWellness as input data. Our results showed the precision and recall of our proposed method for the studied topics were between 63 %-92 % and 61 %-100 %, respectively. The calculated F-measure in selected topics was between 62 % (Addiction and Substance Abuse) and 94 % (Eye and Vision Care) with a combined F-measure of 80 %.
Concept-based methods using unified medical language system and natural language processing techniques could automatically assign actual health questions in different topics to the relevant domain experts with good performance metrics.
由于当前的通用搜索引擎无法理解复杂的健康相关问题,越来越多的医疗保健消费者开始转向在线健康问答社区寻求答案。问答社区是用户从不同的医疗保健主题提出非结构化问题的平台。
本研究旨在提供一种基于概念的方法,将健康问题自动分配给合适的领域专家。
我们开发了三个过程,用于(1)专家资料建档、(2)问题分析和(3)相似度计算和分配。概念的语义权重与 TF-IDF 加权相结合,构成了专家资料建档的概念向量。然后,计算提交问题与专家资料建档之间的相似度,以找到相关专家。
我们随机选择了 345 个来自 NetWellness 的消费者提出的问题,这些问题涉及 13 个健康主题的 38 位专家,作为输入数据。我们的研究结果表明,对于所研究的主题,我们提出的方法的准确率和召回率分别在 63%-92%和 61%-100%之间。在选定的主题中,计算出的 F 值在 62%(成瘾和物质滥用)到 94%(眼睛和视力保健)之间,综合 F 值为 80%。
使用统一医学语言系统和自然语言处理技术的基于概念的方法可以将不同主题的实际健康问题自动分配给相关领域的专家,并具有良好的性能指标。