Alexander George, Bahja Mohammed, Butt Gibran Farook
The School of Computer Science, University of Birmingham, Birmingham, United Kingdom.
JMIR Med Inform. 2022 Apr 11;10(4):e29385. doi: 10.2196/29385.
Obtaining patient feedback is an essential mechanism for health care service providers to assess their quality and effectiveness. Unlike assessments of clinical outcomes, feedback from patients offers insights into their lived experiences. The Department of Health and Social Care in England via National Health Service Digital operates a patient feedback web service through which patients can leave feedback of their experiences in structured and free-text report forms. Free-text feedback, compared with structured questionnaires, may be less biased by the feedback collector and, thus, more representative; however, it is harder to analyze in large quantities and challenging to derive meaningful, quantitative outcomes.
The aim of this study is to build a novel data analysis and interactive visualization pipeline accessible through an interactive web application to facilitate the interrogation of and provide unique insights into National Health Service patient feedback.
This study details the development of a text analysis tool that uses contemporary natural language processing and machine learning models to analyze free-text clinical service reviews to develop a robust classification model and interactive visualization web application. The methodology is based on the design science research paradigm and was conducted in three iterations: a sentiment analysis of the patient feedback corpus in the first iteration, topic modeling (unigram and bigram)-based analysis for topic identification in the second iteration, and nested topic modeling in the third iteration that combines sentiment analysis and topic modeling methods. An interactive data visualization web application for use by the general public was then created, presenting the data on a geographic representation of the country, making it easily accessible.
Of the 11,103 possible clinical services that could be reviewed across England, 2030 (18.28%) different services received a combined total of 51,845 reviews between October 1, 2017, and September 30, 2019. Dominant topics were identified for the entire corpus followed by negative- and positive-sentiment topics in turn. Reviews containing high- and low-sentiment topics occurred more frequently than reviews containing less polarized topics. Time-series analysis identified trends in topic and sentiment occurrence frequency across the study period.
Using contemporary natural language processing techniques, unstructured text data were effectively characterized for further analysis and visualization. An efficient pipeline was successfully combined with a web application, making automated analysis and dissemination of large volumes of information accessible. This study represents a significant step in efforts to generate and visualize useful, actionable, and unique information from free-text patient reviews.
获取患者反馈是医疗服务提供者评估其服务质量和效果的重要机制。与临床结果评估不同,患者反馈能让我们深入了解他们的实际就医体验。英国卫生和社会保健部通过国民医疗服务体系数字化部门运营一项患者反馈网络服务,患者可通过该服务以结构化和自由文本报告的形式留下就医体验反馈。与结构化问卷相比,自由文本反馈受反馈收集者的影响可能较小,因而更具代表性;然而,对其进行大量分析较为困难,且难以得出有意义的定量结果。
本研究旨在构建一个新颖的数据分析和交互式可视化流程,通过交互式网络应用程序实现,以促进对国民医疗服务体系患者反馈的查询,并提供独特见解。
本研究详细介绍了一种文本分析工具的开发过程,该工具使用当代自然语言处理和机器学习模型来分析自由文本临床服务评价,以开发一个强大的分类模型和交互式可视化网络应用程序。该方法基于设计科学研究范式,分三个迭代阶段进行:第一阶段对患者反馈语料库进行情感分析,第二阶段基于主题建模(一元语法和二元语法)进行主题识别分析,第三阶段进行嵌套主题建模,将情感分析和主题建模方法相结合。然后创建了一个供公众使用的交互式数据可视化网络应用程序,以英国地理分布图的形式呈现数据,使其易于访问。
在英格兰可被评价的11103项可能的临床服务中,2030项(18.28%)不同服务在2017年10月1日至2019年9月30日期间共收到51845条评价。确定了整个语料库的主要主题,其次依次为负面和正面情感主题。包含高情感和低情感主题的评价比包含极化程度较低主题的评价出现得更频繁。时间序列分析确定了整个研究期间主题和情感出现频率的趋势。
使用当代自然语言处理技术,非结构化文本数据得到有效表征,以便进一步分析和可视化。一个高效的流程成功地与网络应用程序相结合,使得大量信息的自动分析和传播成为可能。本研究代表了从自由文本患者评价中生成并可视化有用、可操作且独特信息的重要一步。