IT Department, Leiden University Medical Center, Albinusdreef 2, Postbus 9600, Postzone D-01-P, 2300 RC, Leiden, The Netherlands.
Department of Surgery, Leiden University Medical Center, Leiden, The Netherlands.
BMC Med Inform Decis Mak. 2020 May 27;20(1):97. doi: 10.1186/s12911-020-1104-5.
Patient experience surveys often include free-text responses. Analysis of these responses is time-consuming and often underutilized. This study examined whether Natural Language Processing (NLP) techniques could provide a data-driven, hospital-independent solution to indicate points for quality improvement.
This retrospective study used routinely collected patient experience data from two hospitals. A data-driven NLP approach was used. Free-text responses were categorized into topics, subtopics (i.e. n-grams) and labelled with a sentiment score. The indicator 'impact', combining sentiment and frequency, was calculated to reveal topics to improve, monitor or celebrate. The topic modelling architecture was tested on data from a second hospital to examine whether the architecture is transferable to another hospital.
A total of 38,664 survey responses from the first hospital resulted in 127 topics and 294 n-grams. The indicator 'impact' revealed n-grams to celebrate (15.3%), improve (8.8%), and monitor (16.7%). For hospital 2, a similar percentage of free-text responses could be labelled with a topic and n-grams. Between-hospitals, most topics (69.7%) were similar, but 32.2% of topics for hospital 1 and 29.0% of topics for hospital 2 were unique.
In both hospitals, NLP techniques could be used to categorize patient experience free-text responses into topics, sentiment labels and to define priorities for improvement. The model's architecture was shown to be hospital-specific as it was able to discover new topics for the second hospital. These methods should be considered for future patient experience analyses to make better use of this valuable source of information.
患者体验调查通常包括自由文本回复。分析这些回复既耗时又经常未充分利用。本研究探讨了自然语言处理 (NLP) 技术是否可以提供一种数据驱动、医院独立的解决方案,以指出质量改进的要点。
本回顾性研究使用来自两家医院的常规收集的患者体验数据。采用数据驱动的 NLP 方法。将自由文本回复分为主题、子主题(即 n-gram)并标记为情绪得分。计算“影响”指标(将情绪和频率结合起来),以揭示需要改进、监测或庆祝的主题。该主题建模架构在第二家医院的数据上进行了测试,以检验该架构是否可以转移到另一家医院。
来自第一家医院的 38664 份调查回复共产生了 127 个主题和 294 个 n-grams。“影响”指标揭示了值得庆祝的 n-grams(15.3%)、需要改进的 n-grams(8.8%)和需要监测的 n-grams(16.7%)。对于医院 2,类似比例的自由文本回复可以标记为主题和 n-grams。在两家医院之间,大多数主题(69.7%)是相似的,但医院 1 的 32.2%的主题和医院 2 的 29.0%的主题是独特的。
在两家医院中,NLP 技术都可以用于将患者体验的自由文本回复分类为主题、情绪标签,并确定改进的优先级。该模型的架构被证明是医院特有的,因为它能够为第二家医院发现新的主题。这些方法应该在未来的患者体验分析中考虑,以更好地利用这一宝贵的信息来源。