Polish Telemedicine and eHealth Society, 03-728 Warsaw, Poland.
Multimedia Department, Polish-Japanese Academy of Information Technology, 02-008 Warsaw, Poland.
Int J Environ Res Public Health. 2022 Aug 17;19(16):10182. doi: 10.3390/ijerph191610182.
Evaluating patients’ experience and satisfaction often calls for analyses of free-text data. Language and domain-specific information extraction can reduce costly manual preprocessing and enable the analysis of extensive collections of experience-based narratives. The research aims were to (1) elicit free-text narratives about experiences with health services of international students in Poland, (2) develop domain- and language-specific algorithms for the extraction of information relevant for the evaluation of quality and safety of health services, and (3) test the performance of information extraction algorithms’ on questions about the patients’ experiences with health services. The materials were free-text narratives about health clinic encounters produced by English-speaking foreigners recalling their experiences (n = 104) in healthcare facilities in Poland. A linguistic analysis of the text collection led to constructing a semantic−syntactic lexicon and a set of lexical-syntactic frames. These were further used to develop rule-based information extraction algorithms in the form of Python scripts. The extraction algorithms generated text classifications according to predefined queries. In addition, the narratives were classified by human readers. The algorithm-based and the human readers’ classifications were highly correlated and significant (p < 0.01), indicating an excellent performance of the automatic query algorithms. The study results demonstrate that domain-specific and language-specific information extraction from free-text narratives can be used as an efficient and low-cost method for evaluating patient experiences and satisfaction with health services and built into software solutions for the quality evaluation in health care.
评估患者的体验和满意度通常需要对自由文本数据进行分析。语言和领域特定的信息提取可以减少昂贵的手动预处理,并能够分析大量基于经验的叙述。研究目的是:(1)引出关于波兰国际学生医疗服务体验的自由文本叙述,(2)开发针对健康服务质量和安全评估的相关信息的领域和语言特定算法,以及(3)测试有关患者健康服务体验问题的信息提取算法的性能。材料是由讲英语的外国人撰写的关于医疗诊所遭遇的自由文本叙述,这些外国人回忆了他们在波兰医疗机构的经历(n = 104)。对文本集的语言分析导致构建了一个语义 - 句法词汇和一套词汇 - 句法框架。这些框架进一步用于以 Python 脚本的形式开发基于规则的信息提取算法。提取算法根据预定义的查询生成文本分类。此外,叙述还被人工读者分类。基于算法的分类和人工读者的分类高度相关且具有统计学意义(p < 0.01),表明自动查询算法性能出色。研究结果表明,从自由文本叙述中提取领域特定和语言特定的信息可以作为评估患者对健康服务的体验和满意度的高效且低成本的方法,并纳入医疗保健质量评估的软件解决方案中。