Cuenca-Zaldívar Juan Nicolás, Torrente-Regidor Maria, Martín-Losada Laura, Fernández-De-Las-Peñas César, Florencio Lidiane Lima, Sousa Pedro Alexandre, Palacios-Ceña Domingo
Research Group in Nursing and Health Care, Puerta de Hierro Health Research Institute - Segovia de Arana, Majadahonda, Spain.
Functional Recovery Unit, Guadarrama Hospital, Guadarrama, Spain.
JMIR Med Inform. 2022 May 12;10(5):e38308. doi: 10.2196/38308.
The COVID-19 pandemic has changed the usual working of many hospitalization units (or wards). Few studies have used electronic nursing clinical notes (ENCN) and their unstructured text to identify alterations in patients' feelings and therapeutic procedures of interest.
This study aimed to analyze positive or negative sentiments through inspection of the free text of the ENCN, compare sentiments of ENCN with or without hospitalized patients with COVID-19, carry out temporal analysis of the sentiments of the patients during the start of the first wave of the COVID-19 pandemic, and identify the topics in ENCN.
This is a descriptive study with analysis of the text content of ENCN. All ENCNs between January and June 2020 at Guadarrama Hospital (Madrid, Spain) extracted from the CGM Selene Electronic Health Records System were included. Two groups of ENCNs were analyzed: one from hospitalized patients in post-intensive care units for COVID-19 and a second group from hospitalized patients without COVID-19. A sentiment analysis was performed on the lemmatized text, using the National Research Council of Canada, Affin, and Bing dictionaries. A polarity analysis of the sentences was performed using the Bing dictionary, SO Dictionaries V1.11, and Spa dictionary as amplifiers and decrementators. Machine learning techniques were applied to evaluate the presence of significant differences in the ENCN in groups of patients with and those without COVID-19. Finally, a structural analysis of thematic models was performed to study the abstract topics that occur in the ENCN, using Latent Dirichlet Allocation topic modeling.
A total of 37,564 electronic health records were analyzed. Sentiment analysis in ENCN showed that patients with subacute COVID-19 have a higher proportion of positive sentiments than those without COVID-19. Also, there are significant differences in polarity between both groups (Z=5.532, P<.001) with a polarity of 0.108 (SD 0.299) in patients with COVID-19 versus that of 0.09 (SD 0.301) in those without COVID-19. Machine learning modeling reported that despite all models presenting high values, it is the neural network that presents the best indicators (>0.8) and with significant P values between both groups. Through Structural Topic Modeling analysis, the final model containing 10 topics was selected. High correlations were noted among topics 2, 5, and 8 (pressure ulcer and pharmacotherapy treatment), topics 1, 4, 7, and 9 (incidences related to fever and well-being state, and baseline oxygen saturation) and topics 3 and 10 (blood glucose level and pain).
The ENCN may help in the development and implementation of more effective programs, which allows patients with COVID-19 to adopt to their prepandemic lifestyle faster. Topic modeling could help identify specific clinical problems in patients and better target the care they receive.
新型冠状病毒肺炎(COVID-19)大流行改变了许多住院科室(或病房)的常规工作方式。很少有研究利用电子护理临床记录(ENCN)及其非结构化文本识别患者感受和相关治疗程序的变化。
本研究旨在通过检查ENCN的自由文本分析积极或消极情绪,比较有或没有COVID-19住院患者的ENCN情绪,在COVID-19大流行第一波开始时对患者情绪进行时间分析,并识别ENCN中的主题。
这是一项对ENCN文本内容进行分析的描述性研究。纳入了2020年1月至6月在瓜达拉马医院(西班牙马德里)从CGM Selene电子健康记录系统中提取的所有ENCN。分析了两组ENCN:一组来自COVID-19重症监护病房后的住院患者,另一组来自没有COVID-19的住院患者。使用加拿大国家研究委员会、Affin和Bing词典对词形还原后的文本进行情感分析。使用Bing词典、SO词典V1.11和Spa词典作为增强器和衰减器对句子进行极性分析。应用机器学习技术评估有和没有COVID-19的患者组中ENCN是否存在显著差异。最后,使用潜在狄利克雷分配主题建模对主题模型进行结构分析,以研究ENCN中出现的抽象主题。
共分析了37564份电子健康记录。ENCN中的情感分析表明,亚急性COVID-19患者的积极情绪比例高于没有COVID-19的患者。此外,两组之间的极性存在显著差异(Z=5.532,P<0.001),COVID-19患者的极性为0.108(标准差0.299),而没有COVID-19的患者极性为0.09(标准差0.301)。机器学习建模报告称,尽管所有模型的值都很高,但神经网络呈现出最佳指标(>0.8),且两组之间的P值具有显著性。通过结构主题建模分析,选择了包含10个主题的最终模型。注意到主题2、5和8(压疮和药物治疗)、主题1、4、7和9(与发热和健康状态以及基线血氧饱和度相关的发病率)以及主题3和10(血糖水平和疼痛)之间存在高度相关性。
ENCN可能有助于制定和实施更有效的方案,使COVID-19患者更快地适应大流行前的生活方式。主题建模有助于识别患者的特定临床问题,并更好地针对他们接受的护理。