Suppr超能文献

基于机器学习和词典的方法开发患者满意度分析系统。

Development of a patients' satisfaction analysis system using machine learning and lexicon-based methods.

机构信息

Cardiovascular Nursing Research Center, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran.

Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran.

出版信息

BMC Health Serv Res. 2023 Mar 23;23(1):280. doi: 10.1186/s12913-023-09260-7.

Abstract

BACKGROUND

Patients' rights are integral to medical ethics. This study aimed to perform sentiment analysis and opinion mining on patients' messages by a combination of lexicon-based and machine learning methods to identify positive or negative comments and to determine the different ward and staff names mentioned in patients' messages.

METHODS

The level of satisfaction and observance of the rights of 250 service recipients of the hospital was evaluated through the related checklists by the evaluator. In total, 822 Persian messages, composed of 540 negative and 282 positive comments, were collected and labeled by the evaluator. Pre-processing was performed on the messages and followed by 2 feature vectors which were extracted from the messages, including the term frequency-inverse document frequency (TFIDF) vector and a combination of the multifeature (MF) (a lexicon-based method) and TFIDF (MF + TFIDF) vectors. Six feature selectors and 5 classifiers were used in this study. For the evaluations, 5-fold cross-validation with different metrics including area under the receiver operating characteristic curve (AUC), accuracy (ACC), F1 score, sensitivity (SEN), specificity (SPE) and Precision-Recall Curves (PRC) were reported. Message tag detection, which featured different hospital wards and identified staff names mentioned in the study patients' messages, was implemented by the lexicon-based method.

RESULTS

The best classifier was Multinomial Naïve Bayes in combination with MF + TFIDF feature vector and SelectFromModel (SFM) feature selection (ACC = 0.89 ± 0.03, AUC = 0.87 ± 0.03, F1 = 0.92 ± 0.03, SEN = 0.93 ± 0.04, and SPE = 0.82 ± 0.02, PRC-AUC = 0.97). Two methods of assessment by the evaluator and artificial intelligence as well as survey systems were compared.

CONCLUSION

Our results demonstrated that the lexicon-based method, in combination with machine learning classifiers, could extract sentiments in patients' comments and classify them into positive and negative categories. We also developed an online survey system to analyze patients' satisfaction in different wards and to remove conventional assessments by the evaluator.

摘要

背景

患者权利是医学伦理的重要组成部分。本研究旨在通过基于词典和机器学习的方法对患者信息进行情感分析和意见挖掘,以识别积极或消极的评论,并确定患者信息中提到的不同病房和员工姓名。

方法

通过评估员对医院 250 名服务接受者进行相关检查表评估,评估患者的满意度和对权利的遵守情况。共收集并标记了 822 条波斯语消息,其中 540 条为负面评论,282 条为正面评论。对消息进行预处理,然后提取 2 个特征向量,包括词频-逆文档频率(TFIDF)向量和多特征(MF)(基于词典的方法)和 TFIDF(MF+TFIDF)向量的组合。本研究使用了 6 个特征选择器和 5 个分类器。评估使用了不同的指标,包括接收器操作特征曲线(ROC)下的面积(AUC)、准确性(ACC)、F1 得分、敏感性(SEN)、特异性(SPE)和精度-召回曲线(PRC),进行了 5 折交叉验证。通过基于词典的方法实现了消息标签检测,可检测研究患者消息中提到的不同医院病房和员工姓名。

结果

最好的分类器是多类朴素贝叶斯与 MF+TFIDF 特征向量和 SelectFromModel(SFM)特征选择相结合(ACC=0.89±0.03,AUC=0.87±0.03,F1=0.92±0.03,SEN=0.93±0.04,SPE=0.82±0.02,PRC-AUC=0.97)。评估员、人工智能和调查系统的两种评估方法进行了比较。

结论

我们的结果表明,基于词典的方法与机器学习分类器相结合,可以提取患者评论中的情绪,并将其分类为积极和消极两类。我们还开发了一个在线调查系统,以分析不同病房患者的满意度,并消除评估员的常规评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bec9/10037842/4269b915ac57/12913_2023_9260_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验