Sehgal Neil K R, Guntuku Sharath Chandra, Southwick Lauren, Merchant Raina M, Agarwal Anish K
Computer and Information Science Department, University of Pennsylvania, Philadelphia.
Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia.
JAMA Netw Open. 2025 Aug 1;8(8):e2524505. doi: 10.1001/jamanetworkopen.2025.24505.
Understanding patient experience is crucial for improving health care delivery. However, language patterns and themes correlated with negative or positive ratings are not well known.
To examine online reviews of US health care facilities, identifying language patterns and themes associated with negative or positive ratings.
DESIGN, SETTING, AND PARTICIPANTS: For this cross-sectional study, all reviews of US health care facilities offering essential health benefits, as defined by the Affordable Care Act, posted on 1 online platform (Yelp.com) under "Health & Medical" from January 1, 2017, to December 31, 2023, were obtained. Reviews are posted voluntarily with ratings (1 star = lowest, 5 stars = highest) and open-ended review narratives regarding patients' care experiences.
The primary outcome was the correlation between n-grams (1- to 3-word sequences) and review ratings (negative: 1 or 2 stars; positive: 4 or 5 stars). Secondary measures included linguistic analysis and topic modeling based on standard machine-learning algorithms. Machine-learning and natural-language processing, including n-gram correlation, linguistic feature analysis, and topic modeling, were applied to determine correlations with review star ratings.
A total of 1 099 901 online reviews from 138 605 facilities were identified over the 7-year study period. Among these, nearly one-half (46.3%) were negative and one-half (50.1%) were positive, with a median (IQR) rating of 4 (1-5) stars. The word "not" was most correlated with negative ratings (r = 0.31; 95% CI, 0.31-0.32), whereas "and" was most correlated with positive ratings (r = 0.35; 95% CI, 0.35-0.36). Among 200 topics, the strongest negative correlations involved payment issues (r = 0.25; 95% CI, 0.25-0.25) and poor treatment (r = 0.24; 95% CI, 0.23-0.24); the strongest positive correlations involved kindness (r = 0.32; 95% CI, 0.32-0.32) and anxiety relief (r = 0.32; 95% CI, 0.32-0.32).
In this cross-sectional analysis, negative patient experiences frequently centered on quality of communication and administrative issues. Negative feedback centered on unmet expectations, whereas positive reviews emphasized supportive staff interactions. Incorporating real-time online-review data into existing quality-improvement frameworks-such as patient experience dashboards or service recovery protocols-could help clinicians, administrators, and policymakers identify emerging concerns, monitor patient sentiment, and tailor interventions that enhance patient-centered care across diverse health care settings.
了解患者体验对于改善医疗服务至关重要。然而,与负面或正面评价相关的语言模式和主题尚不为人所知。
研究美国医疗设施的在线评论,识别与负面或正面评价相关的语言模式和主题。
设计、设置和参与者:在这项横断面研究中,获取了2017年1月1日至2023年12月31日期间在1个在线平台(Yelp.com)的“健康与医疗”板块下发布的、关于提供《平价医疗法案》所定义的基本健康福利的美国医疗设施的所有评论。评论由患者自愿发布,并带有评分(1星 = 最低,5星 = 最高)以及关于患者护理体验的开放式评论叙述。
主要结局是词块(1至3个单词的序列)与评论评分(负面:1或2星;正面:4或5星)之间的相关性。次要测量指标包括基于标准机器学习算法的语言分析和主题建模。应用机器学习和自然语言处理,包括词块相关性、语言特征分析和主题建模,以确定与评论星级评分的相关性。
在7年的研究期间,共识别出来自138605家设施的1099901条在线评论。其中,近一半(46.3%)为负面评论,一半(50.1%)为正面评论,中位数(四分位间距)评分为4(1至5)星。“not”一词与负面评价的相关性最高(r = 0.31;95%置信区间,0.31至0.32),而“and”与正面评价的相关性最高(r = 0.35;95%置信区间,0.35至0.36)。在200个主题中,最强的负相关涉及支付问题(r = 0.25;95%置信区间,0.25至0.25)和治疗不佳(r = 0.24;95%置信区间,0.23至0.24);最强的正相关涉及友善(r = 0.32;95%置信区间,0.32至0.32)和焦虑缓解(r = 0.32;95%置信区间,0.32至0.32)。
在这项横断面分析中,负面患者体验频繁集中在沟通质量和管理问题上。负面反馈集中在未满足的期望上,而正面评论强调支持性的员工互动。将实时在线评论数据纳入现有的质量改进框架,如患者体验仪表盘或服务恢复协议,可帮助临床医生、管理人员和政策制定者识别新出现的问题、监测患者情绪,并制定干预措施,以在不同的医疗环境中加强以患者为中心的护理。