• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探究不同糖尿病患者电子健康记录中的语言差异:自然语言处理分析

Examining Linguistic Differences in Electronic Health Records for Diverse Patients With Diabetes: Natural Language Processing Analysis.

作者信息

Bilotta Isabel, Tonidandel Scott, Liaw Winston R, King Eden, Carvajal Diana N, Taylor Ayana, Thamby Julie, Xiang Yang, Tao Cui, Hansen Michael

机构信息

Deutser, Houston, TX, United States.

Belk College of Business, University of North Carolina at Charlotte, Charlotte, NC, United States.

出版信息

JMIR Med Inform. 2024 May 23;12:e50428. doi: 10.2196/50428.

DOI:10.2196/50428
PMID:38787295
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11137426/
Abstract

BACKGROUND

Individuals from minoritized racial and ethnic backgrounds experience pernicious and pervasive health disparities that have emerged, in part, from clinician bias.

OBJECTIVE

We used a natural language processing approach to examine whether linguistic markers in electronic health record (EHR) notes differ based on the race and ethnicity of the patient. To validate this methodological approach, we also assessed the extent to which clinicians perceive linguistic markers to be indicative of bias.

METHODS

In this cross-sectional study, we extracted EHR notes for patients who were aged 18 years or older; had more than 5 years of diabetes diagnosis codes; and received care between 2006 and 2014 from family physicians, general internists, or endocrinologists practicing in an urban, academic network of clinics. The race and ethnicity of patients were defined as White non-Hispanic, Black non-Hispanic, or Hispanic or Latino. We hypothesized that Sentiment Analysis and Social Cognition Engine (SEANCE) components (ie, negative adjectives, positive adjectives, joy words, fear and disgust words, politics words, respect words, trust verbs, and well-being words) and mean word count would be indicators of bias if racial differences emerged. We performed linear mixed effects analyses to examine the relationship between the outcomes of interest (the SEANCE components and word count) and patient race and ethnicity, controlling for patient age. To validate this approach, we asked clinicians to indicate the extent to which they thought variation in the use of SEANCE language domains for different racial and ethnic groups was reflective of bias in EHR notes.

RESULTS

We examined EHR notes (n=12,905) of Black non-Hispanic, White non-Hispanic, and Hispanic or Latino patients (n=1562), who were seen by 281 physicians. A total of 27 clinicians participated in the validation study. In terms of bias, participants rated negative adjectives as 8.63 (SD 2.06), fear and disgust words as 8.11 (SD 2.15), and positive adjectives as 7.93 (SD 2.46) on a scale of 1 to 10, with 10 being extremely indicative of bias. Notes for Black non-Hispanic patients contained significantly more negative adjectives (coefficient 0.07, SE 0.02) and significantly more fear and disgust words (coefficient 0.007, SE 0.002) than those for White non-Hispanic patients. The notes for Hispanic or Latino patients included significantly fewer positive adjectives (coefficient -0.02, SE 0.007), trust verbs (coefficient -0.009, SE 0.004), and joy words (coefficient -0.03, SE 0.01) than those for White non-Hispanic patients.

CONCLUSIONS

This approach may enable physicians and researchers to identify and mitigate bias in medical interactions, with the goal of reducing health disparities stemming from bias.

摘要

背景

来自少数族裔和种族背景的个体经历着有害且普遍存在的健康差距,部分原因是临床医生的偏见。

目的

我们采用自然语言处理方法,研究电子健康记录(EHR)笔记中的语言标记是否因患者的种族和民族而异。为验证这种方法,我们还评估了临床医生认为语言标记可表明偏见的程度。

方法

在这项横断面研究中,我们提取了年龄在18岁及以上、有超过5年糖尿病诊断代码且在2006年至2014年期间在城市学术诊所网络中执业的家庭医生、普通内科医生或内分泌科医生处接受治疗的患者的EHR笔记。患者的种族和民族被定义为非西班牙裔白人、非西班牙裔黑人或西班牙裔或拉丁裔。我们假设,如果出现种族差异,情感分析和社会认知引擎(SEANCE)组件(即负面形容词、正面形容词、喜悦词汇、恐惧和厌恶词汇、政治词汇、尊重词汇、信任动词和幸福词汇)以及平均单词数将是偏见的指标。我们进行了线性混合效应分析,以检查感兴趣的结果(SEANCE组件和单词数)与患者种族和民族之间的关系,并控制患者年龄。为验证这种方法,我们要求临床医生指出他们认为不同种族和民族群体在使用SEANCE语言领域方面的差异在多大程度上反映了EHR笔记中的偏见。

结果

我们检查了281名医生诊治的非西班牙裔黑人、非西班牙裔白人以及西班牙裔或拉丁裔患者(n = 1562)的EHR笔记(n = 12905)。共有27名临床医生参与了验证研究。在偏见方面,参与者在1至10的量表上对负面形容词的评分是8.63(标准差2.06),对恐惧和厌恶词汇的评分是8.11(标准差2.15),对正面形容词的评分是7.93(标准差2.46),10表示极能表明偏见。与非西班牙裔白人患者的笔记相比,非西班牙裔黑人患者的笔记包含显著更多的负面形容词(系数0.07,标准误0.02)和显著更多的恐惧和厌恶词汇(系数0.007,标准误0.002)。与非西班牙裔白人患者的笔记相比,西班牙裔或拉丁裔患者的笔记包含显著更少的正面形容词(系数 -0.02,标准误0.007)、信任动词(系数 -0.009,标准误0.004)和喜悦词汇(系数 -0.03,标准误0.01)。

结论

这种方法可能使医生和研究人员能够识别并减轻医疗互动中的偏见,以减少因偏见导致的健康差距。

相似文献

1
Examining Linguistic Differences in Electronic Health Records for Diverse Patients With Diabetes: Natural Language Processing Analysis.探究不同糖尿病患者电子健康记录中的语言差异:自然语言处理分析
JMIR Med Inform. 2024 May 23;12:e50428. doi: 10.2196/50428.
2
Race and Ethnicity and Clinician Linguistic Expressions of Doubt in Hospital Admission Notes.种族和民族与住院病历中临床医生怀疑态度的表达。
JAMA Netw Open. 2024 Oct 1;7(10):e2438550. doi: 10.1001/jamanetworkopen.2024.38550.
3
Examination of Stigmatizing Language in the Electronic Health Record.电子健康记录中的污名化语言研究。
JAMA Netw Open. 2022 Jan 4;5(1):e2144967. doi: 10.1001/jamanetworkopen.2021.44967.
4
Toward representative genomic research: the children's rare disease cohorts experience.迈向具有代表性的基因组研究:儿童罕见病队列研究经验
Ther Adv Rare Dis. 2023 Aug 22;4:26330040231181406. doi: 10.1177/26330040231181406. eCollection 2023 Jan-Dec.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
7
Association of Disparities in Family History and Family Cancer History in the Electronic Health Record With Sex, Race, Hispanic or Latino Ethnicity, and Language Preference in 2 Large US Health Care Systems.电子健康记录中家族病史和家族癌症病史的差异与性别、种族、西班牙裔或拉丁裔以及在 2 个大型美国医疗保健系统中的语言偏好的关联。
JAMA Netw Open. 2022 Oct 3;5(10):e2234574. doi: 10.1001/jamanetworkopen.2022.34574.
8
Racial/Ethnic Differences in Psychiatric and Medical Diagnoses Among Autistic Adults.成年自闭症患者精神科和医学诊断中的种族/民族差异。
Autism Adulthood. 2022 Dec 1;4(4):290-305. doi: 10.1089/aut.2021.0083. Epub 2022 Dec 13.
9
Underserved populations with missing race ethnicity data differ significantly from those with structured race/ethnicity documentation.服务不足的人群中缺失种族民族数据与那些有结构化种族/民族文档记录的人群有显著差异。
J Am Med Inform Assoc. 2019 Aug 1;26(8-9):722-729. doi: 10.1093/jamia/ocz040.
10
Racial and Ethnic Bias in Risk Prediction Models for Colorectal Cancer Recurrence When Race and Ethnicity Are Omitted as Predictors.当种族和民族被排除在预测因素之外时,结直肠癌复发风险预测模型中的种族和民族偏见。
JAMA Netw Open. 2023 Jun 1;6(6):e2318495. doi: 10.1001/jamanetworkopen.2023.18495.

引用本文的文献

1
On the use of natural language processing to implement the target trial framework using unstructured data from the electronic health record.关于使用自然语言处理技术,利用电子健康记录中的非结构化数据来实施目标试验框架。
Glob Epidemiol. 2025 May 8;9:100204. doi: 10.1016/j.gloepi.2025.100204. eCollection 2025 Jun.
2
Stigmatizing and Positive Language in Birth Clinical Notes Associated With Race and Ethnicity.出生临床记录中与种族和族裔相关的污名化语言和积极语言
JAMA Netw Open. 2025 May 1;8(5):e259599. doi: 10.1001/jamanetworkopen.2025.9599.
3
Identifying stigmatizing and positive/preferred language in obstetric clinical notes using natural language processing.使用自然语言处理识别产科临床记录中的污名化语言以及积极/偏好性语言。
J Am Med Inform Assoc. 2025 Feb 1;32(2):308-317. doi: 10.1093/jamia/ocae290.

本文引用的文献

1
A qualitative analysis of stigmatizing language in birth admission clinical notes.出生入院临床记录中污名化语言的定性分析
Nurs Inq. 2023 Jul;30(3):e12557. doi: 10.1111/nin.12557. Epub 2023 Apr 18.
2
Examination of Stigmatizing Language in the Electronic Health Record.电子健康记录中的污名化语言研究。
JAMA Netw Open. 2022 Jan 4;5(1):e2144967. doi: 10.1001/jamanetworkopen.2021.44967.
3
Negative Patient Descriptors: Documenting Racial Bias In The Electronic Health Record.负面患者描述:电子健康记录中的种族偏见问题。
Health Aff (Millwood). 2022 Feb;41(2):203-211. doi: 10.1377/hlthaff.2021.01423. Epub 2022 Jan 19.
4
Physician Use of Stigmatizing Language in Patient Medical Records.医生在患者病历中使用污名化语言的情况。
JAMA Netw Open. 2021 Jul 1;4(7):e2117052. doi: 10.1001/jamanetworkopen.2021.17052.
5
Using Corpus Analyses to Help Address the DIF Interpretation: Gender Differences in Standardized Writing Assessment.运用语料库分析辅助处理差异项目功能(DIF)解释:标准化写作评估中的性别差异
Front Psychol. 2020 Jun 3;11:1088. doi: 10.3389/fpsyg.2020.01088. eCollection 2020.
6
The Implicit Association Test in health professions education: A meta-narrative review.医学专业学术文献:内隐联想测验在健康职业教育中的应用:元叙述性综述。
Perspect Med Educ. 2019 Oct;8(5):267-275. doi: 10.1007/s40037-019-00533-8.
7
Do Words Matter? Stigmatizing Language and the Transmission of Bias in the Medical Record.用词重要吗?医疗记录中的污名化语言与偏见传递。
J Gen Intern Med. 2018 May;33(5):685-691. doi: 10.1007/s11606-017-4289-2. Epub 2018 Jan 26.
8
Randomized trial of a question prompt list to increase patient active participation during interactions with black patients and their oncologists.一项关于问题提示列表的随机试验,旨在提高黑人患者与其肿瘤学家互动期间患者的积极参与度。
Patient Educ Couns. 2017 May;100(5):818-826. doi: 10.1016/j.pec.2016.12.026. Epub 2016 Dec 27.
9
Copy, Paste, and Cloned Notes in Electronic Health Records.电子健康记录中的复制、粘贴和克隆笔记。
Chest. 2014 Mar;145(3):632-638. doi: 10.1378/chest.13-0886.
10
Examining the Presence, Consequences, and Reduction of Implicit Bias in Health Care: A Narrative Review.审视医疗保健中隐性偏见的存在、后果及减少:一项叙述性综述。
Group Process Intergroup Relat. 2016 Jul;19(4):528-542. doi: 10.1177/1368430216642029. Epub 2016 May 8.