Using Natural Language Processing to Identify Stigmatizing Language in Labor and Birth Clinical Notes.

Affiliations

School of Nursing, Columbia University, 560 West 168th St, Mail Code 6, New York, NY, 10032, USA.

Department of Computer Science, Aalto University, Espoo, Finland.

Publication information

Matern Child Health J. 2024 Mar;28(3):578-586. doi: 10.1007/s10995-023-03857-4. Epub 2023 Dec 26.

DOI: 10.1007/s10995-023-03857-4
PMID: 38147277

Abstract

INTRODUCTION

Stigma and bias related to race and other minoritized statuses may underlie disparities in pregnancy and birth outcomes. One emerging method to identify bias is the study of stigmatizing language in the electronic health record. The objective of our study was to develop automated natural language processing (NLP) methods to identify two types of stigmatizing language: marginalizing language and its complement, power/privilege language, accurately and automatically in labor and birth notes.

METHODS

We analyzed notes for all birthing people > 20 weeks' gestation admitted for labor and birth at two hospitals during 2017. We then employed text preprocessing techniques, specifically using TF-IDF values as inputs, and tested machine learning classification algorithms to identify stigmatizing and power/privilege language in clinical notes. The algorithms assessed included Decision Trees, Random Forest, and Support Vector Machines. Additionally, we applied a feature importance evaluation method (InfoGain) to discern words that are highly correlated with these language categories.
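
The abstract gives no implementation details, so the following is only a minimal sketch of the two ideas named in the methods: TF-IDF weighting of note text and information gain for ranking words by how strongly they track a label. The toy notes, labels, and function names (`tfidf`, `entropy`, `info_gain`) are hypothetical, and the smoothed IDF formula is one common variant, not necessarily the one the authors used.

```python
import math
from collections import Counter

# Hypothetical toy notes and labels (1 = contains the target language category).
NOTES = [
    "patient refuses epidural and is noncompliant",
    "patient refuses exam",
    "patient prefers to labor without epidural",
    "patient resting comfortably",
]
LABELS = [1, 1, 0, 0]


def tfidf(docs):
    """Return one {term: weight} dict per document.

    Assumes tf = count / document length and a smoothed
    idf = ln((1 + N) / (1 + df)) + 1, chosen here for illustration.
    """
    tokenized = [doc.split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents in which each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum(
        (n / total) * math.log2(n / total) for n in Counter(labels).values()
    )


def info_gain(term, docs, labels):
    """Reduction in label entropy from knowing whether `term` is present."""
    present = [y for doc, y in zip(docs, labels) if term in doc.split()]
    absent = [y for doc, y in zip(docs, labels) if term not in doc.split()]
    total = len(labels)
    conditional = (
        len(present) / total * entropy(present)
        + len(absent) / total * entropy(absent)
    )
    return entropy(labels) - conditional


vectors = tfidf(NOTES)
vocab = sorted({term for doc in NOTES for term in doc.split()})
# Words ranked from most to least informative about the label.
ranked = sorted(vocab, key=lambda t: info_gain(t, NOTES, LABELS), reverse=True)
```

In this toy data the word "refuses" appears in exactly the positive notes, so it separates the classes perfectly and ranks first, while "patient" appears in every note and carries no information. In a fuller pipeline the TF-IDF vectors would be the inputs to a classifier such as a decision tree or support vector machine.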

RESULTS

For marginalizing language, Decision Trees yielded the best classification with an F-score of 0.73. For power/privilege language, Support Vector Machines performed optimally, achieving an F-score of 0.91. These results demonstrate the effectiveness of the selected machine learning methods in classifying language categories in clinical notes.
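
As a reminder of how such a score is obtained, here is a minimal sketch that assumes the balanced F1 variant (the abstract says only "F-score"). The function name `f_score` and the example labels and predictions are made up for illustration and are unrelated to the study data.

```python
def f_score(y_true, y_pred, positive=1):
    """Balanced F-score (F1) for one class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical gold labels and classifier predictions.
Y_TRUE = [1, 1, 1, 0, 0, 0]
Y_PRED = [1, 1, 0, 1, 0, 0]
score = f_score(Y_TRUE, Y_PRED)
```

Here two of three positive notes are found and one negative note is flagged in error, so precision and recall are both 2/3 and the F-score is also 2/3.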

CONCLUSION

We identified well-performing machine learning methods to automatically detect stigmatizing language in clinical notes. To our knowledge, this is the first study to use NLP performance metrics to evaluate the performance of machine learning methods in discerning stigmatizing language. Future studies should delve deeper into refining and evaluating NLP methods, incorporating the latest algorithms rooted in deep learning.

Similar articles

1. Using Natural Language Processing to Identify Stigmatizing Language in Labor and Birth Clinical Notes.
   Matern Child Health J. 2024 Mar;28(3):578-586. doi: 10.1007/s10995-023-03857-4. Epub 2023 Dec 26.
2. A qualitative analysis of stigmatizing language in birth admission clinical notes.
   Nurs Inq. 2023 Jul;30(3):e12557. doi: 10.1111/nin.12557. Epub 2023 Apr 18.
3. A clinical text classification paradigm using weak supervision and deep representation.
   BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.
4. Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.
   BMC Med Inform Decis Mak. 2017 Dec 1;17(1):155. doi: 10.1186/s12911-017-0556-8.
5. Measuring Implicit Bias in ICU Notes Using Word-Embedding Neural Network Models.
   Chest. 2024 Jun;165(6):1481-1490. doi: 10.1016/j.chest.2023.12.031. Epub 2024 Jan 8.
6. Advancing equity in breast cancer care: natural language processing for analysing treatment outcomes in under-represented populations.
   BMJ Health Care Inform. 2024 Jul 1;31(1):e100966. doi: 10.1136/bmjhci-2023-100966.
7. Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.
   J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
8. Classifying early infant feeding status from clinical notes using natural language processing and machine learning.
   Sci Rep. 2024 Apr 3;14(1):7831. doi: 10.1038/s41598-024-58299-x.
9. Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.
   J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133.
10. Risk prediction using natural language processing of electronic mental health records in an inpatient forensic psychiatry setting.
    J Biomed Inform. 2018 Oct;86:49-58. doi: 10.1016/j.jbi.2018.08.007. Epub 2018 Aug 14.

Cited by

1. Chaplains' Charting in the USA in the Era of "Open Notes:" Recommendations from a Quality Improvement Project.
   J Relig Health. 2025 Aug 23. doi: 10.1007/s10943-025-02414-3.
2. Detecting Stigmatizing Language in Clinical Notes with Large Language Models for Addiction Care.
   medRxiv. 2025 Aug 12:2025.08.08.25333315. doi: 10.1101/2025.08.08.25333315.
3. Efficient Detection of Stigmatizing Language in Electronic Health Records via In-Context Learning: Comparative Analysis and Validation Study.
   JMIR Med Inform. 2025 Aug 18;13:e68955. doi: 10.2196/68955.
4. Stigmatizing and Positive Language in Birth Clinical Notes Associated With Race and Ethnicity.
   JAMA Netw Open. 2025 May 1;8(5):e259599. doi: 10.1001/jamanetworkopen.2025.9599.
5. Improving Clinical Documentation with Artificial Intelligence: A Systematic Review.
   Perspect Health Inf Manag. 2024 Jun 1;21(2):1d. eCollection 2024 Summer-Fall.
6. CARE-SD: classifier-based analysis for recognizing provider stigmatizing and doubt marker labels in electronic health records: model development and validation.
   J Am Med Inform Assoc. 2025 Feb 1;32(2):365-374. doi: 10.1093/jamia/ocae310.
7. Understanding Daily Care Experience Preferences Across the Lifespan of Older Adults: Application of Natural Language Processing.
   West J Nurs Res. 2025 Feb;47(2):71-81. doi: 10.1177/01939459241306946. Epub 2024 Dec 21.
8. Identifying stigmatizing and positive/preferred language in obstetric clinical notes using natural language processing.
   J Am Med Inform Assoc. 2025 Feb 1;32(2):308-317. doi: 10.1093/jamia/ocae290.