• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用自然语言处理从患者病历中识别社会需求:在综合医疗服务系统中开发和评估一个可扩展、高性能且基于规则的模型。

Application of natural language processing to identify social needs from patient medical notes: development and assessment of a scalable, performant, and rule-based model in an integrated healthcare delivery system.

作者信息

Gray Geoffrey M, Zirikly Ayah, Ahumada Luis M, Rouhizadeh Masoud, Richards Thomas, Kitchen Christopher, Foroughmand Iman, Hatef Elham

机构信息

Center for Pediatric Data Science and Analytic Methodology, Johns Hopkins All Children's Hospital, St. Petersburg, FL, United States.

Department of Computer Science, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, United States.

出版信息

JAMIA Open. 2023 Oct 4;6(4):ooad085. doi: 10.1093/jamiaopen/ooad085. eCollection 2023 Dec.

DOI:10.1093/jamiaopen/ooad085
PMID:37799347
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10550267/
Abstract

OBJECTIVES

To develop and test a scalable, performant, and rule-based model for identifying 3 major domains of social needs (residential instability, food insecurity, and transportation issues) from the unstructured data in electronic health records (EHRs).

MATERIALS AND METHODS

We included patients aged 18 years or older who received care at the Johns Hopkins Health System (JHHS) between July 2016 and June 2021 and had at least 1 unstructured (free-text) note in their EHR during the study period. We used a combination of manual lexicon curation and semiautomated lexicon creation for feature development. We developed an initial rules-based pipeline (Match Pipeline) using 2 keyword sets for each social needs domain. We performed rule-based keyword matching for distinct lexicons and tested the algorithm using an annotated dataset comprising 192 patients. Starting with a set of expert-identified keywords, we tested the adjustments by evaluating false positives and negatives identified in the labeled dataset. We assessed the performance of the algorithm using measures of precision, recall, and 1 score.

RESULTS

The algorithm for identifying residential instability had the best overall performance, with a weighted average for precision, recall, and 1 score of 0.92, 0.84, and 0.92 for identifying patients with homelessness and 0.84, 0.82, and 0.79 for identifying patients with housing insecurity. Metrics for the food insecurity algorithm were high but the transportation issues algorithm was the lowest overall performing metric.

DISCUSSION

The NLP algorithm in identifying social needs at JHHS performed relatively well and would provide the opportunity for implementation in a healthcare system.

CONCLUSION

The NLP approach developed in this project could be adapted and potentially operationalized in the routine data processes of a healthcare system.

摘要

目标

开发并测试一种可扩展、高性能且基于规则的模型,用于从电子健康记录(EHR)中的非结构化数据识别社会需求的3个主要领域(居住不稳定、粮食不安全和交通问题)。

材料与方法

我们纳入了2016年7月至2021年6月期间在约翰霍普金斯医疗系统(JHHS)接受治疗且年龄在18岁及以上、在研究期间其EHR中至少有1条非结构化(自由文本)记录的患者。我们使用手动词汇编纂和半自动词汇创建相结合的方法进行特征开发。我们针对每个社会需求领域使用2个关键词集开发了一个初始的基于规则的流程(匹配流程)。我们对不同的词汇进行基于规则的关键词匹配,并使用包含192名患者的注释数据集测试该算法。从一组专家确定的关键词开始,我们通过评估在标记数据集中识别出的假阳性和假阴性来测试调整情况。我们使用精确率、召回率和F1分数来评估算法的性能。

结果

识别居住不稳定的算法总体性能最佳,识别无家可归患者时精确率、召回率和F1分数的加权平均值分别为0.92、0.84和0.92,识别住房不安全患者时分别为0.84、0.82和0.79。粮食不安全算法的指标较高,但交通问题算法的总体性能指标最低。

讨论

JHHS中用于识别社会需求的自然语言处理算法表现相对较好,将为在医疗系统中实施提供机会。

结论

本项目开发的自然语言处理方法可在医疗系统的常规数据流程中进行调整并可能投入使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebce/10550267/5af04a1dbd2d/ooad085f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebce/10550267/51dc32ffef95/ooad085f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebce/10550267/5af04a1dbd2d/ooad085f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebce/10550267/51dc32ffef95/ooad085f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebce/10550267/5af04a1dbd2d/ooad085f2.jpg

相似文献

1
Application of natural language processing to identify social needs from patient medical notes: development and assessment of a scalable, performant, and rule-based model in an integrated healthcare delivery system.应用自然语言处理从患者病历中识别社会需求:在综合医疗服务系统中开发和评估一个可扩展、高性能且基于规则的模型。
JAMIA Open. 2023 Oct 4;6(4):ooad085. doi: 10.1093/jamiaopen/ooad085. eCollection 2023 Dec.
2
Development and assessment of a natural language processing model to identify residential instability in electronic health records' unstructured data: a comparison of 3 integrated healthcare delivery systems.开发和评估一种用于识别电子健康记录非结构化数据中居住不稳定情况的自然语言处理模型:对3个综合医疗服务系统的比较
JAMIA Open. 2022 Feb 16;5(1):ooac006. doi: 10.1093/jamiaopen/ooac006. eCollection 2022 Apr.
3
Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study.衡量实用文本挖掘方法在电子健康记录中的自由文本记录中识别住房问题患者的价值:一项回顾性队列研究的结果。
Front Public Health. 2021 Aug 27;9:697501. doi: 10.3389/fpubh.2021.697501. eCollection 2021.
4
Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records.基于自然语言处理的电子健康记录中阿尔茨海默病及相关痴呆症社会决定因素的识别。
Health Serv Res. 2023 Dec;58(6):1292-1302. doi: 10.1111/1475-6773.14210. Epub 2023 Aug 3.
5
Using Large Language Models to Annotate Complex Cases of Social Determinants of Health in Longitudinal Clinical Records.使用大语言模型注释纵向临床记录中健康社会决定因素的复杂病例。
medRxiv. 2024 Apr 27:2024.04.25.24306380. doi: 10.1101/2024.04.25.24306380.
6
Automatically identifying social isolation from clinical narratives for patients with prostate Cancer.自动识别前列腺癌患者临床叙述中的社会孤立现象。
BMC Med Inform Decis Mak. 2019 Mar 14;19(1):43. doi: 10.1186/s12911-019-0795-y.
7
Development of a natural language processing algorithm to extract seizure types and frequencies from the electronic health record.开发一种自然语言处理算法,从电子健康记录中提取癫痫发作类型和频率。
Seizure. 2022 Oct;101:48-51. doi: 10.1016/j.seizure.2022.07.010. Epub 2022 Jul 20.
8
Can We Geographically Validate a Natural Language Processing Algorithm for Automated Detection of Incidental Durotomy Across Three Independent Cohorts From Two Continents?能否通过来自两大洲的三个独立队列对用于自动检测偶然硬脊膜切开术的自然语言处理算法进行地理验证?
Clin Orthop Relat Res. 2022 Sep 1;480(9):1766-1775. doi: 10.1097/CORR.0000000000002200. Epub 2022 Apr 12.
9
Extracting Medical Information From Free-Text and Unstructured Patient-Generated Health Data Using Natural Language Processing Methods: Feasibility Study With Real-world Data.使用自然语言处理方法从自由文本和非结构化患者生成的健康数据中提取医学信息:基于真实世界数据的可行性研究
JMIR Form Res. 2023 Mar 7;7:e43014. doi: 10.2196/43014.
10
A Natural Language Processing Model for COVID-19 Detection Based on Dutch General Practice Electronic Health Records by Using Bidirectional Encoder Representations From Transformers: Development and Validation Study.基于荷兰全科电子健康记录的 COVID-19 检测自然语言处理模型:使用转换器的双向编码器表示进行开发和验证研究。
J Med Internet Res. 2023 Oct 4;25:e49944. doi: 10.2196/49944.

引用本文的文献

1
Improving Clinical Documentation with Artificial Intelligence: A Systematic Review.利用人工智能改善临床文档记录:一项系统综述。
Perspect Health Inf Manag. 2024 Jun 1;21(2):1d. eCollection 2024 Summer-Fall.
2
Extracting Housing and Food Insecurity Information From Clinical Notes Using cTAKES.使用cTAKES从临床记录中提取住房和粮食不安全信息。
Health Serv Res. 2025 May;60 Suppl 3(Suppl 3):e14440. doi: 10.1111/1475-6773.14440. Epub 2025 Jan 28.
3
Applications of Natural Language Processing and Large Language Models for Social Determinants of Health: Protocol for a Systematic Review.

本文引用的文献

1
Real-world integration of the protocol for responding to and assessing patients' assets, risks, and experiences tool to assess social determinants of health in the electronic medical record at an academic medical center.在一家学术医疗中心,将应对和评估患者资产、风险及体验的协议与工具实际整合到电子病历中,以评估健康的社会决定因素。
Digit Health. 2023 May 22;9:20552076231176652. doi: 10.1177/20552076231176652. eCollection 2023 Jan-Dec.
2
Unemployment, Homelessness, and Other Societal Outcomes Among US Veterans With Schizophrenia Relapse: A Retrospective Cohort Study.美国精神分裂症复发退伍军人的失业、无家可归及其他社会后果:一项回顾性队列研究。
Prim Care Companion CNS Disord. 2022 Sep 13;24(5):21m03173. doi: 10.4088/PCC.21m03173.
3
自然语言处理和大语言模型在健康社会决定因素中的应用:系统评价方案
JMIR Res Protoc. 2025 Jan 21;14:e66094. doi: 10.2196/66094.
4
A Clinical Decision Support System for Addressing Health-Related Social Needs in Emergency Department: Defining End User Needs and Preferences.急诊科解决健康相关社会需求的临床决策支持系统:定义最终用户需求和偏好
Appl Clin Inform. 2024 Oct;15(5):1097-1106. doi: 10.1055/s-0044-1791816. Epub 2024 Dec 18.
5
Enhancement of a social risk score in the electronic health record to identify social needs among medically underserved patients: using structured data and free-text provider notes.增强电子健康记录中的社会风险评分以识别医疗服务不足患者的社会需求:利用结构化数据和自由文本形式的医生记录。
JAMIA Open. 2024 Oct 29;7(4):ooae117. doi: 10.1093/jamiaopen/ooae117. eCollection 2024 Dec.
When There Is Value in Asking: An Argument for Social Risk Screening in Clinical Practice.何时提问具有价值:关于临床实践中社会风险筛查的争论
Ann Intern Med. 2022 Aug;175(8):1181-1182. doi: 10.7326/M22-0147. Epub 2022 Jun 14.
4
Development and assessment of a natural language processing model to identify residential instability in electronic health records' unstructured data: a comparison of 3 integrated healthcare delivery systems.开发和评估一种用于识别电子健康记录非结构化数据中居住不稳定情况的自然语言处理模型:对3个综合医疗服务系统的比较
JAMIA Open. 2022 Feb 16;5(1):ooac006. doi: 10.1093/jamiaopen/ooac006. eCollection 2022 Apr.
5
Extracting social determinants of health from electronic health records using natural language processing: a systematic review.利用自然语言处理从电子健康记录中提取健康的社会决定因素:系统评价。
J Am Med Inform Assoc. 2021 Nov 25;28(12):2716-2727. doi: 10.1093/jamia/ocab170.
6
The Impact of Social Determinants of Health on Medication Adherence: a Systematic Review and Meta-analysis.社会健康决定因素对药物依从性的影响:系统评价和荟萃分析。
J Gen Intern Med. 2021 May;36(5):1359-1370. doi: 10.1007/s11606-020-06447-0. Epub 2021 Jan 29.
7
Assessing the Impact of Social Needs and Social Determinants of Health on Health Care Utilization: Using Patient- and Community-Level Data.评估社会需求和健康的社会决定因素对医疗保健利用的影响:使用患者和社区层面的数据。
Popul Health Manag. 2021 Apr;24(2):222-230. doi: 10.1089/pop.2020.0043. Epub 2020 Jun 25.
8
Artificial intelligence approaches using natural language processing to advance EHR-based clinical research.利用自然语言处理技术的人工智能方法来推进基于电子健康记录的临床研究。
J Allergy Clin Immunol. 2020 Feb;145(2):463-469. doi: 10.1016/j.jaci.2019.12.897. Epub 2019 Dec 26.
9
The Association Between Neighborhood Socioeconomic and Housing Characteristics with Hospitalization: Results of a National Study of Veterans.社区社会经济与住房特征与住院治疗之间的关联:一项退伍军人全国性研究的结果
J Am Board Fam Med. 2019 Nov-Dec;32(6):890-903. doi: 10.3122/jabfm.2019.06.190138.
10
Identifying Patients with Significant Problems Related to Social Determinants of Health with Natural Language Processing.利用自然语言处理技术识别与健康的社会决定因素相关的重大问题患者。
Stud Health Technol Inform. 2019 Aug 21;264:1456-1457. doi: 10.3233/SHTI190482.