Enhancement of a social risk score in the electronic health record to identify social needs among medically underserved patients: using structured data and free-text provider notes.

Author information

Hatef Elham, Kitchen Christopher, Gray Geoffrey M, Zirikly Ayah, Richards Thomas, Ahumada Luis M, Weiner Jonathan P

Affiliations

Division of General Internal Medicine, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD 21205, United States.

Center for Population Health Information Technology, Department of Health Policy and Management, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, United States.

Publication information

JAMIA Open. 2024 Oct 29;7(4):ooae117. doi: 10.1093/jamiaopen/ooae117. eCollection 2024 Dec.

DOI:10.1093/jamiaopen/ooae117

PMID:39473880

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11521376/

Abstract

OBJECTIVE

To improve the performance of a social risk score (a predictive risk model) using electronic health record (EHR) structured and unstructured data.

MATERIALS AND METHODS

We used EPIC-based EHR data from July 2016 to June 2021 and linked it to community-level data from the US Census American Community Survey. We identified predictors of interest within the EHR structured data and applied natural language processing (NLP) techniques to identify patients' social needs in the EHR unstructured data. We fit logistic regression models without and with information from the unstructured data (Models I and II, respectively) and compared their performance with generalized estimating equation (GEE) models without and with the unstructured data (Models III and IV, respectively).
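As a rough sketch of the two model families named above: the abstract does not give the model specification, so every symbol here is an assumption for illustration. Let $Y_i$ indicate a social need for patient $i$, $\mathbf{x}_i$ the structured and community-level predictors, and $z_i$ a hypothetical NLP-derived indicator that would appear only in Models II and IV. The clustering unit $k$ for the GEE is likewise assumed, since the abstract does not state it.

```latex
% Logistic regression (Models I and II); the \gamma z_i term is present only
% when information from unstructured data is included (assumed form).
\operatorname{logit}\,\Pr(Y_i = 1 \mid \mathbf{x}_i, z_i)
  = \beta_0 + \mathbf{x}_i^{\top}\boldsymbol{\beta} + \gamma z_i

% GEE (Models III and IV): same mean model, with coefficients
% \boldsymbol{\theta} = (\beta_0, \boldsymbol{\beta}, \gamma) solving the
% standard estimating equation over K clusters (clustering unit assumed).
\sum_{k=1}^{K} \mathbf{D}_k^{\top}\,\mathbf{V}_k^{-1}
  \bigl(\mathbf{y}_k - \boldsymbol{\mu}_k(\boldsymbol{\theta})\bigr) = \mathbf{0},
\qquad
\mathbf{D}_k = \frac{\partial \boldsymbol{\mu}_k}{\partial \boldsymbol{\theta}}
```

Here $\mathbf{V}_k$ is the working covariance matrix for cluster $k$. The GEE gives population-averaged coefficients that account for within-cluster correlation, which is one plausible reason for comparing it against ordinary logistic regression.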

RESULTS

The logistic model using structured data only (Model I) achieved an area under the receiver operating characteristic curve (AUC) of 0.703 (95% confidence interval [CI], 0.701-0.705), and the addition of EHR unstructured data (Model II) changed the AUC only slightly (0.701; 95% CI, 0.699-0.703). In the logistic models, the addition of EHR unstructured data increased the area under the precision-recall curve (PRC) from 0.255 (95% CI, 0.254-0.256) in Model I to 0.378 (95% CI, 0.375-0.38) in Model II. The GEE models performed similarly to the logistic models, and the addition of EHR unstructured data again changed the AUC only slightly (0.702; 95% CI, 0.699-0.705 in Model III versus 0.699; 95% CI, 0.698-0.702 in Model IV).
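The pattern reported above, a nearly unchanged AUC alongside a higher PRC, can occur because the two metrics weight ranking errors differently when positive cases are rare. The following minimal sketch uses invented toy data, not the study's data. The function names and the two hypothetical score vectors are assumptions for illustration, and the precision-recall area is computed as step-wise average precision, which may differ from the authors' method.

```python
from typing import Sequence


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Probability that a random positive outranks a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def average_precision(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Step-wise area under the precision-recall curve (assumes no tied scores)."""
    ranked = sorted(zip(scores, y_true), key=lambda pair: -pair[0])
    n_pos = sum(y_true)
    true_pos = 0
    total = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            true_pos += 1
            total += true_pos / rank  # precision at each recalled positive
    return total / n_pos


# Toy cohort: 2 patients with a social need among 10 (hypothetical labels).
labels = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
# Hypothetical model A ranks the positives 3rd and 4th.
scores_a = [0.7, 0.6, 0.9, 0.8, 0.5, 0.4, 0.3, 0.2, 0.1, 0.0]
# Hypothetical model B ranks the positives 1st and 6th.
scores_b = [0.9, 0.4, 0.8, 0.7, 0.6, 0.5, 0.3, 0.2, 0.1, 0.0]

for name, scores in (("A", scores_a), ("B", scores_b)):
    print(name, f"AUC={roc_auc(labels, scores):.3f}",
          f"AP={average_precision(labels, scores):.3f}")
```

Both toy models have the same AUC (0.75), but model B has the higher average precision because it places one positive case at the very top of the ranking. That is the kind of difference a precision-recall curve rewards.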

DISCUSSION

Our work enhances a novel social risk score that integrates community-level data with patient-level data to systematically identify patients at increased risk of future social needs. Patients identified in this way can receive an in-depth assessment of their social needs and, where appropriate, referral to community-based organizations that address those needs.

CONCLUSION

Adding information on social needs extracted from unstructured EHR data improved the prediction of positive cases, as reflected by the increase in the PRC.
