• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用电子健康记录数据和自然语言处理将患有风湿性疾病的个体分类为经济不安全人群:算法推导与验证

Classifying Individuals With Rheumatic Conditions as Financially Insecure Using Electronic Health Record Data and Natural Language Processing: Algorithm Derivation and Validation.

作者信息

Chandler Mia T, Cai Tianrun, Santacroce Leah, Ulysse Sciaska, Liao Katherine P, Feldman Candace H

机构信息

Boston Children's Hospital, Boston, Massachusetts.

Harvard Medical School, Boston, Massachusetts.

出版信息

ACR Open Rheumatol. 2024 Aug;6(8):481-488. doi: 10.1002/acr2.11675. Epub 2024 May 15.

DOI:10.1002/acr2.11675
PMID:38747148
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11319925/
Abstract

OBJECTIVE

We aimed to examine the feasibility of applying natural language processing (NLP) to unstructured electronic health record (EHR) documents to detect the presence of financial insecurity among patients with rheumatologic disease enrolled in an integrated care management program (iCMP).

METHODS

We incorporated supervised, rule-based NLP and statistical methods to identify financial insecurity among patients with rheumatic conditions enrolled in an iCMP (n = 20,395) in a multihospital EHR system. We constructed a lexicon for financial insecurity using data from available knowledge sources and then reviewed EHR notes from 538 randomly selected individuals (training cohort n = 366, validation cohort n = 172). We manually categorized records as having "definite," "possible," or "no" mention of financial insecurity. All available notes were processed using Narrative Information Linear Extraction, a rule-based version of NLP. Models were trained using the NLP features for financial insecurity using logistic, least absolute shrinkage operator (LASSO), and random forest performance characteristic and were compared with the reference standard.

RESULTS

A total of 245,142 notes were processed from 538 individual patient records. Financial insecurity was present among 100 (27%) individuals in the training cohort and 63 (37%) in the validation cohort. The LASSO and random forest models performed identically and slightly better than logistic regression, with positive predictive values of 0.90, sensitivities of 0.29, and specificities of 0.98.

CONCLUSION

The development of a context-driven lexicon used with rule-based NLP to extract data that identify financial insecurity is feasible for use and improved the capture for presence of financial insecurity with high accuracy. In the absence of a standard lexicon and construct definition for financial insecurity status, additional studies are needed to optimize the sensitivity of algorithms to categorize financial insecurity with construct validity.

摘要

目的

我们旨在研究将自然语言处理(NLP)应用于非结构化电子健康记录(EHR)文档,以检测参与综合护理管理计划(iCMP)的风湿病患者中存在财务不安全状况的可行性。

方法

我们采用了监督式、基于规则的NLP和统计方法,以识别多医院EHR系统中参与iCMP的风湿病患者(n = 20,395)中的财务不安全状况。我们利用现有知识源的数据构建了一个财务不安全状况的词汇表,然后审查了538名随机选择个体的EHR记录(训练队列n = 366,验证队列n = 172)。我们将记录手动分类为有“明确”、“可能”或“未提及”财务不安全状况。所有可用记录均使用基于规则的NLP版本——叙事信息线性提取进行处理。使用财务不安全状况的NLP特征,通过逻辑回归、最小绝对收缩算子(LASSO)和随机森林性能特征对模型进行训练,并与参考标准进行比较。

结果

共处理了538份个体患者记录中的245,142条记录。训练队列中有100名(27%)个体存在财务不安全状况,验证队列中有63名(37%)。LASSO和随机森林模型表现相同,且略优于逻辑回归,阳性预测值为0.90,敏感性为0.29,特异性为0.98。

结论

开发一个与基于规则的NLP一起使用的上下文驱动词汇表,以提取识别财务不安全状况的数据是可行的,并且能够以高精度改进对财务不安全状况存在情况的捕捉。在缺乏财务不安全状况的标准词汇表和结构定义的情况下,需要进行更多研究以优化算法的敏感性,从而使财务不安全状况的分类具有结构效度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb5e/11319925/f4499a58e018/ACR2-6-481-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb5e/11319925/f4499a58e018/ACR2-6-481-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb5e/11319925/f4499a58e018/ACR2-6-481-g001.jpg

相似文献

1
Classifying Individuals With Rheumatic Conditions as Financially Insecure Using Electronic Health Record Data and Natural Language Processing: Algorithm Derivation and Validation.利用电子健康记录数据和自然语言处理将患有风湿性疾病的个体分类为经济不安全人群:算法推导与验证
ACR Open Rheumatol. 2024 Aug;6(8):481-488. doi: 10.1002/acr2.11675. Epub 2024 May 15.
2
Automatically Identifying Financial Stress Information from Clinical Notes for Patients with Prostate Cancer.从前列腺癌患者的临床记录中自动识别财务压力信息。
Cancer Res Rep. 2020;1(1). doi: 10.61545/crr-1-102.
3
Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records.基于自然语言处理的电子健康记录中阿尔茨海默病及相关痴呆症社会决定因素的识别。
Health Serv Res. 2023 Dec;58(6):1292-1302. doi: 10.1111/1475-6773.14210. Epub 2023 Aug 3.
4
Generalizability and portability of natural language processing system to extract individual social risk factors.自然语言处理系统提取个体社会风险因素的可推广性和可移植性。
Int J Med Inform. 2023 Sep;177:105115. doi: 10.1016/j.ijmedinf.2023.105115. Epub 2023 Jun 5.
5
Using natural language processing to identify opioid use disorder in electronic health record data.利用自然语言处理技术在电子健康记录数据中识别阿片类药物使用障碍。
Int J Med Inform. 2023 Feb;170:104963. doi: 10.1016/j.ijmedinf.2022.104963. Epub 2022 Dec 10.
6
Comparing Natural Language Processing and Structured Medical Data to Develop a Computable Phenotype for Patients Hospitalized Due to COVID-19: Retrospective Analysis.比较自然语言处理和结构化医学数据以开发COVID-19住院患者的可计算表型:回顾性分析
JMIR Med Inform. 2023 Aug 22;11:e46267. doi: 10.2196/46267.
7
Development and assessment of a natural language processing model to identify residential instability in electronic health records' unstructured data: a comparison of 3 integrated healthcare delivery systems.开发和评估一种用于识别电子健康记录非结构化数据中居住不稳定情况的自然语言处理模型:对3个综合医疗服务系统的比较
JAMIA Open. 2022 Feb 16;5(1):ooac006. doi: 10.1093/jamiaopen/ooac006. eCollection 2022 Apr.
8
ARCH: Large-scale Knowledge Graph via Aggregated Narrative Codified Health Records Analysis.ARCH:通过聚合叙事编码健康记录分析构建大规模知识图谱
medRxiv. 2023 May 21:2023.05.14.23289955. doi: 10.1101/2023.05.14.23289955.
9
Identifying lupus patients in electronic health records: Development and validation of machine learning algorithms and application of rule-based algorithms.在电子健康记录中识别狼疮患者:机器学习算法的开发和验证以及基于规则算法的应用。
Semin Arthritis Rheum. 2019 Aug;49(1):84-90. doi: 10.1016/j.semarthrit.2019.01.002. Epub 2019 Jan 4.
10
Identification of pancreatic cancer risk factors from clinical notes using natural language processing.利用自然语言处理从临床记录中识别胰腺癌风险因素。
Pancreatology. 2024 Jun;24(4):572-578. doi: 10.1016/j.pan.2024.03.016. Epub 2024 Mar 26.

引用本文的文献

1
Universal health-related social needs screening in a paediatric rheumatology clinic.儿科风湿病诊所的通用健康相关社会需求筛查
Rheumatol Adv Pract. 2025 Feb 6;9(2):rkaf014. doi: 10.1093/rap/rkaf014. eCollection 2025.

本文引用的文献

1
Social Determinants of Health Documentation Among Individuals With Rheumatic and Musculoskeletal Conditions in an Integrated Care Management Program.在综合护理管理项目中,风湿和肌肉骨骼疾病患者的健康状况社会决定因素文档记录。
Arthritis Care Res (Hoboken). 2023 Dec;75(12):2529-2536. doi: 10.1002/acr.25174. Epub 2023 Aug 7.
2
Associations Between Natural Language Processing-Enriched Social Determinants of Health and Suicide Death Among US Veterans.自然语言处理增强的健康社会决定因素与美国退伍军人自杀死亡之间的关联。
JAMA Netw Open. 2023 Mar 1;6(3):e233079. doi: 10.1001/jamanetworkopen.2023.3079.
3
Do patients want clinicians to ask about social needs and include this information in their medical record?
患者希望临床医生询问社会需求并将这些信息纳入他们的医疗记录吗?
BMC Health Serv Res. 2022 Oct 22;22(1):1275. doi: 10.1186/s12913-022-08652-5.
4
Association of Area-Level Heat and Social Vulnerability With Recurrent Hospitalizations Among Individuals With Rheumatic Conditions.地区热环境与社会脆弱性与风湿性疾病患者再住院的相关性。
Arthritis Care Res (Hoboken). 2023 Jan;75(1):22-33. doi: 10.1002/acr.25015. Epub 2022 Oct 12.
5
A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models.基于变压器的自然语言处理模型研究肺癌患者健康的社会和行为决定因素。
AMIA Annu Symp Proc. 2022 Feb 21;2021:1225-1233. eCollection 2021.
6
The Impact of an Integrated Care Management Program on Acute Care Use and Outpatient Appointment Attendance Among High-Risk Patients With Lupus.综合护理管理项目对高危狼疮患者急性护理使用情况及门诊预约就诊率的影响。
ACR Open Rheumatol. 2022 Apr;4(4):338-344. doi: 10.1002/acr2.11391. Epub 2022 Jan 18.
7
Evaluation of a Natural Language Processing Approach to Identify Social Determinants of Health in Electronic Health Records in a Diverse Community Cohort.评估一种自然语言处理方法,以识别不同人群队列电子健康记录中的健康社会决定因素。
Med Care. 2022 Mar 1;60(3):248-255. doi: 10.1097/MLR.0000000000001683.
8
Extracting social determinants of health from electronic health records using natural language processing: a systematic review.利用自然语言处理从电子健康记录中提取健康的社会决定因素:系统评价。
J Am Med Inform Assoc. 2021 Nov 25;28(12):2716-2727. doi: 10.1093/jamia/ocab170.
9
Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study.衡量实用文本挖掘方法在电子健康记录中的自由文本记录中识别住房问题患者的价值:一项回顾性队列研究的结果。
Front Public Health. 2021 Aug 27;9:697501. doi: 10.3389/fpubh.2021.697501. eCollection 2021.
10
Socioeconomic Disparities in Functional Status in a National Sample of Patients With Rheumatoid Arthritis.类风湿关节炎患者全国样本中功能状态的社会经济差异。
JAMA Netw Open. 2021 Aug 2;4(8):e2119400. doi: 10.1001/jamanetworkopen.2021.19400.