Extracting social determinants of health events with transformer-based multitask, multilabel named entity recognition.

Affiliations

Tsui Laboratory, Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.

MindCORE and Cognitive Science, University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Publication information

J Am Med Inform Assoc. 2023 Jul 19;30(8):1379-1388. doi: 10.1093/jamia/ocad046.

DOI:10.1093/jamia/ocad046
PMID:37002953
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10354761/
Abstract

OBJECTIVE

Social determinants of health (SDOH) are nonclinical, socioeconomic conditions that influence patient health and quality of life. Identifying SDOH may help clinicians target interventions. However, SDOH are more frequently available in narrative notes compared to structured electronic health records. The 2022 n2c2 Track 2 competition released clinical notes annotated for SDOH to promote development of NLP systems for extracting SDOH. We developed a system addressing 3 limitations in state-of-the-art SDOH extraction: the inability to identify multiple SDOH events of the same type per sentence, overlapping SDOH attributes within text spans, and SDOH spanning multiple sentences.

MATERIALS AND METHODS

We developed and evaluated a 2-stage architecture. In stage 1, we trained a BioClinical-BERT-based named entity recognition system to extract SDOH event triggers, that is, text spans indicating substance use, employment, or living status. In stage 2, we trained a multitask, multilabel NER to extract arguments (eg, alcohol "type") for events extracted in stage 1. Evaluation was performed across 3 subtasks differing by provenance of training and validation data using precision, recall, and F1 scores.
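
To illustrate why a multilabel formulation can represent overlapping arguments, here is a minimal sketch of span decoding in which every label is thresholded independently per token. It is not the authors' implementation: the function name `decode_multilabel_spans`, the 0.5 threshold, the label names other than alcohol "type", and the hand-written probabilities are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Span = Tuple[int, int, str]  # (start, end_exclusive, text)


def decode_multilabel_spans(
    tokens: List[str],
    probs: List[Dict[str, float]],
    threshold: float = 0.5,  # assumed cutoff, not taken from the paper
) -> Dict[str, List[Span]]:
    """Convert per-token, per-label probabilities into spans.

    Each label is decoded on its own, so one token may fall inside
    spans of several labels at once (overlap is allowed).
    """
    labels = sorted({label for p in probs for label in p})
    spans: Dict[str, List[Span]] = {label: [] for label in labels}
    for label in labels:
        start = None
        for i, p in enumerate(probs):
            active = p.get(label, 0.0) >= threshold
            if active and start is None:
                start = i  # a new span opens here
            elif not active and start is not None:
                spans[label].append((start, i, " ".join(tokens[start:i])))
                start = None
        if start is not None:  # span runs to the end of the sentence
            spans[label].append((start, len(tokens), " ".join(tokens[start:])))
    return spans


# Hypothetical model output for one sentence; "beers" carries two labels.
tokens = ["drinks", "two", "beers", "daily"]
probs = [
    {"Trigger": 0.9},
    {"Amount": 0.8},
    {"Amount": 0.7, "Type": 0.9},
    {"Frequency": 0.95},
]
spans = decode_multilabel_spans(tokens, probs)
print(spans["Amount"], spans["Type"])
```

With a single-label tagging scheme, the token "beers" would have to be assigned to either the amount span or the type span; decoding each label separately lets both spans cover it.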

RESULTS

When trained and validated on data from the same site, we achieved 0.87 precision, 0.89 recall, and 0.88 F1. Across all subtasks, we ranked between second and fourth place in the competition and always within 0.02 F1 from first.
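
As a quick consistency check on the reported numbers, F1 is the harmonic mean of precision and recall. The short sketch below (the helper name `f1_score` is ours, not from the paper) recomputes it from the stated precision and recall.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Same-site values reported in the abstract.
f1 = f1_score(0.87, 0.89)
print(round(f1, 2))  # 0.88
```

The result matches the reported F1 of 0.88 after rounding to two decimals.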

CONCLUSIONS

Our 2-stage, deep-learning-based NLP system effectively extracted SDOH events from clinical notes. This was achieved with a novel classification framework that leveraged simpler architectures compared to state-of-the-art systems. Improved SDOH extraction may help clinicians improve health outcomes.
