• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Extracting social determinants of health from clinical note text with classification and sequence-to-sequence approaches.使用分类和序列到序列方法从临床记录文本中提取健康的社会决定因素。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1448-1455. doi: 10.1093/jamia/ocad071.
2
The 2022 n2c2/UW shared task on extracting social determinants of health.2022 年 n2c2/UW 关于提取健康社会决定因素的共享任务。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1367-1378. doi: 10.1093/jamia/ocad012.
3
Extracting social determinants of health events with transformer-based multitask, multilabel named entity recognition.基于转换器的多任务、多标签命名实体识别技术提取健康事件的社会决定因素。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1379-1388. doi: 10.1093/jamia/ocad046.
4
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
5
A marker-based neural network system for extracting social determinants of health.基于标记的神经网络系统,用于提取健康的社会决定因素。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1398-1407. doi: 10.1093/jamia/ocad041.
6
Large Language Models for Social Determinants of Health Information Extraction from Clinical Notes - A Generalizable Approach across Institutions.用于从临床记录中提取健康信息社会决定因素的大语言模型——一种适用于各机构的通用方法。
medRxiv. 2024 May 22:2024.05.21.24307726. doi: 10.1101/2024.05.21.24307726.
7
Annotating social determinants of health using active learning, and characterizing determinants using neural event extraction.使用主动学习对健康的社会决定因素进行标注,并使用神经事件提取对决定因素进行特征描述。
J Biomed Inform. 2021 Jan;113:103631. doi: 10.1016/j.jbi.2020.103631. Epub 2020 Dec 5.
8
A Hybrid Model for Family History Information Identification and Relation Extraction: Development and Evaluation of an End-to-End Information Extraction System.一种用于家族病史信息识别与关系抽取的混合模型:一个端到端信息抽取系统的开发与评估
JMIR Med Inform. 2021 Apr 22;9(4):e22797. doi: 10.2196/22797.
9
Identifying social determinants of health from clinical narratives: A study of performance, documentation ratio, and potential bias.从临床叙述中识别健康的社会决定因素:一项关于表现、记录比例和潜在偏差的研究。
J Biomed Inform. 2024 May;153:104642. doi: 10.1016/j.jbi.2024.104642. Epub 2024 Apr 14.
10
Automatic extraction of social determinants of health from medical notes of chronic lower back pain patients.从慢性下背痛患者的病历中自动提取健康的社会决定因素。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1438-1447. doi: 10.1093/jamia/ocad054.

引用本文的文献

1
Leveraging large language models for the deidentification and temporal normalization of sensitive health information in electronic health records.利用大语言模型对电子健康记录中的敏感健康信息进行去识别化处理和时间标准化。
NPJ Digit Med. 2025 Aug 13;8(1):517. doi: 10.1038/s41746-025-01921-7.
2
Boosting Social Determinants of Health Extraction with Semantic Knowledge Augmented Large Language Model.利用语义知识增强的大语言模型提升健康的社会决定因素提取能力
AMIA Annu Symp Proc. 2025 May 22;2024:453-462. eCollection 2024.
3
Social determinants of health extraction from clinical notes across institutions using large language models.使用大语言模型从各机构的临床记录中提取健康的社会决定因素。
NPJ Digit Med. 2025 May 17;8(1):287. doi: 10.1038/s41746-025-01645-8.
4
Change in address in electronic health records as an early marker of homelessness.电子健康记录中的地址变更作为无家可归的早期指标。
PLoS One. 2025 Mar 10;20(3):e0318552. doi: 10.1371/journal.pone.0318552. eCollection 2025.
5
Decoding substance use disorder severity from clinical notes using a large language model.使用大语言模型从临床记录中解码物质使用障碍的严重程度
Npj Ment Health Res. 2025 Feb 7;4(1):5. doi: 10.1038/s44184-024-00114-6.
6
CACER: Clinical concept Annotations for Cancer Events and Relations.CACER:癌症事件与关系的临床概念注释。
J Am Med Inform Assoc. 2024 Nov 1;31(11):2583-2594. doi: 10.1093/jamia/ocae231.
7
Zero-shot Learning with Minimum Instruction to Extract Social Determinants and Family History from Clinical Notes using GPT Model.使用GPT模型从临床记录中提取社会决定因素和家族病史的最少指令零样本学习
Proc IEEE Int Conf Big Data. 2023 Dec;2023:1476-1480. doi: 10.1109/BigData59044.2023.10386811.
8
Extraction of Substance Use Information From Clinical Notes: Generative Pretrained Transformer-Based Investigation.从临床记录中提取物质使用信息:基于生成式预训练变换器的研究
JMIR Med Inform. 2024 Aug 19;12:e56243. doi: 10.2196/56243.
9
Large language models to identify social determinants of health in electronic health records.利用大语言模型识别电子健康记录中的健康社会决定因素。
NPJ Digit Med. 2024 Jan 11;7(1):6. doi: 10.1038/s41746-023-00970-0.
10
Advancements in extracting social determinants of health information from narrative text.从叙述性文本中提取健康信息的社会决定因素的进展。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1363-1366. doi: 10.1093/jamia/ocad121.

本文引用的文献

1
The 2022 n2c2/UW shared task on extracting social determinants of health.2022 年 n2c2/UW 关于提取健康社会决定因素的共享任务。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1367-1378. doi: 10.1093/jamia/ocad012.
2
A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models.基于变压器的自然语言处理模型研究肺癌患者健康的社会和行为决定因素。
AMIA Annu Symp Proc. 2022 Feb 21;2021:1225-1233. eCollection 2021.
3
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
4
Extracting social determinants of health from electronic health records using natural language processing: a systematic review.利用自然语言处理从电子健康记录中提取健康的社会决定因素:系统评价。
J Am Med Inform Assoc. 2021 Nov 25;28(12):2716-2727. doi: 10.1093/jamia/ocab170.
5
Identification of social determinants of health using multi-label classification of electronic health record clinical notes.利用电子健康记录临床笔记的多标签分类识别健康的社会决定因素。
JAMIA Open. 2021 Feb 9;4(3):ooaa069. doi: 10.1093/jamiaopen/ooaa069. eCollection 2021 Jul.
6
Annotating social determinants of health using active learning, and characterizing determinants using neural event extraction.使用主动学习对健康的社会决定因素进行标注,并使用神经事件提取对决定因素进行特征描述。
J Biomed Inform. 2021 Jan;113:103631. doi: 10.1016/j.jbi.2020.103631. Epub 2020 Dec 5.
7
COVID-19: a potential public health problem for homeless populations.新型冠状病毒肺炎:无家可归人群面临的潜在公共卫生问题。
Lancet Public Health. 2020 Apr;5(4):e186-e187. doi: 10.1016/S2468-2667(20)30053-0. Epub 2020 Mar 11.
8
Detecting Social and Behavioral Determinants of Health with Structured and Free-Text Clinical Data.利用结构化和自由文本临床数据检测健康的社会和行为决定因素。
Appl Clin Inform. 2020 Jan;11(1):172-181. doi: 10.1055/s-0040-1702214. Epub 2020 Mar 4.
9
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
10
Ensemble method-based extraction of medication and related information from clinical texts.基于集成方法的临床文本中药物及相关信息的提取。
J Am Med Inform Assoc. 2020 Jan 1;27(1):31-38. doi: 10.1093/jamia/ocz100.

使用分类和序列到序列方法从临床记录文本中提取健康的社会决定因素。

Extracting social determinants of health from clinical note text with classification and sequence-to-sequence approaches.

机构信息

Nuance Communications, Burlington, Massachusetts, USA.

Microsoft, Redmond, Washington, USA.

出版信息

J Am Med Inform Assoc. 2023 Jul 19;30(8):1448-1455. doi: 10.1093/jamia/ocad071.

DOI:10.1093/jamia/ocad071
PMID:37100768
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10354779/
Abstract

OBJECTIVE

Social determinants of health (SDOH) are nonmedical factors that can influence health outcomes. This paper seeks to extract SDOH from clinical texts in the context of the National NLP Clinical Challenges (n2c2) 2022 Track 2 Task.

MATERIALS AND METHODS

Annotated and unannotated data from the Medical Information Mart for Intensive Care III (MIMIC-III) corpus, the Social History Annotation Corpus, and an in-house corpus were used to develop 2 deep learning models that used classification and sequence-to-sequence (seq2seq) approaches.

RESULTS

The seq2seq approach had the highest overall F1 scores in the challenge's 3 subtasks: 0.901 on the extraction subtask, 0.774 on the generalizability subtask, and 0.889 on the learning transfer subtask.

DISCUSSION

Both approaches rely on SDOH event representations that were designed to be compatible with transformer-based pretrained models, with the seq2seq representation supporting an arbitrary number of overlapping and sentence-spanning events. Models with adequate performance could be produced quickly, and the remaining mismatch between representation and task requirements was then addressed in postprocessing. The classification approach used rules to generate entity relationships from its sequence of token labels, while the seq2seq approach used constrained decoding and a constraint solver to recover entity text spans from its sequence of potentially ambiguous tokens.

CONCLUSION

We proposed 2 different approaches to extract SDOH from clinical texts with high accuracy. However, accuracy suffers on text from new healthcare institutions not present in the training data, and thus generalization remains an important topic for future study.

摘要

目的

健康的社会决定因素(SDOH)是非医学因素,可影响健康结果。本文旨在从 National NLP Clinical Challenges (n2c2) 2022 第 2 轨道 2 任务的临床文本中提取 SDOH。

材料与方法

使用来自 Medical Information Mart for Intensive Care III (MIMIC-III) 语料库、社会历史标注语料库和内部语料库的标注和未标注数据,开发了 2 种深度学习模型,分别使用分类和序列到序列(seq2seq)方法。

结果

seq2seq 方法在挑战的 3 个子任务中总体 F1 得分最高:提取子任务为 0.901,泛化子任务为 0.774,学习迁移子任务为 0.889。

讨论

这两种方法都依赖于旨在与基于转换器的预训练模型兼容的 SDOH 事件表示,seq2seq 表示支持任意数量的重叠和跨句事件。可以快速生成具有足够性能的模型,然后在后期处理中解决表示和任务要求之间的剩余不匹配问题。分类方法使用规则从其令牌标签序列生成实体关系,而 seq2seq 方法使用约束解码和约束求解器从其潜在歧义令牌的序列中恢复实体文本跨度。

结论

我们提出了 2 种从临床文本中准确提取 SDOH 的不同方法。然而,在训练数据中不存在的新医疗机构的文本上,准确性会受到影响,因此泛化仍然是未来研究的一个重要课题。