• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用大型语言模型从原始和去识别的医疗记录中提取复杂的健康社会决定因素摘要:开发和验证研究。

Using Large Language Models to Abstract Complex Social Determinants of Health From Original and Deidentified Medical Notes: Development and Validation Study.

机构信息

Institute for Systems Biology, Seattle, WA, United States.

Providence Health & Services, Renton, WA, United States.

出版信息

J Med Internet Res. 2024 Nov 19;26:e63445. doi: 10.2196/63445.

DOI:10.2196/63445
PMID:39561354
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11615547/
Abstract

BACKGROUND

Social determinants of health (SDoH) such as housing insecurity are known to be intricately linked to patients' health status. More efficient methods for abstracting structured data on SDoH can help accelerate the inclusion of exposome variables in biomedical research and support health care systems in identifying patients who could benefit from proactive outreach. Large language models (LLMs) developed from Generative Pre-trained Transformers (GPTs) have shown potential for performing complex abstraction tasks on unstructured clinical notes.

OBJECTIVE

Here, we assess the performance of GPTs on identifying temporal aspects of housing insecurity and compare results between both original and deidentified notes.

METHODS

We compared the ability of GPT-3.5 and GPT-4 to identify instances of both current and past housing instability, as well as general housing status, from 25,217 notes from 795 pregnant women. Results were compared with manual abstraction, a named entity recognition model, and regular expressions.

RESULTS

Compared with GPT-3.5 and the named entity recognition model, GPT-4 had the highest performance and had a much higher recall (0.924) than human abstractors (0.702) in identifying patients experiencing current or past housing instability, although precision was lower (0.850) compared with human abstractors (0.971). GPT-4's precision improved slightly (0.936 original, 0.939 deidentified) on deidentified versions of the same notes, while recall dropped (0.781 original, 0.704 deidentified).

CONCLUSIONS

This work demonstrates that while manual abstraction is likely to yield slightly more accurate results overall, LLMs can provide a scalable, cost-effective solution with the advantage of greater recall. This could support semiautomated abstraction, but given the potential risk for harm, human review would be essential before using results for any patient engagement or care decisions. Furthermore, recall was lower when notes were deidentified prior to LLM abstraction.

摘要

背景

健康的社会决定因素(如住房无保障)与患者的健康状况密切相关,这是众所周知的。更有效的方法来提取关于社会决定因素的结构化数据,可以帮助加速外显子组变量纳入生物医学研究,并支持医疗保健系统识别可能受益于主动外展的患者。基于生成式预训练转换器(Generative Pre-trained Transformers,GPTs)开发的大型语言模型(Large language models,LLMs)已显示出在非结构化临床记录上执行复杂抽象任务的潜力。

目的

在这里,我们评估 GPT 在识别住房无保障的时间方面的性能,并比较原始和去识别记录之间的结果。

方法

我们比较了 GPT-3.5 和 GPT-4 从 795 名孕妇的 25217 份记录中识别当前和过去住房不稳定以及一般住房状况实例的能力。结果与人工抽象、命名实体识别模型和正则表达式进行了比较。

结果

与 GPT-3.5 和命名实体识别模型相比,GPT-4 在识别当前或过去住房不稳定的患者方面具有最高的性能,召回率(0.924)明显高于人工抽象者(0.702),尽管精度较低(0.850)。在相同记录的去识别版本上,GPT-4 的精度略有提高(0.936 原始,0.939 去识别),而召回率下降(0.781 原始,0.704 去识别)。

结论

这项工作表明,虽然人工抽象可能总体上产生更准确的结果,但 LLM 可以提供一种可扩展、具有成本效益的解决方案,具有更高的召回率优势。这可以支持半自动抽象,但考虑到潜在的伤害风险,在将结果用于任何患者参与或护理决策之前,人工审查是必不可少的。此外,在对 LLM 抽象之前对记录进行去识别时,召回率较低。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ab0/11615547/c1813107702c/jmir_v26i1e63445_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ab0/11615547/c1813107702c/jmir_v26i1e63445_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ab0/11615547/c1813107702c/jmir_v26i1e63445_fig1.jpg

相似文献

1
Using Large Language Models to Abstract Complex Social Determinants of Health From Original and Deidentified Medical Notes: Development and Validation Study.利用大型语言模型从原始和去识别的医疗记录中提取复杂的健康社会决定因素摘要:开发和验证研究。
J Med Internet Res. 2024 Nov 19;26:e63445. doi: 10.2196/63445.
2
Using Large Language Models to Annotate Complex Cases of Social Determinants of Health in Longitudinal Clinical Records.使用大语言模型注释纵向临床记录中健康社会决定因素的复杂病例。
medRxiv. 2024 Apr 27:2024.04.25.24306380. doi: 10.1101/2024.04.25.24306380.
3
Validation of a Zero-shot Learning Natural Language Processing Tool to Facilitate Data Abstraction for Urologic Research.用于促进泌尿外科研究数据提取的零样本学习自然语言处理工具的验证
Eur Urol Focus. 2024 Mar;10(2):279-287. doi: 10.1016/j.euf.2024.01.009. Epub 2024 Jan 25.
4
Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study.衡量实用文本挖掘方法在电子健康记录中的自由文本记录中识别住房问题患者的价值:一项回顾性队列研究的结果。
Front Public Health. 2021 Aug 27;9:697501. doi: 10.3389/fpubh.2021.697501. eCollection 2021.
5
Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study.生成式大语言模型与同行用户对解释非专业患者实验室检测结果的答案质量比较:评估研究。
J Med Internet Res. 2024 Apr 17;26:e56655. doi: 10.2196/56655.
6
Identifying signs and symptoms of urinary tract infection from emergency department clinical notes using large language models.利用大语言模型从急诊科临床记录中识别尿路感染的体征和症状。
Acad Emerg Med. 2024 Jun;31(6):599-610. doi: 10.1111/acem.14883. Epub 2024 Apr 3.
7
Scalable information extraction from free text electronic health records using large language models.使用大语言模型从自由文本电子健康记录中进行可扩展的信息提取。
BMC Med Res Methodol. 2025 Jan 28;25(1):23. doi: 10.1186/s12874-025-02470-z.
8
A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records.基于大型语言模型的生成式自然语言处理框架,在临床笔记上进行了微调,能够从电子健康记录中准确提取头痛频率。
Headache. 2024 Apr;64(4):400-409. doi: 10.1111/head.14702. Epub 2024 Mar 25.
9
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
10
Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records.基于自然语言处理的电子健康记录中阿尔茨海默病及相关痴呆症社会决定因素的识别。
Health Serv Res. 2023 Dec;58(6):1292-1302. doi: 10.1111/1475-6773.14210. Epub 2023 Aug 3.

引用本文的文献

1
Leveraging large language models for the deidentification and temporal normalization of sensitive health information in electronic health records.利用大语言模型对电子健康记录中的敏感健康信息进行去识别化处理和时间标准化。
NPJ Digit Med. 2025 Aug 13;8(1):517. doi: 10.1038/s41746-025-01921-7.
2
Unveiling social determinants of health impact on adverse pregnancy outcomes through natural language processing.通过自然语言处理揭示健康的社会决定因素对不良妊娠结局的影响。
Sci Rep. 2025 Aug 9;15(1):29183. doi: 10.1038/s41598-025-13542-x.
3
Extracting Multifaceted Characteristics of Patients With Chronic Disease Comorbidity: Framework Development Using Large Language Models.

本文引用的文献

1
Retrieving Evidence from EHRs with LLMs: Possibilities and Challenges.利用大语言模型从电子健康记录中检索证据:可能性与挑战。
Proc Mach Learn Res. 2024 Jun;248:489-505.
2
Artificial intelligence, ChatGPT, and other large language models for social determinants of health: Current state and future directions.人工智能、ChatGPT 及其他用于健康社会决定因素的大语言模型:现状与未来方向。
Cell Rep Med. 2024 Jan 16;5(1):101356. doi: 10.1016/j.xcrm.2023.101356.
3
Large language models to identify social determinants of health in electronic health records.
提取慢性病合并症患者的多方面特征:使用大语言模型进行框架开发
JMIR Med Inform. 2025 May 15;13:e70096. doi: 10.2196/70096.
利用大语言模型识别电子健康记录中的健康社会决定因素。
NPJ Digit Med. 2024 Jan 11;7(1):6. doi: 10.1038/s41746-023-00970-0.
4
Zero-shot interpretable phenotyping of postpartum hemorrhage using large language models.使用大语言模型对产后出血进行零样本可解释表型分析。
NPJ Digit Med. 2023 Nov 30;6(1):212. doi: 10.1038/s41746-023-00957-x.
5
Effect of COVID-19 vaccination and booster on maternal-fetal outcomes: a retrospective cohort study.COVID-19 疫苗接种和加强针对于母婴结局的影响:一项回顾性队列研究。
Lancet Digit Health. 2023 Sep;5(9):e594-e606. doi: 10.1016/S2589-7500(23)00093-6. Epub 2023 Aug 1.
6
Trends, Characteristics, and Maternal Morbidity Associated With Unhoused Status in Pregnancy.妊娠无家可归状态的趋势、特征及与母体发病率的关系。
JAMA Netw Open. 2023 Jul 3;6(7):e2326352. doi: 10.1001/jamanetworkopen.2023.26352.
7
Large language models encode clinical knowledge.大语言模型编码临床知识。
Nature. 2023 Aug;620(7972):172-180. doi: 10.1038/s41586-023-06291-2. Epub 2023 Jul 12.
8
Real-world integration of the protocol for responding to and assessing patients' assets, risks, and experiences tool to assess social determinants of health in the electronic medical record at an academic medical center.在一家学术医疗中心,将应对和评估患者资产、风险及体验的协议与工具实际整合到电子病历中,以评估健康的社会决定因素。
Digit Health. 2023 May 22;9:20552076231176652. doi: 10.1177/20552076231176652. eCollection 2023 Jan-Dec.
9
Leveraging natural language processing to augment structured social determinants of health data in the electronic health record.利用自然语言处理技术增强电子健康记录中的结构化社会决定因素健康数据。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1389-1397. doi: 10.1093/jamia/ocad073.
10
A large language model for electronic health records.用于电子健康记录的大型语言模型。
NPJ Digit Med. 2022 Dec 26;5(1):194. doi: 10.1038/s41746-022-00742-2.