• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于使用自然语言处理技术,利用电子健康记录中的非结构化数据来实施目标试验框架。

On the use of natural language processing to implement the target trial framework using unstructured data from the electronic health record.

作者信息

Rafalko Nicole, Gianfrancesco Milena, Goldstein Neal D

机构信息

Department of Epidemiology and Biostatistics, Drexel University Dornsife School of Public Health, Philadelphia, PA, USA.

Division of Rheumatology, School of Medicine, University of California, San Francisco, CA, USA.

出版信息

Glob Epidemiol. 2025 May 8;9:100204. doi: 10.1016/j.gloepi.2025.100204. eCollection 2025 Jun.

DOI:10.1016/j.gloepi.2025.100204
PMID:40476041
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12140070/
Abstract

The increasing availability and accessibility of electronic health record (EHR) data has made it a rich secondary source to conduct comparative effectiveness studies. To perform such studies, many researchers are turning to the target trial framework (TTF) to emulate the hypothetical randomized clinical trial. The quality of this emulation depends, in part, on the availability and accessibility of data for each component of the TTF. Yet one overarching challenge with using EHR data is that unstructured fields, such as clinical encounter notes, contain copious details on the patient yet require additional steps to extract if needed in the conduct of the study. Natural language processing (NLP) represents a spectrum of methods to assist with automating this extraction, from simpler rule-based methods to machine learning and artificial intelligence approaches that can handle complex language structures. What follows is a discussion on how NLP methods can augment information and data for researchers looking to estimate a treatment effect using EHR data via the TTF to emulate the hypothetical clinical trial. We conclude with recommendations for researchers interested in using NLP methods to obtain data stored in the free text of the EHR as well as considerations regarding the quality and validity of this data for the TTF.

摘要

电子健康记录(EHR)数据的可得性和可及性不断提高,使其成为进行比较效果研究的丰富二级数据源。为开展此类研究,许多研究人员正转向目标试验框架(TTF)以模拟假设的随机临床试验。这种模拟的质量部分取决于TTF各组成部分数据的可得性和可及性。然而,使用EHR数据的一个首要挑战是,诸如临床会诊记录等非结构化字段包含有关患者的大量细节,但在研究过程中如需提取则需要额外步骤。自然语言处理(NLP)代表了一系列有助于自动化此提取过程的方法,从更简单的基于规则的方法到能够处理复杂语言结构的机器学习和人工智能方法。以下是关于NLP方法如何为希望通过TTF使用EHR数据来估计治疗效果以模拟假设临床试验的研究人员增加信息和数据的讨论。我们最后为有兴趣使用NLP方法获取存储在EHR自由文本中的数据的研究人员提供建议,以及关于此数据对TTF的质量和有效性的考虑因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c18/12140070/1eceaf77d4dd/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c18/12140070/1eceaf77d4dd/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c18/12140070/1eceaf77d4dd/gr1.jpg

相似文献

1
On the use of natural language processing to implement the target trial framework using unstructured data from the electronic health record.关于使用自然语言处理技术,利用电子健康记录中的非结构化数据来实施目标试验框架。
Glob Epidemiol. 2025 May 8;9:100204. doi: 10.1016/j.gloepi.2025.100204. eCollection 2025 Jun.
2
Leveraging Natural Language Processing and Machine Learning Methods for Adverse Drug Event Detection in Electronic Health/Medical Records: A Scoping Review.利用自然语言处理和机器学习方法在电子健康/医疗记录中进行药物不良事件检测:一项范围综述
Drug Saf. 2025 Apr;48(4):321-337. doi: 10.1007/s40264-024-01505-6. Epub 2025 Jan 9.
3
Getting More Out of Large Databases and EHRs with Natural Language Processing and Artificial Intelligence: The Future Is Here.借助自然语言处理和人工智能从大型数据库和电子健康记录中获取更多价值:未来已来。
J Bone Joint Surg Am. 2022 Oct 19;104(Suppl 3):51-55. doi: 10.2106/JBJS.22.00567.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
The Growing Impact of Natural Language Processing in Healthcare and Public Health.自然语言处理在医疗保健和公共卫生领域的影响日益扩大。
Inquiry. 2024 Jan-Dec;61:469580241290095. doi: 10.1177/00469580241290095.
6
Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review.慢性病临床记录的自然语言处理:系统综述
JMIR Med Inform. 2019 Apr 27;7(2):e12239. doi: 10.2196/12239.
7
Deployment of Real-time Natural Language Processing and Deep Learning Clinical Decision Support in the Electronic Health Record: Pipeline Implementation for an Opioid Misuse Screener in Hospitalized Adults.电子健康记录中实时自然语言处理和深度学习临床决策支持的应用:成年住院患者阿片类药物滥用筛查器的流程实施
JMIR Med Inform. 2023 Apr 20;11:e44977. doi: 10.2196/44977.
8
Augmented intelligence with natural language processing applied to electronic health records for identifying patients with non-alcoholic fatty liver disease at risk for disease progression.应用自然语言处理的增强型人工智能用于电子健康记录,以识别非酒精性脂肪性肝病患者中疾病进展风险较高的患者。
Int J Med Inform. 2019 Sep;129:334-341. doi: 10.1016/j.ijmedinf.2019.06.028. Epub 2019 Jul 6.
9
Applications of natural language processing in ophthalmology: present and future.自然语言处理在眼科中的应用:现状与未来。
Front Med (Lausanne). 2022 Aug 8;9:906554. doi: 10.3389/fmed.2022.906554. eCollection 2022.
10
Extracting Critical Information from Unstructured Clinicians' Notes Data to Identify Dementia Severity Using a Rule-Based Approach: Feasibility Study.基于规则的方法从非结构化临床医生笔记数据中提取关键信息以识别痴呆严重程度的可行性研究。
JMIR Aging. 2024 Sep 24;7:e57926. doi: 10.2196/57926.

本文引用的文献

1
The Target Trial Framework for Causal Inference From Observational Data: Why and When Is It Helpful?用于从观察性数据进行因果推断的目标试验框架:为何以及何时有用?
Ann Intern Med. 2025 Mar;178(3):402-407. doi: 10.7326/ANNALS-24-01871. Epub 2025 Feb 18.
2
Testing and Evaluation of Health Care Applications of Large Language Models: A Systematic Review.大语言模型在医疗保健应用中的测试与评估:一项系统综述。
JAMA. 2025 Jan 28;333(4):319-328. doi: 10.1001/jama.2024.21700.
3
A toolbox for surfacing health equity harms and biases in large language models.
一个用于揭示大语言模型中健康公平性危害和偏见的工具箱。
Nat Med. 2024 Dec;30(12):3590-3600. doi: 10.1038/s41591-024-03258-2. Epub 2024 Sep 23.
4
Structuring medication signeturs as a language regression task: comparison of zero- and few-shot GPT with fine-tuned models.将药物签名构建为语言回归任务:零样本和少样本GPT与微调模型的比较。
JAMIA Open. 2024 Jun 18;7(2):ooae051. doi: 10.1093/jamiaopen/ooae051. eCollection 2024 Jul.
5
A Qualitative Study of Physicians' Views on the Reuse of Electronic Health Record Data for Secondary Analysis.关于医生对电子健康记录数据二次分析复用观点的定性研究。
Qual Health Res. 2024 Jun 3:10497323241245644. doi: 10.1177/10497323241245644.
6
Examining Linguistic Differences in Electronic Health Records for Diverse Patients With Diabetes: Natural Language Processing Analysis.探究不同糖尿病患者电子健康记录中的语言差异:自然语言处理分析
JMIR Med Inform. 2024 May 23;12:e50428. doi: 10.2196/50428.
7
Target trial emulation for comparative effectiveness research with observational data: Promise and challenges for studying medications for opioid use disorder.基于观察性数据的药物比较有效性研究的目标模拟试验:阿片类药物使用障碍治疗药物研究的前景与挑战。
Addiction. 2024 Jul;119(7):1313-1321. doi: 10.1111/add.16473. Epub 2024 Mar 22.
8
Large language models to identify social determinants of health in electronic health records.利用大语言模型识别电子健康记录中的健康社会决定因素。
NPJ Digit Med. 2024 Jan 11;7(1):6. doi: 10.1038/s41746-023-00970-0.
9
Using Electronic Health Records and Linked Claims Data to Assess New Medication Use and Primary Nonadherence in Rheumatology Patients.利用电子健康记录和关联索赔数据评估风湿病患者的新用药和主要不依从情况。
Arthritis Care Res (Hoboken). 2024 Apr;76(4):550-558. doi: 10.1002/acr.25269. Epub 2024 Jan 18.
10
Large language models propagate race-based medicine.大语言模型传播基于种族的医学观念。
NPJ Digit Med. 2023 Oct 20;6(1):195. doi: 10.1038/s41746-023-00939-z.