• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用商用自然语言处理工具从口述门诊咨询记录中提取临床特征:试点、回顾性、横断面验证研究。

Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study.

作者信息

Petch Jeremy, Batt Jane, Murray Joshua, Mamdani Muhammad

机构信息

Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada.

Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada.

出版信息

JMIR Med Inform. 2019 Nov 1;7(4):e12575. doi: 10.2196/12575.

DOI:10.2196/12575
PMID:31682579
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6913750/
Abstract

BACKGROUND

The increasing adoption of electronic health records (EHRs) in clinical practice holds the promise of improving care and advancing research by serving as a rich source of data, but most EHRs allow clinicians to enter data in a text format without much structure. Natural language processing (NLP) may reduce reliance on manual abstraction of these text data by extracting clinical features directly from unstructured clinical digital text data and converting them into structured data.

OBJECTIVE

This study aimed to assess the performance of a commercially available NLP tool for extracting clinical features from free-text consult notes.

METHODS

We conducted a pilot, retrospective, cross-sectional study of the accuracy of NLP from dictated consult notes from our tuberculosis clinic with manual chart abstraction as the reference standard. Consult notes for 130 patients were extracted and processed using NLP. We extracted 15 clinical features from these consult notes and grouped them a priori into categories of simple, moderate, and complex for analysis.

RESULTS

For the primary outcome of overall accuracy, NLP performed best for features classified as simple, achieving an overall accuracy of 96% (95% CI 94.3-97.6). Performance was slightly lower for features of moderate clinical and linguistic complexity at 93% (95% CI 91.1-94.4), and lowest for complex features at 91% (95% CI 87.3-93.1).

CONCLUSIONS

The findings of this study support the use of NLP for extracting clinical features from dictated consult notes in the setting of a tuberculosis clinic. Further research is needed to fully establish the validity of NLP for this and other purposes.

摘要

背景

临床实践中电子健康记录(EHR)的使用日益增加,有望通过作为丰富的数据来源来改善医疗服务并推进研究,但大多数电子健康记录允许临床医生以文本格式输入数据,结构较少。自然语言处理(NLP)可以通过直接从非结构化临床数字文本数据中提取临床特征并将其转换为结构化数据,减少对这些文本数据人工提取的依赖。

目的

本研究旨在评估一种商用自然语言处理工具从自由文本会诊记录中提取临床特征的性能。

方法

我们进行了一项试点、回顾性横断面研究,以人工图表提取为参考标准,评估自然语言处理从我们结核病诊所的口述会诊记录中提取信息的准确性。使用自然语言处理提取并处理了130名患者的会诊记录。我们从这些会诊记录中提取了15个临床特征,并将它们预先分为简单、中等和复杂三类进行分析。

结果

对于总体准确性这一主要结果,自然语言处理在分类为简单的特征上表现最佳,总体准确率达到96%(95%可信区间94.3 - 97.6)。对于临床和语言复杂性中等的特征,性能略低,为93%(95%可信区间91.1 - 94.4),而对于复杂特征最低,为91%(95%可信区间87.3 - 93.1)。

结论

本研究结果支持在结核病诊所环境中使用自然语言处理从口述会诊记录中提取临床特征。需要进一步研究以充分确立自然语言处理在此及其他目的方面的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9240/6913750/032c278f4787/medinform_v7i4e12575_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9240/6913750/032c278f4787/medinform_v7i4e12575_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9240/6913750/032c278f4787/medinform_v7i4e12575_fig1.jpg

相似文献

1
Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study.使用商用自然语言处理工具从口述门诊咨询记录中提取临床特征:试点、回顾性、横断面验证研究。
JMIR Med Inform. 2019 Nov 1;7(4):e12575. doi: 10.2196/12575.
2
Measuring Adoption of Patient Priorities-Aligned Care Using Natural Language Processing of Electronic Health Records: Development and Validation of the Model.利用电子健康记录的自然语言处理来衡量患者优先事项匹配护理的采用情况:模型的开发与验证
JMIR Med Inform. 2021 Feb 19;9(2):e18756. doi: 10.2196/18756.
3
Data for registry and quality review can be retrospectively collected using natural language processing from unstructured charts of arthroplasty patients.可以使用自然语言处理从关节置换患者的非结构化图表中回顾性地收集注册和质量审查数据。
Bone Joint J. 2020 Jul;102-B(7_Supple_B):99-104. doi: 10.1302/0301-620X.102B7.BJJ-2019-1574.R1.
4
Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation.用于检测全髋关节置换术手术记录中常见数据元素的自然语言处理算法的多中心验证:算法开发与验证
JMIR Med Inform. 2022 Aug 31;10(8):e38155. doi: 10.2196/38155.
5
Validation of a Zero-shot Learning Natural Language Processing Tool to Facilitate Data Abstraction for Urologic Research.用于促进泌尿外科研究数据提取的零样本学习自然语言处理工具的验证
Eur Urol Focus. 2024 Mar;10(2):279-287. doi: 10.1016/j.euf.2024.01.009. Epub 2024 Jan 25.
6
Assessment of Natural Language Processing of Electronic Health Records to Measure Goals-of-Care Discussions as a Clinical Trial Outcome.评估电子健康记录中的自然语言处理以衡量作为临床试验结局的照护目标讨论。
JAMA Netw Open. 2023 Mar 1;6(3):e231204. doi: 10.1001/jamanetworkopen.2023.1204.
7
Natural Language Processing to Identify Advance Care Planning Documentation in a Multisite Pragmatic Clinical Trial.自然语言处理在多中心实用临床试验中识别预先医疗照护计划文件。
J Pain Symptom Manage. 2022 Jan;63(1):e29-e36. doi: 10.1016/j.jpainsymman.2021.06.025. Epub 2021 Jul 14.
8
Natural Language Processing of Clinical Notes to Identify Mental Illness and Substance Use Among People Living with HIV: Retrospective Cohort Study.利用临床记录的自然语言处理技术识别HIV感染者中的精神疾病和药物使用情况:回顾性队列研究
JMIR Med Inform. 2021 Mar 10;9(3):e23456. doi: 10.2196/23456.
9
Extracting Medical Information From Free-Text and Unstructured Patient-Generated Health Data Using Natural Language Processing Methods: Feasibility Study With Real-world Data.使用自然语言处理方法从自由文本和非结构化患者生成的健康数据中提取医学信息:基于真实世界数据的可行性研究
JMIR Form Res. 2023 Mar 7;7:e43014. doi: 10.2196/43014.
10
External Validation of Natural Language Processing Algorithms to Extract Common Data Elements in THA Operative Notes.THA 手术记录中常用数据元素的自然语言处理算法的外部验证。
J Arthroplasty. 2023 Oct;38(10):2081-2084. doi: 10.1016/j.arth.2022.10.031. Epub 2022 Oct 22.

引用本文的文献

1
Scalable information extraction from free text electronic health records using large language models.使用大语言模型从自由文本电子健康记录中进行可扩展的信息提取。
BMC Med Res Methodol. 2025 Jan 28;25(1):23. doi: 10.1186/s12874-025-02470-z.
2
A Pilot Report on Extracting Symptom Onset Date and Time from Clinical Notes in Patients Presenting with Chest Pain.一份关于从胸痛患者临床记录中提取症状发作日期和时间的初步报告。
medRxiv. 2024 Dec 31:2024.12.26.24319658. doi: 10.1101/2024.12.26.24319658.
3
Using large language models for extracting stressful life events to assess their impact on preventive colon cancer screening adherence.

本文引用的文献

1
Learning Health System for Breast Cancer: Pilot Project Experience.乳腺癌学习型健康系统:试点项目经验
JCO Clin Cancer Inform. 2019 Aug;3:1-11. doi: 10.1200/CCI.19.00032.
2
Automatically identifying social isolation from clinical narratives for patients with prostate Cancer.自动识别前列腺癌患者临床叙述中的社会孤立现象。
BMC Med Inform Decis Mak. 2019 Mar 14;19(1):43. doi: 10.1186/s12911-019-0795-y.
3
Using natural language processing and machine learning to identify breast cancer local recurrence.利用自然语言处理和机器学习识别乳腺癌局部复发。
使用大语言模型提取应激性生活事件以评估其对预防性结肠癌筛查依从性的影响。
BMC Public Health. 2025 Jan 2;25(1):12. doi: 10.1186/s12889-024-21123-2.
4
Real-World Treatment Patterns and Clinical Outcomes among Patients Receiving CDK4/6 Inhibitors for Metastatic Breast Cancer in a Canadian Setting Using AI-Extracted Data.在加拿大利用 AI 提取数据观察接受 CDK4/6 抑制剂治疗转移性乳腺癌患者的真实世界治疗模式和临床结局。
Curr Oncol. 2024 Apr 9;31(4):2172-2184. doi: 10.3390/curroncol31040161.
5
Real-World Outcomes of Patients with Advanced Epidermal Growth Factor Receptor-Mutated Non-Small Cell Lung Cancer in Canada Using Data Extracted by Large Language Model-Based Artificial Intelligence.利用基于大语言模型的人工智能提取的数据观察加拿大晚期表皮生长因子受体突变型非小细胞肺癌患者的真实世界结局。
Curr Oncol. 2024 Apr 2;31(4):1947-1960. doi: 10.3390/curroncol31040146.
6
Developing a Data and Analytics Platform to Enable a Breast Cancer Learning Health System at a Regional Cancer Center.开发数据和分析平台,以在区域癌症中心实现乳腺癌学习型健康系统。
JCO Clin Cancer Inform. 2023 Mar;7:e2200182. doi: 10.1200/CCI.22.00182.
7
Automating Access to Real-World Evidence.实现真实世界证据获取的自动化。
JTO Clin Res Rep. 2022 May 17;3(6):100340. doi: 10.1016/j.jtocrr.2022.100340. eCollection 2022 Jun.
8
A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study.一项使用自然语言处理评估放射性肺炎发展情况的半自动病历审查:诊断准确性和可行性研究
JMIR Med Inform. 2021 Nov 12;9(11):e29241. doi: 10.2196/29241.
9
Automated Categorization of Systemic Disease and Duration From Electronic Medical Record System Data Using Finite-State Machine Modeling: Prospective Validation Study.使用有限状态机建模从电子病历系统数据中自动分类全身性疾病及其病程:前瞻性验证研究
JMIR Form Res. 2020 Dec 17;4(12):e24490. doi: 10.2196/24490.
BMC Bioinformatics. 2018 Dec 28;19(Suppl 17):498. doi: 10.1186/s12859-018-2466-x.
4
Identifying Falls Risk Screenings Not Documented with Administrative Codes Using Natural Language Processing.使用自然语言处理识别未用行政代码记录的跌倒风险筛查。
AMIA Annu Symp Proc. 2018 Apr 16;2017:1923-1930. eCollection 2017.
5
Detecting clinically relevant new information in clinical notes across specialties and settings.检测跨专业和设置的临床记录中的临床相关新信息。
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):68. doi: 10.1186/s12911-017-0464-y.
6
Feasibility of extracting data from electronic medical records for research: an international comparative study.从电子病历中提取数据用于研究的可行性:一项国际比较研究。
BMC Med Inform Decis Mak. 2016 Jul 13;16:90. doi: 10.1186/s12911-016-0332-1.
7
Natural Language Processing in Radiology: A Systematic Review.自然语言处理在放射学中的应用:系统评价。
Radiology. 2016 May;279(2):329-43. doi: 10.1148/radiol.16142770.
8
Extracting Concepts Related to Homelessness from the Free Text of VA Electronic Medical Records.从退伍军人事务部电子病历的自由文本中提取与无家可归相关的概念。
AMIA Annu Symp Proc. 2014 Nov 14;2014:589-98. eCollection 2014.
9
Using electronic medical records to enable large-scale studies in psychiatry: treatment resistant depression as a model.利用电子病历进行精神病学的大规模研究:以治疗抵抗性抑郁症为模型。
Psychol Med. 2012 Jan;42(1):41-50. doi: 10.1017/S0033291711000997. Epub 2011 Jun 20.
10
Natural language processing for the development of a clinical registry: a validation study in intraductal papillary mucinous neoplasms.自然语言处理在临床注册中的应用:一项关于导管内乳头状黏液性肿瘤的验证研究。
HPB (Oxford). 2010 Dec;12(10):688-95. doi: 10.1111/j.1477-2574.2010.00235.x.