• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从叙述性报告中解锁临床数据:一项自然语言处理研究

Unlocking clinical data from narrative reports: a study of natural language processing.

作者信息

Hripcsak G, Friedman C, Alderson P O, DuMouchel W, Johnson S B, Clayton P D

机构信息

Department of Medical Informatics, Columbia-Presbyterian Medical Center, New York, NY 10032, USA.

出版信息

Ann Intern Med. 1995 May 1;122(9):681-8. doi: 10.7326/0003-4819-122-9-199505010-00007.

DOI:10.7326/0003-4819-122-9-199505010-00007
PMID:7702231
Abstract

OBJECTIVE

To evaluate the automated detection of clinical conditions described in narrative reports.

DESIGN

Automated methods and human experts detected the presence or absence of six clinical conditions in 200 admission chest radiograph reports.

STUDY SUBJECTS

A computerized, general-purpose natural language processor; 6 internists; 6 radiologists; 6 lay persons; and 3 other computer methods.

MAIN OUTCOME MEASURES

Intersubject disagreement was quantified by "distance" (the average number of clinical conditions per report on which two subjects disagreed) and by sensitivity and specificity with respect to the physicians.

RESULTS

Using a majority vote, physicians detected 101 conditions in the 200 reports (0.51 per report); the most common condition was acute bacterial pneumonia (prevalence, 0.14), and the least common was chronic obstructive pulmonary disease (prevalence, 0.03). Pairs of physicians disagreed on the presence of at least 1 condition for an average of 20% of reports. The average intersubject distance among physicians was 0.24 (95% Cl, 0.19 to 0.29) out of a maximum possible distance of 6. No physician had a significantly greater distance than the average. The average distance of the natural language processor from the physicians was 0.26 (Cl, 0.21 to 0.32; not significantly greater than the average among physicians). Lay persons and alternative computer methods had significantly greater distance from the physicians (all > 0.5). The natural language processor had a sensitivity of 81% (Cl, 73% to 87%) and a specificity of 98% (Cl, 97% to 99%); physicians had an average sensitivity of 85% and an average specificity of 98%.

CONCLUSIONS

Physicians disagreed on the interpretation of narrative reports, but this was not caused by outlier physicians or a consistent difference in the way internists and radiologists read reports. The natural language processor was not distinguishable from the physicians and was superior to all other comparison subjects. Although the domain of this study was restricted (six clinical conditions in chest radiographs), natural language processing seems to have the potential to extract clinical information from narrative reports in a manner that will support automated decision-support and clinical research.

摘要

目的

评估对叙述性报告中所描述临床状况的自动检测。

设计

自动方法和人类专家对200份入院胸部X光片报告中的六种临床状况的存在与否进行检测。

研究对象

一个计算机化的通用自然语言处理器;6名内科医生;6名放射科医生;6名非专业人员;以及其他3种计算机方法。

主要观察指标

通过“距离”(每份报告中两名受试者意见不一致的临床状况的平均数量)以及相对于医生的敏感性和特异性来量化受试者间的分歧。

结果

采用多数表决法,医生们在200份报告中检测到101种状况(每份报告0.51种);最常见的状况是急性细菌性肺炎(患病率0.14),最不常见的是慢性阻塞性肺疾病(患病率0.03)。平均20%的报告中,医生对至少一种状况的存在与否存在分歧。医生之间的平均受试者间距离为0.24(95%可信区间,0.19至0.29),最大可能距离为6。没有医生的距离显著大于平均距离。自然语言处理器与医生的平均距离为0.26(可信区间,0.21至0.32;不显著大于医生之间的平均距离)。非专业人员和其他计算机方法与医生的距离显著更大(均>0.5)。自然语言处理器的敏感性为81%(可信区间,73%至87%),特异性为98%(可信区间,97%至99%);医生的平均敏感性为85%,平均特异性为98%。

结论

医生对叙述性报告的解读存在分歧,但这并非由异常值医生或内科医生与放射科医生阅读报告方式的持续差异所致。自然语言处理器与医生无明显差异,且优于所有其他比较对象。尽管本研究的领域有限(胸部X光片中的六种临床状况),但自然语言处理似乎有潜力以支持自动决策辅助和临床研究的方式从叙述性报告中提取临床信息。

相似文献

1
Unlocking clinical data from narrative reports: a study of natural language processing.从叙述性报告中解锁临床数据:一项自然语言处理研究
Ann Intern Med. 1995 May 1;122(9):681-8. doi: 10.7326/0003-4819-122-9-199505010-00007.
2
Extracting findings from narrative reports: software transferability and sources of physician disagreement.从叙述性报告中提取结果:软件可转移性与医生意见分歧的根源
Methods Inf Med. 1998 Jan;37(1):1-7.
3
Automatic detection of acute bacterial pneumonia from chest X-ray reports.从胸部X光报告中自动检测急性细菌性肺炎。
J Am Med Inform Assoc. 2000 Nov-Dec;7(6):593-604. doi: 10.1136/jamia.2000.0070593.
4
A comparison of classification algorithms to automatically identify chest X-ray reports that support pneumonia.用于自动识别支持肺炎诊断的胸部X光报告的分类算法比较。
J Biomed Inform. 2001 Feb;34(1):4-14. doi: 10.1006/jbin.2001.1000.
5
Knowledge discovery and data mining to assist natural language understanding.知识发现与数据挖掘助力自然语言理解。
Proc AMIA Symp. 1998:835-9.
6
Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports.利用自然语言处理技术从包含889,921份胸部X光报告的数据库中翻译临床信息。
Radiology. 2002 Jul;224(1):157-63. doi: 10.1148/radiol.2241011118.
7
Automatic identification of pneumonia related concepts on chest x-ray reports.胸部X光报告中肺炎相关概念的自动识别。
Proc AMIA Symp. 1999:67-71.
8
Identification of suspected tuberculosis patients based on natural language processing of chest radiograph reports.基于胸部X光片报告的自然语言处理识别疑似肺结核患者
Proc AMIA Annu Fall Symp. 1996:542-6.
9
Classification algorithms applied to narrative reports.应用于叙述性报告的分类算法。
Proc AMIA Symp. 1999:455-9.
10
Tool-supported Interactive Correction and Semantic Annotation of Narrative Clinical Reports.叙事性临床报告的工具支持交互式校正与语义标注
Methods Inf Med. 2017 May 18;56(3):217-229. doi: 10.3414/ME16-01-0083. Epub 2017 Apr 28.

引用本文的文献

1
Iterative refinement and goal articulation to optimize large language models for clinical information extraction.迭代优化与目标阐述以优化用于临床信息提取的大语言模型
NPJ Digit Med. 2025 May 23;8(1):301. doi: 10.1038/s41746-025-01686-z.
2
Prompts to Table: Specification and Iterative Refinement for Clinical Information Extraction with Large Language Models.表格提示:用于使用大语言模型进行临床信息提取的规范与迭代优化
medRxiv. 2025 Apr 1:2025.02.11.25322107. doi: 10.1101/2025.02.11.25322107.
3
Large Language Models for Efficient Medical Information Extraction.
用于高效医学信息提取的大语言模型
AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:509-514. eCollection 2024.
4
Tasks as needs: reframing the paradigm of clinical natural language processing research for real-world decision support.任务需求:为现实世界的决策支持重新构建临床自然语言处理研究的范例。
J Am Med Inform Assoc. 2022 Sep 12;29(10):1810-1817. doi: 10.1093/jamia/ocac121.
5
A systematic review on natural language processing systems for eligibility prescreening in clinical research.自然语言处理系统在临床研究资格筛选中的应用:系统评价
J Am Med Inform Assoc. 2021 Dec 28;29(1):197-206. doi: 10.1093/jamia/ocab228.
6
Glimpsing the Impact of COVID19 Lock-Down on People With Epilepsy: A Text Mining Approach.洞察新冠疫情封锁对癫痫患者的影响:一种文本挖掘方法。
Front Neurol. 2020 Aug 19;11:870. doi: 10.3389/fneur.2020.00870. eCollection 2020.
7
Natural Language Processing of Serum Protein Electrophoresis Reports in the Veterans Affairs Health Care System.基于退伍军人事务医疗系统的血清蛋白电泳报告的自然语言处理。
JCO Clin Cancer Inform. 2020 Aug;4:749-756. doi: 10.1200/CCI.19.00167.
8
Text Processing for Detection of Fungal Ocular Involvement in Critical Care Patients: Cross-Sectional Study.文本处理在重症监护患者真菌感染眼部累及检测中的应用:一项横断面研究。
J Med Internet Res. 2020 Aug 14;22(8):e18855. doi: 10.2196/18855.
9
A deep-learning based system for accurate extraction of blood pressure data in clinical narratives.一种基于深度学习的系统,用于准确提取临床记录中的血压数据。
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:703-709. eCollection 2020.
10
Deep phenotyping: Embracing complexity and temporality-Towards scalability, portability, and interoperability.深度表型分析:拥抱复杂性和时间性——迈向可扩展性、便携性和互操作性。
J Biomed Inform. 2020 May;105:103433. doi: 10.1016/j.jbi.2020.103433. Epub 2020 Apr 23.