Discerning tumor status from unstructured MRI reports--completeness of information in existing reports and utility of automated natural language processing.

Affiliation

Department of Radiology, Mayo Clinic, Rochester, MN 55905, USA.

Publication information

J Digit Imaging. 2010 Apr;23(2):119-32. doi: 10.1007/s10278-009-9215-7. Epub 2009 May 30.

DOI:10.1007/s10278-009-9215-7
PMID:19484309
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2837158/
Abstract

Information in electronic medical records is often in an unstructured free-text format. This format presents challenges for expedient data retrieval and may fail to convey important findings. Natural language processing (NLP) is an emerging technique for rapid and efficient clinical data retrieval. While proven in disease detection, the utility of NLP in discerning disease progression from free-text reports is untested. We aimed to (1) assess whether unstructured radiology reports contained sufficient information for tumor status classification; (2) develop an NLP-based data extraction tool to determine tumor status from unstructured reports; and (3) compare NLP and human tumor status classification outcomes. Consecutive follow-up brain tumor magnetic resonance imaging reports (2000--2007) from a tertiary center were manually annotated using consensus guidelines on tumor status. Reports were randomized to NLP training (70%) or testing (30%) groups. The NLP tool utilized a support vector machines model with statistical and rule-based outcomes. Most reports had sufficient information for tumor status classification, although 0.8% did not describe status despite reference to prior examinations. Tumor size was unreported in 68.7% of documents, while 50.3% lacked data on change magnitude when there was detectable progression or regression. Using retrospective human classification as the gold standard, NLP achieved 80.6% sensitivity and 91.6% specificity for tumor status determination (mean positive predictive value, 82.4%; negative predictive value, 92.0%). In conclusion, most reports contained sufficient information for tumor status determination, though variable features were used to describe status. NLP demonstrated good accuracy for tumor status classification and may have novel application for automated disease status classification from electronic databases.
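
The evaluation described in the abstract (sensitivity, specificity, and mean predictive values against a human gold standard) can be illustrated with a minimal Python sketch. It is not the authors' tool: the label names, the example labels, and the helper names `one_vs_rest_metrics` and `macro_average` are hypothetical, and the abstract does not say how the "mean" values were computed, so the unweighted per-class average used here is an assumption.

```python
from collections import Counter


def one_vs_rest_metrics(gold, predicted, positive):
    """Sensitivity, specificity, PPV and NPV with one class treated as positive."""
    counts = Counter()
    for g, p in zip(gold, predicted):
        if g == positive and p == positive:
            counts["tp"] += 1
        elif g != positive and p == positive:
            counts["fp"] += 1
        elif g == positive and p != positive:
            counts["fn"] += 1
        else:
            counts["tn"] += 1
    tp, fp, fn, tn = (counts[k] for k in ("tp", "fp", "fn", "tn"))

    def ratio(numerator, denominator):
        # Return NaN rather than raising when a class never occurs.
        return numerator / denominator if denominator else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
    }


def macro_average(gold, predicted):
    """Unweighted mean of each metric over all gold-standard classes (an assumption)."""
    labels = sorted(set(gold))
    per_class = [one_vs_rest_metrics(gold, predicted, c) for c in labels]
    return {k: sum(m[k] for m in per_class) / len(per_class) for k in per_class[0]}


# Hypothetical labels for six reports; these are NOT data from the study.
gold = ["stable", "stable", "progression", "regression", "progression", "stable"]
predicted = ["stable", "progression", "progression", "regression", "stable", "stable"]

print(one_vs_rest_metrics(gold, predicted, "progression"))
print(macro_average(gold, predicted))
```

Treating each status in turn as the positive class and averaging is one common way to report a single figure for a multi-class classifier; a weighted average would give more influence to frequent classes such as stable disease.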
