基于自然语言处理的乳腺磁共振成像报告中成像观察和评估类别的自动提取。

Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing.

机构信息

Department of Radiology, Peking University First Hospital, Beijing 100034, China.

出版信息

Chin Med J (Engl). 2019 Jul 20;132(14):1673-1680. doi: 10.1097/CM9.0000000000000301.

DOI:10.1097/CM9.0000000000000301

PMID:31268905

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6759110/

Abstract

BACKGROUND

Structured reports are not widely used and thus most reports exist in the form of free text. The process of data extraction by experts is time-consuming and error-prone, whereas data extraction by natural language processing (NLP) is a potential solution that could improve diagnosis efficiency and accuracy. The purpose of this study was to evaluate an NLP program that determines American College of Radiology Breast Imaging Reporting and Data System (BI-RADS) descriptors and final assessment categories from breast magnetic resonance imaging (MRI) reports.

METHODS

This cross-sectional study involved 2330 breast MRI reports in the electronic medical record from 2009 to 2017. We used 1635 reports for the creation of a revised BI-RADS MRI lexicon and synonyms lists as well as the iterative development of an NLP system. The remaining 695 reports that were not used for developing the system were used as an independent test set for the final evaluation of the NLP system. The recall and precision of an NLP algorithm to detect the revised BI-RADS MRI descriptors and BI-RADS categories from the free-text reports were evaluated against a standard reference of manual human review.

RESULTS

There was a high level of agreement between two manual reviewers, with a κ value of 0.95. For all breast imaging reports, the NLP algorithm demonstrated a recall of 78.5% and a precision of 86.1% for correct identification of the revised BI-RADS MRI descriptors and the BI-RADS categories. NLP generated the total results in <1 s, whereas the manual reviewers averaged 3.38 and 3.23 min per report, respectively.

CONCLUSIONS

The NLP algorithm demonstrates high recall and precision for information extraction from free-text reports. This approach will help to narrow the gap between unstructured report text and structured data, which is needed in decision support and other applications.

摘要

背景

结构化报告并未得到广泛应用，因此大多数报告仍以自由文本的形式存在。专家进行数据提取的过程既耗时又容易出错，而自然语言处理（NLP）进行数据提取则是一种潜在的解决方案，它可以提高诊断效率和准确性。本研究的目的是评估一种 NLP 程序，该程序可从乳腺磁共振成像（MRI）报告中确定美国放射学院乳腺成像报告和数据系统（BI-RADS）的描述符和最终评估类别。

方法

本横断面研究纳入了 2009 年至 2017 年电子病历中的 2330 份乳腺 MRI 报告。我们使用 1635 份报告来创建修订后的 BI-RADS MRI 词汇表和同义词列表，以及迭代开发 NLP 系统。剩下的 695 份未用于开发系统的报告被用作独立测试集，用于最终评估 NLP 系统。我们评估了 NLP 算法从自由文本报告中检测修订后的 BI-RADS MRI 描述符和 BI-RADS 类别的召回率和准确率，与手动人工审查的标准参考进行了比较。

结果

两位手动审阅者之间存在高度一致性，κ 值为 0.95。对于所有乳腺影像学报告，NLP 算法在正确识别修订后的 BI-RADS MRI 描述符和 BI-RADS 类别方面的召回率为 78.5%，准确率为 86.1%。NLP 在不到 1 秒的时间内生成全部结果，而手动审阅者分别需要 3.38 分钟和 3.23 分钟来完成每份报告。

结论

NLP 算法在从自由文本报告中提取信息方面具有较高的召回率和准确率。这种方法将有助于缩小非结构化报告文本和结构化数据之间的差距，这是决策支持和其他应用程序所需要的。

相似文献

Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing.

Chin Med J (Engl). 2019 Jul 20;132(14):1673-1680. doi: 10.1097/CM9.0000000000000301.

The implementation of natural language processing to extract index lesions from breast magnetic resonance imaging reports.

BMC Med Inform Decis Mak. 2019 Dec 30;19(1):288. doi: 10.1186/s12911-019-0997-3.

Automated extraction of BI-RADS final assessment categories from radiology reports with natural language processing.

J Digit Imaging. 2013 Oct;26(5):989-94. doi: 10.1007/s10278-013-9616-5.

Using automatically extracted information from mammography reports for decision-support.

J Biomed Inform. 2016 Aug;62:224-31. doi: 10.1016/j.jbi.2016.07.001. Epub 2016 Jul 4.

Evaluating the accuracy of lung-RADS score extraction from radiology reports: Manual entry versus natural language processing.

Int J Med Inform. 2024 Nov;191:105580. doi: 10.1016/j.ijmedinf.2024.105580. Epub 2024 Jul 31.

Automatic classification and prioritisation of actionable BI-RADS categories using natural language processing models.

Clin Radiol. 2024 Jan;79(1):e1-e7. doi: 10.1016/j.crad.2023.09.009. Epub 2023 Sep 27.

Automatic abstraction of imaging observations with their characteristics from mammography reports.

J Am Med Inform Assoc. 2015 Apr;22(e1):e81-92. doi: 10.1136/amiajnl-2014-003009. Epub 2014 Oct 28.

Automated annotation and classification of BI-RADS assessment from radiology reports.

J Biomed Inform. 2017 May;69:177-187. doi: 10.1016/j.jbi.2017.04.011. Epub 2017 Apr 18.

Automated detection of ambiguity in BI-RADS assessment categories in mammography reports.

Stud Health Technol Inform. 2014;197:35-9.

引用本文的文献

Theory of radiologist interaction with instant messaging decision support tools: A sequential-explanatory study.

PLOS Digit Health. 2024 Feb 26;3(2):e0000297. doi: 10.1371/journal.pdig.0000297. eCollection 2024 Feb.

A scoping review of natural language processing of radiology reports in breast cancer.

Front Oncol. 2023 Apr 12;13:1160167. doi: 10.3389/fonc.2023.1160167. eCollection 2023.

Using a classification model for determining the value of liver radiological reports of patients with colorectal cancer.

Front Oncol. 2022 Nov 21;12:913806. doi: 10.3389/fonc.2022.913806. eCollection 2022.

A systematic review of natural language processing applied to radiology reports.

BMC Med Inform Decis Mak. 2021 Jun 3;21(1):179. doi: 10.1186/s12911-021-01533-7.

本文引用的文献

Natural Language Processing in Radiology: A Systematic Review.

Radiology. 2016 May;279(2):329-43. doi: 10.1148/radiol.16142770.

Natural Language Processing Technologies in Radiology Research and Clinical Applications.

Radiographics. 2016 Jan-Feb;36(1):176-91. doi: 10.1148/rg.2016150080.

Using natural language processing to extract mammographic findings.

J Biomed Inform. 2015 Apr;54:77-84. doi: 10.1016/j.jbi.2015.01.010. Epub 2015 Feb 3.

BI-RADS update.

Radiol Clin North Am. 2014 May;52(3):481-7. doi: 10.1016/j.rcl.2014.02.008.

Automated extraction of BI-RADS final assessment categories from radiology reports with natural language processing.

J Digit Imaging. 2013 Oct;26(5):989-94. doi: 10.1007/s10278-013-9616-5.

Repeat abdominal imaging examinations in a tertiary care hospital.

Am J Med. 2012 Feb;125(2):155-61. doi: 10.1016/j.amjmed.2011.03.031.

Automatically correlating clinical findings and body locations in radiology reports using MedLEE.

J Digit Imaging. 2012 Apr;25(2):240-9. doi: 10.1007/s10278-011-9411-0.

Global cancer statistics.

CA Cancer J Clin. 2011 Mar-Apr;61(2):69-90. doi: 10.3322/caac.20107. Epub 2011 Feb 4.

The ACR BI-RADS experience: learning from history.

J Am Coll Radiol. 2009 Dec;6(12):851-60. doi: 10.1016/j.jacr.2009.07.023.

Discerning tumor status from unstructured MRI reports--completeness of information in existing reports and utility of automated natural language processing.

J Digit Imaging. 2010 Apr;23(2):119-32. doi: 10.1007/s10278-009-9215-7. Epub 2009 May 30.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于自然语言处理的乳腺磁共振成像报告中成像观察和评估类别的自动提取。

Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing.

机构信息

Department of Radiology, Peking University First Hospital, Beijing 100034, China.

出版信息

Chin Med J (Engl). 2019 Jul 20;132(14):1673-1680. doi: 10.1097/CM9.0000000000000301.

DOI:10.1097/CM9.0000000000000301

PMID:31268905

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6759110/

Abstract

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

摘要

基于自然语言处理的乳腺磁共振成像报告中成像观察和评估类别的自动提取。

Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于自然语言处理的乳腺磁共振成像报告中成像观察和评估类别的自动提取。

Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献