Suppr
超能文献

在两家大型学术放射科实践中膝关节MRI报告的机器学习分类器性能：一种估计诊断率的工具

Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.

作者信息

Hassanpour Saeed, Langlotz Curtis P, Amrhein Timothy J, Befera Nicholas T, Lungren Matthew P

机构信息

1 Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA.

2 Department of Radiology, Stanford University School of Medicine, Stanford University Medical Center, 725 Welch Rd, Rm 1675, Stanford, CA 94305-5913.

出版信息

AJR Am J Roentgenol. 2017 Apr;208(4):750-753. doi: 10.2214/AJR.16.16128. Epub 2017 Jan 31.

DOI:10.2214/AJR.16.16128

PMID:28140627

Abstract

OBJECTIVE

The purpose of this study is to evaluate the performance of a natural language processing (NLP) system in classifying a database of free-text knee MRI reports at two separate academic radiology practices.

MATERIALS AND METHODS

An NLP system that uses terms and patterns in manually classified narrative knee MRI reports was constructed. The NLP system was trained and tested on expert-classified knee MRI reports from two major health care organizations. Radiology reports were modeled in the training set as vectors, and a support vector machine framework was used to train the classifier. A separate test set from each organization was used to evaluate the performance of the system. We evaluated the performance of the system both within and across organizations. Standard evaluation metrics, such as accuracy, precision, recall, and F1 score (i.e., the weighted average of the precision and recall), and their respective 95% CIs were used to measure the efficacy of our classification system.

RESULTS

The accuracy for radiology reports that belonged to the model's clinically significant concept classes after training data from the same institution was good, yielding an F1 score greater than 90% (95% CI, 84.6-97.3%). Performance of the classifier on cross-institutional application without institution-specific training data yielded F1 scores of 77.6% (95% CI, 69.5-85.7%) and 90.2% (95% CI, 84.5-95.9%) at the two organizations studied.

CONCLUSION

The results show excellent accuracy by the NLP machine learning classifier in classifying free-text knee MRI reports, supporting the institution-independent reproducibility of knee MRI report classification. Furthermore, the machine learning classifier performed well on free-text knee MRI reports from another institution. These data support the feasibility of multiinstitutional classification of radiologic imaging text reports with a single machine learning classifier without requiring institution-specific training data.

摘要

目的

本研究旨在评估一种自然语言处理（NLP）系统在两个独立的学术放射科实践中对自由文本膝关节MRI报告数据库进行分类的性能。

材料与方法

构建了一个使用手动分类的叙述性膝关节MRI报告中的术语和模式的NLP系统。该NLP系统在来自两个主要医疗保健组织的专家分类膝关节MRI报告上进行训练和测试。放射学报告在训练集中被建模为向量，并使用支持向量机框架训练分类器。来自每个组织的单独测试集用于评估系统的性能。我们评估了系统在组织内部和组织之间的性能。使用标准评估指标，如准确性、精确性、召回率和F1分数（即精确性和召回率的加权平均值）及其各自的95%置信区间来衡量我们分类系统的有效性。

结果

在使用来自同一机构的训练数据后，属于模型临床重要概念类别的放射学报告的准确性良好，F1分数大于90%（95%置信区间，84.6 - 97.3%）。在没有特定机构训练数据的跨机构应用中，分类器在两个研究组织中的F1分数分别为77.6%（95%置信区间，69.5 - 85.7%）和90.2%（95%置信区间，84.5 - 95.9%）。

结论

结果表明NLP机器学习分类器在对自由文本膝关节MRI报告进行分类时具有出色的准确性，支持膝关节MRI报告分类的机构独立可重复性。此外，机器学习分类器在来自另一个机构的自由文本膝关节MRI报告上表现良好。这些数据支持使用单个机器学习分类器对放射影像文本报告进行多机构分类的可行性，而无需特定机构的训练数据。

相似文献

Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.

AJR Am J Roentgenol. 2017 Apr;208(4):750-753. doi: 10.2214/AJR.16.16128. Epub 2017 Jan 31.

Automated Radiology-Arthroscopy Correlation of Knee Meniscal Tears Using Natural Language Processing Algorithms.

Acad Radiol. 2022 Apr;29(4):479-487. doi: 10.1016/j.acra.2021.01.017. Epub 2021 Feb 11.

Information extraction from multi-institutional radiology reports.

Artif Intell Med. 2016 Jan;66:29-39. doi: 10.1016/j.artmed.2015.09.007. Epub 2015 Oct 3.

A natural language processing pipeline for pairing measurements uniquely across free-text CT reports.

J Biomed Inform. 2015 Feb;53:36-48. doi: 10.1016/j.jbi.2014.08.015. Epub 2014 Sep 6.

Natural language processing and machine learning algorithm to identify brain MRI reports with acute ischemic stroke.

PLoS One. 2019 Feb 28;14(2):e0212778. doi: 10.1371/journal.pone.0212778. eCollection 2019.

Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury.

Acad Emerg Med. 2016 Feb;23(2):171-8. doi: 10.1111/acem.12859. Epub 2016 Jan 14.

Identification of Long Bone Fractures in Radiology Reports Using Natural Language Processing to support Healthcare Quality Improvement.

Appl Clin Inform. 2016 Nov 9;7(4):1051-1068. doi: 10.4338/ACI-2016-08-RA-0129.

Automated Classification of Selected Data Elements from Free-text Diagnostic Reports for Clinical Research.

Methods Inf Med. 2016 Aug 5;55(4):373-80. doi: 10.3414/ME15-02-0019. Epub 2016 Jul 13.

Automatic detection of patients with invasive fungal disease from free-text computed tomography (CT) scans.

J Biomed Inform. 2015 Feb;53:251-60. doi: 10.1016/j.jbi.2014.11.009. Epub 2014 Nov 24.

Predicting High Imaging Utilization Based on Initial Radiology Reports: A Feasibility Study of Machine Learning.

Acad Radiol. 2016 Jan;23(1):84-9. doi: 10.1016/j.acra.2015.09.014. Epub 2015 Oct 27.

引用本文的文献

Automated Radiology Report Labeling in Chest X-Ray Pathologies: Development and Evaluation of a Large Language Model Framework.

JMIR Med Inform. 2025 Mar 28;13:e68618. doi: 10.2196/68618.

Text mining approach for feature extraction and cartilage disease grade classification using knee MRI radiology reports.

Comput Struct Biotechnol J. 2024 Oct 5;24:622-629. doi: 10.1016/j.csbj.2024.10.003. eCollection 2024 Dec.

Efficient labeling of french mammogram reports with MammoBERT.

Sci Rep. 2024 Oct 22;14(1):24842. doi: 10.1038/s41598-024-76369-y.

Fusion Modeling: Combining Clinical and Imaging Data to Advance Cardiac Care.

Circ Cardiovasc Imaging. 2023 Dec;16(12):e014533. doi: 10.1161/CIRCIMAGING.122.014533. Epub 2023 Dec 11.

Detection of Gallbladder Disease Types Using Deep Learning: An Informative Medical Method.

Diagnostics (Basel). 2023 May 15;13(10):1744. doi: 10.3390/diagnostics13101744.

Transformer versus traditional natural language processing: how much data is enough for automated radiology report classification?

Br J Radiol. 2023 Sep;96(1149):20220769. doi: 10.1259/bjr.20220769. Epub 2023 May 25.

Prediction of Stroke Outcome Using Natural Language Processing-Based Machine Learning of Radiology Report of Brain MRI.

J Pers Med. 2020 Dec 16;10(4):286. doi: 10.3390/jpm10040286.

Analysis of Stroke Detection during the COVID-19 Pandemic Using Natural Language Processing of Radiology Reports.

AJNR Am J Neuroradiol. 2021 Mar;42(3):429-434. doi: 10.3174/ajnr.A6961. Epub 2020 Dec 17.

Patient Triage by Topic Modeling of Referral Letters: Feasibility Study.

JMIR Med Inform. 2020 Nov 6;8(11):e21252. doi: 10.2196/21252.

A Comparative Systematic Literature Review on Knee Bone Reports from MRI, X-rays and CT Scans Using Deep Learning and Machine Learning Methodologies.

Diagnostics (Basel). 2020 Jul 26;10(8):518. doi: 10.3390/diagnostics10080518.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

在两家大型学术放射科实践中膝关节MRI报告的机器学习分类器性能：一种估计诊断率的工具

Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.

作者信息

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

CONCLUSION

目的

材料与方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译