Krsnik Ivan, Glavaš Goran, Krsnik Marina, Miletić Damir, Štajduhar Ivan
Department of Computer Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia.
School of Business Informatics and Mathematics, University of Mannheim, 68159 Mannheim, Germany.
Diagnostics (Basel). 2020 Apr 1;10(4):196. doi: 10.3390/diagnostics10040196.
Narrative texts in electronic health records can be efficiently utilized for building clinical decision support systems only if they are automatically and correctly interpreted in accordance with a specified standard. This paper tackles the problem of developing an automated method for labeling free-form radiology reports, as a precursor to building query-capable report databases in hospitals. The analyzed dataset consists of 1295 radiology reports concerning the condition of the knee, retrospectively gathered at the Clinical Hospital Centre Rijeka, Croatia. Reports were manually labeled with one or more labels from a set of the 10 most commonly occurring clinical conditions. After primary preprocessing of the texts, two sets of text classification methods were compared: (1) traditional classification models, namely Naive Bayes (NB), Logistic Regression (LR), Support Vector Machine (SVM), and Random Forests (RF), coupled with Bag-of-Words (BoW) features (i.e., a symbolic text representation), and (2) a Convolutional Neural Network (CNN) coupled with dense word vectors (i.e., word embeddings as a semantic text representation) as input features. We used nested 10-fold cross-validation to evaluate the performance of the competing methods using accuracy, precision, recall, and F1 score. The CNN with semantic word representations as input yielded the overall best performance, with a micro-averaged F1 score of 86.7%. The CNN classifier yielded particularly encouraging results for the most represented conditions: degenerative disease (95.9%), arthrosis (93.3%), and injury (89.2%). As a data-hungry deep learning model, however, the CNN performed notably worse than the competing models on underrepresented classes with fewer training instances, such as multicausal disease or metabolic disease. LR, RF, and SVM performed comparably well, with micro-averaged F1 scores of 84.6%, 82.2%, and 82.1%, respectively.
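The traditional baselines described above amount to a multi-label text classification pipeline: Bag-of-Words features feeding one binary classifier per clinical condition, evaluated with a micro-averaged F1 score. The sketch below is only an illustration of that setup under assumed settings, not the authors' code; the toy reports, label names, and scikit-learn components are hypothetical stand-ins for the real dataset and tuned pipelines.

```python
# Illustrative sketch only: toy data and settings, not the paper's pipeline.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.multiclass import OneVsRestClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MultiLabelBinarizer

# Each report may carry one or more condition labels (hypothetical examples).
reports = [
    "degenerative changes of the medial meniscus",
    "traumatic injury with rupture of the anterior cruciate ligament",
    "advanced arthrosis with diffuse degenerative disease of the joint",
    "acute injury, partial tear of the lateral collateral ligament",
]
labels = [
    ["degenerative disease"],
    ["injury"],
    ["arthrosis", "degenerative disease"],
    ["injury"],
]

# Binarize the label sets: one indicator column per clinical condition.
mlb = MultiLabelBinarizer()
Y = mlb.fit_transform(labels)

# Bag-of-Words features + one-vs-rest logistic regression, i.e. one binary
# classifier per condition, mirroring the traditional (symbolic) baselines.
model = make_pipeline(
    CountVectorizer(lowercase=True),
    OneVsRestClassifier(LogisticRegression(max_iter=1000)),
)
model.fit(reports, Y)

# Micro-averaged F1 pools true/false positives over all labels; the paper
# reports it from nested 10-fold cross-validation, not on training data.
Y_pred = model.predict(reports)
print("micro-averaged F1:", f1_score(Y, Y_pred, average="micro"))
print("predicted labels:", mlb.inverse_transform(
    model.predict(["chronic degenerative changes of the cartilage"])))
```

The same pipeline can be re-run with MultinomialNB, LinearSVC, or RandomForestClassifier in place of the logistic regression to approximate the other three baselines, and wrapped in nested cross-validation for hyperparameter selection and evaluation.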
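The deep learning competitor is a CNN over dense word vectors. The following Keras sketch is a hypothetical minimal variant, assuming integer-encoded report tokens, a trainable embedding layer, a single 1-D convolution with global max pooling, and a sigmoid output unit per condition label; the layer sizes, kernel width, and random stand-in data are assumptions, not the architecture or embeddings reported in the paper.

```python
# Hypothetical minimal CNN sketch for multi-label report classification;
# sizes and data are placeholders, not the authors' configuration.
import numpy as np
import tensorflow as tf

vocab_size, max_len, embed_dim, n_labels = 5000, 200, 100, 10

model = tf.keras.Sequential([
    tf.keras.layers.Embedding(vocab_size, embed_dim),      # dense word vectors
    tf.keras.layers.Conv1D(128, kernel_size=3, activation="relu"),
    tf.keras.layers.GlobalMaxPooling1D(),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(n_labels, activation="sigmoid"),  # one unit per label
])
model.compile(optimizer="adam", loss="binary_crossentropy")

# Random stand-in data: integer-encoded reports and binary label vectors.
X = np.random.randint(1, vocab_size, size=(32, max_len))
Y = np.random.randint(0, 2, size=(32, n_labels))
model.fit(X, Y, epochs=1, batch_size=8, verbose=0)
print(model.predict(X[:1]).shape)  # (1, 10): per-label probabilities
```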