• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

叙事性放射学报告的自动标注

Automatic Annotation of Narrative Radiology Reports.

作者信息

Krsnik Ivan, Glavaš Goran, Krsnik Marina, Miletić Damir, Štajduhar Ivan

机构信息

Department of Computer Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia.

School of Business Informatics and Mathematics, University of Mannheim, 68159 Mannheim, Germany.

出版信息

Diagnostics (Basel). 2020 Apr 1;10(4):196. doi: 10.3390/diagnostics10040196.

DOI:10.3390/diagnostics10040196
PMID:32244833
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7235892/
Abstract

Narrative texts in electronic health records can be efficiently utilized for building decision support systems in the clinic, only if they are correctly interpreted automatically in accordance with a specified standard. This paper tackles the problem of developing an automated method of labeling free-form radiology reports, as a precursor for building query-capable report databases in hospitals. The analyzed dataset consists of 1295 radiology reports concerning the condition of a knee, retrospectively gathered at the Clinical Hospital Centre Rijeka, Croatia. Reports were manually labeled with one or more labels from a set of 10 most commonly occurring clinical conditions. After primary preprocessing of the texts, two sets of text classification methods were compared: (1) traditional classification models-Naive Bayes (NB), Logistic Regression (LR), Support Vector Machine (SVM), and Random Forests (RF)-coupled with Bag-of-Words (BoW) features (i.e., symbolic text representation) and (2) Convolutional Neural Network (CNN) coupled with dense word vectors (i.e., word embeddings as a semantic text representation) as input features. We resorted to nested 10-fold cross-validation to evaluate the performance of competing methods using accuracy, precision, recall, and F 1 score. The CNN with semantic word representations as input yielded the overall best performance, having a micro-averaged F 1 score of 86 . 7 % . The CNN classifier yielded particularly encouraging results for the most represented conditions: degenerative disease ( 95 . 9 % ), arthrosis ( 93 . 3 % ), and injury ( 89 . 2 % ). As a data-hungry deep learning model, the CNN, however, performed notably worse than the competing models on underrepresented classes with fewer training instances such as multicausal disease or metabolic disease. LR, RF, and SVM performed comparably well, with the obtained micro-averaged F 1 scores of 84 . 6 % , 82 . 2 % , and 82 . 1 % , respectively.

摘要

只有当电子健康记录中的叙述性文本按照特定标准被正确自动解读时,才能有效地用于构建临床决策支持系统。本文探讨了开发一种自动标注自由格式放射学报告的方法这一问题,作为在医院构建具备查询功能的报告数据库的前奏。分析的数据集由1295份关于膝盖状况的放射学报告组成,这些报告是在克罗地亚里耶卡临床医院中心回顾性收集的。报告被手动标注了一组10种最常见临床病症中的一个或多个标签。在对文本进行初步预处理后,比较了两组文本分类方法:(1)传统分类模型——朴素贝叶斯(NB)、逻辑回归(LR)、支持向量机(SVM)和随机森林(RF)——与词袋(BoW)特征(即符号文本表示)相结合;(2)卷积神经网络(CNN)与密集词向量(即作为语义文本表示的词嵌入)相结合作为输入特征。我们采用嵌套10折交叉验证,使用准确率、精确率、召回率和F1分数来评估竞争方法的性能。以语义词表示作为输入的CNN产生了总体最佳性能,微平均F1分数为86.7%。对于最具代表性的病症,CNN分类器产生了特别令人鼓舞的结果:退行性疾病(95.9%)、关节病(93.3%)和损伤(89.2%)。然而,作为一个数据需求大的深度学习模型,CNN在训练实例较少的代表性不足的类别(如多病因疾病或代谢疾病)上的表现明显比竞争模型差。LR、RF和SVM的表现相当,获得的微平均F1分数分别为84.6%、82.2%和82.1%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be18/7235892/9c25052519bb/diagnostics-10-00196-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be18/7235892/9c25052519bb/diagnostics-10-00196-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be18/7235892/9c25052519bb/diagnostics-10-00196-g001.jpg

相似文献

1
Automatic Annotation of Narrative Radiology Reports.叙事性放射学报告的自动标注
Diagnostics (Basel). 2020 Apr 1;10(4):196. doi: 10.3390/diagnostics10040196.
2
A clinical text classification paradigm using weak supervision and deep representation.一种使用弱监督和深度表示的临床文本分类范式。
BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.
3
Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.在两家大型学术放射科实践中膝关节MRI报告的机器学习分类器性能:一种估计诊断率的工具
AJR Am J Roentgenol. 2017 Apr;208(4):750-753. doi: 10.2214/AJR.16.16128. Epub 2017 Jan 31.
4
Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach.基于神经网络序列标注方法的乳腺 X 线摄影筛查报告的全面词级分类。
J Digit Imaging. 2019 Oct;32(5):685-692. doi: 10.1007/s10278-018-0141-4.
5
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
6
Automated Classification of Free-Text Radiology Reports: Using Different Feature Extraction Methods to Identify Fractures of the Distal Fibula.自动化自由文本放射学报告分类:使用不同的特征提取方法识别腓骨远端骨折。
Rofo. 2023 Aug;195(8):713-719. doi: 10.1055/a-2061-6562. Epub 2023 May 9.
7
Prediction of Stroke Outcome Using Natural Language Processing-Based Machine Learning of Radiology Report of Brain MRI.使用基于自然语言处理的脑磁共振成像放射学报告机器学习预测卒中结局
J Pers Med. 2020 Dec 16;10(4):286. doi: 10.3390/jpm10040286.
8
Natural Language-based Machine Learning Models for the Annotation of Clinical Radiology Reports.基于自然语言的机器学习模型在临床放射学报告标注中的应用。
Radiology. 2018 May;287(2):570-580. doi: 10.1148/radiol.2018171093. Epub 2018 Jan 30.
9
Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.使用多任务卷积神经网络从自由文本病理报告中自动提取癌症登记报告信息。
J Am Med Inform Assoc. 2020 Jan 1;27(1):89-98. doi: 10.1093/jamia/ocz153.
10
Investigating the impact of pre-processing techniques and pre-trained word embeddings in detecting Arabic health information on social media.研究预处理技术和预训练词嵌入在社交媒体上检测阿拉伯语健康信息方面的影响。
J Big Data. 2021;8(1):95. doi: 10.1186/s40537-021-00488-w. Epub 2021 Jul 2.

引用本文的文献

1
Automated labelling of radiology reports using natural language processing: Comparison of traditional and newer methods.使用自然语言处理对放射学报告进行自动标注:传统方法与新方法的比较。
Health Care Sci. 2023 Apr 24;2(2):120-128. doi: 10.1002/hcs2.40. eCollection 2023 Apr.
2
Deep Learning-Based Natural Language Processing in Radiology: The Impact of Report Complexity, Disease Prevalence, Dataset Size, and Algorithm Type on Model Performance.深度学习在放射学中的自然语言处理:报告复杂性、疾病流行率、数据集大小和算法类型对模型性能的影响。
J Med Syst. 2021 Sep 4;45(10):91. doi: 10.1007/s10916-021-01761-4.
3
Year 2020 (with COVID): Observation of Scientific Literature on Clinical Natural Language Processing.

本文引用的文献

1
Measuring the Quality of Explanations: The System Causability Scale (SCS): Comparing Human and Machine Explanations.衡量解释的质量:系统可归因性量表(SCS):比较人类和机器的解释
Kunstliche Intell (Oldenbourg). 2020;34(2):193-198. doi: 10.1007/s13218-020-00636-z. Epub 2020 Jan 21.
2
Causability and explainability of artificial intelligence in medicine.人工智能在医学中的可归因性与可解释性。
Wiley Interdiscip Rev Data Min Knowl Discov. 2019 Jul-Aug;9(4):e1312. doi: 10.1002/widm.1312. Epub 2019 Apr 2.
3
Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification.
2020 年(含新冠疫情):临床自然语言处理相关科学文献观察
Yearb Med Inform. 2021 Aug;30(1):257-263. doi: 10.1055/s-0041-1726528. Epub 2021 Sep 3.
4
COVID-19 classification by CCSHNet with deep fusion using transfer learning and discriminant correlation analysis.使用迁移学习和判别相关分析进行深度融合的CCSHNet对COVID-19的分类
Inf Fusion. 2021 Apr;68:131-148. doi: 10.1016/j.inffus.2020.11.005. Epub 2020 Nov 13.
卷积神经网络 (CNN) 和循环神经网络 (RNN) 架构在放射学文本报告分类中的比较效果。
Artif Intell Med. 2019 Jun;97:79-88. doi: 10.1016/j.artmed.2018.11.004. Epub 2018 Nov 23.
4
Deep Learning to Classify Radiology Free-Text Reports.深度学习在放射科自由文本报告分类中的应用
Radiology. 2018 Mar;286(3):845-852. doi: 10.1148/radiol.2017171115. Epub 2017 Nov 13.
5
Recurrent neural networks for classifying relations in clinical notes.用于对临床记录中的关系进行分类的循环神经网络。
J Biomed Inform. 2017 Aug;72:85-95. doi: 10.1016/j.jbi.2017.07.006. Epub 2017 Jul 8.
6
Semi-automated detection of anterior cruciate ligament injury from MRI.基于磁共振成像的前交叉韧带损伤半自动检测
Comput Methods Programs Biomed. 2017 Mar;140:151-164. doi: 10.1016/j.cmpb.2016.12.006. Epub 2016 Dec 15.
7
Automated discovery of safety and efficacy concerns for joint & muscle pain relief treatments from online reviews.通过在线评论自动发现关节和肌肉疼痛缓解治疗的安全性和有效性问题。
Int J Med Inform. 2017 Apr;100:108-120. doi: 10.1016/j.ijmedinf.2017.01.005. Epub 2017 Jan 20.
8
Text mining approach to predict hospital admissions using early medical records from the emergency department.利用急诊科早期医疗记录预测住院情况的文本挖掘方法。
Int J Med Inform. 2017 Apr;100:1-8. doi: 10.1016/j.ijmedinf.2017.01.001. Epub 2017 Jan 5.
9
Causality patterns and machine learning for the extraction of problem-action relations in discharge summaries.出院小结中问题-行动关系提取的因果模式与机器学习
Int J Med Inform. 2017 Feb;98:1-12. doi: 10.1016/j.ijmedinf.2016.10.021. Epub 2016 Nov 9.
10
Automatic ICD-10 classification of cancers from free-text death certificates.从自由文本死亡证明中对癌症进行ICD - 10自动分类。
Int J Med Inform. 2015 Nov;84(11):956-65. doi: 10.1016/j.ijmedinf.2015.08.004. Epub 2015 Aug 13.