Suppr超能文献

深度学习实现头部MRI数据集标注自动化以用于计算机视觉应用。

Deep learning to automate the labelling of head MRI datasets for computer vision applications.

作者信息

Wood David A, Kafiabadi Sina, Al Busaidi Aisha, Guilhem Emily L, Lynch Jeremy, Townend Matthew K, Montvila Antanas, Kiik Martin, Siddiqui Juveria, Gadapa Naveen, Benger Matthew D, Mazumder Asif, Barker Gareth, Ourselin Sebastian, Cole James H, Booth Thomas C

机构信息

School of Biomedical Engineering & Imaging Sciences, Kings College London, Rayne Institute, 4th Floor, Lambeth Wing, London, SE1 7EH, UK.

Department of Neuroradiology, Ruskin Wing, King's College Hospital NHS Foundation Trust, London, SE5 9RS, UK.

出版信息

Eur Radiol. 2022 Jan;32(1):725-736. doi: 10.1007/s00330-021-08132-0. Epub 2021 Jul 20.

Abstract

OBJECTIVES

The purpose of this study was to build a deep learning model to derive labels from neuroradiology reports and assign these to the corresponding examinations, overcoming a bottleneck to computer vision model development.

METHODS

Reference-standard labels were generated by a team of neuroradiologists for model training and evaluation. Three thousand examinations were labelled for the presence or absence of any abnormality by manually scrutinising the corresponding radiology reports ('reference-standard report labels'); a subset of these examinations (n = 250) were assigned 'reference-standard image labels' by interrogating the actual images. Separately, 2000 reports were labelled for the presence or absence of 7 specialised categories of abnormality (acute stroke, mass, atrophy, vascular abnormality, small vessel disease, white matter inflammation, encephalomalacia), with a subset of these examinations (n = 700) also assigned reference-standard image labels. A deep learning model was trained using labelled reports and validated in two ways: comparing predicted labels to (i) reference-standard report labels and (ii) reference-standard image labels. The area under the receiver operating characteristic curve (AUC-ROC) was used to quantify model performance. Accuracy, sensitivity, specificity, and F1 score were also calculated.

RESULTS

Accurate classification (AUC-ROC > 0.95) was achieved for all categories when tested against reference-standard report labels. A drop in performance (ΔAUC-ROC > 0.02) was seen for three categories (atrophy, encephalomalacia, vascular) when tested against reference-standard image labels, highlighting discrepancies in the original reports. Once trained, the model assigned labels to 121,556 examinations in under 30 min.

CONCLUSIONS

Our model accurately classifies head MRI examinations, enabling automated dataset labelling for downstream computer vision applications.

KEY POINTS

• Deep learning is poised to revolutionise image recognition tasks in radiology; however, a barrier to clinical adoption is the difficulty of obtaining large labelled datasets for model training. • We demonstrate a deep learning model which can derive labels from neuroradiology reports and assign these to the corresponding examinations at scale, facilitating the development of downstream computer vision models. • We rigorously tested our model by comparing labels predicted on the basis of neuroradiology reports with two sets of reference-standard labels: (1) labels derived by manually scrutinising each radiology report and (2) labels derived by interrogating the actual images.

摘要

目的

本研究旨在构建一个深度学习模型,从神经放射学报告中提取标签并将其分配给相应的检查,克服计算机视觉模型开发的一个瓶颈。

方法

由一组神经放射科医生生成参考标准标签用于模型训练和评估。通过人工仔细审查相应的放射学报告(“参考标准报告标签”),对3000例检查进行有无任何异常的标注;通过查看实际图像,为这些检查中的一个子集(n = 250)分配“参考标准图像标签”。另外,对2000份报告进行7种特殊异常类型(急性中风、肿块、萎缩、血管异常、小血管疾病、白质炎症、脑软化)有无的标注,这些检查中的一个子集(n = 700)也被分配参考标准图像标签。使用标注好的报告训练一个深度学习模型,并通过两种方式进行验证:将预测标签与(i)参考标准报告标签和(ii)参考标准图像标签进行比较。使用受试者操作特征曲线下面积(AUC-ROC)来量化模型性能。还计算了准确率、敏感性、特异性和F1分数。

结果

与参考标准报告标签进行测试时,所有类别均实现了准确分类(AUC-ROC > 0.95)。与参考标准图像标签进行测试时,三个类别(萎缩、脑软化、血管)的性能出现下降(ΔAUC-ROC > 0.02),突出了原始报告中的差异。一旦训练完成,该模型在不到30分钟的时间内为121,556例检查分配了标签。

结论

我们的模型能够准确地对头MRI检查进行分类,为下游计算机视觉应用实现自动化数据集标注。

要点

• 深度学习有望彻底改变放射学中的图像识别任务;然而,临床应用的一个障碍是难以获得用于模型训练的大型标注数据集。• 我们展示了一个深度学习模型,它可以从神经放射学报告中提取标签并大规模地将其分配给相应的检查,促进下游计算机视觉模型的开发。• 我们通过将基于神经放射学报告预测的标签与两组参考标准标签进行比较,对我们的模型进行了严格测试:(1)通过人工仔细审查每份放射学报告得出的标签和(2)通过查看实际图像得出的标签。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e600/8660736/38354e2c9214/330_2021_8132_Fig1_HTML.jpg

相似文献

1
Deep learning to automate the labelling of head MRI datasets for computer vision applications.
Eur Radiol. 2022 Jan;32(1):725-736. doi: 10.1007/s00330-021-08132-0. Epub 2021 Jul 20.
2
Deep learning models for triaging hospital head MRI examinations.
Med Image Anal. 2022 May;78:102391. doi: 10.1016/j.media.2022.102391. Epub 2022 Feb 12.
4
Factors affecting the labelling accuracy of brain MRI studies relevant for deep learning abnormality detection.
Front Radiol. 2023 Nov 27;3:1251825. doi: 10.3389/fradi.2023.1251825. eCollection 2023.
5
Multi-label annotation of text reports from computed tomography of the chest, abdomen, and pelvis using deep learning.
BMC Med Inform Decis Mak. 2022 Apr 15;22(1):102. doi: 10.1186/s12911-022-01843-4.
6
Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet.
PLoS Med. 2018 Nov 27;15(11):e1002699. doi: 10.1371/journal.pmed.1002699. eCollection 2018 Nov.
7
Comparison of Chest Radiograph Interpretations by Artificial Intelligence Algorithm vs Radiology Residents.
JAMA Netw Open. 2020 Oct 1;3(10):e2022779. doi: 10.1001/jamanetworkopen.2020.22779.
8
Breast MRI Background Parenchymal Enhancement Categorization Using Deep Learning: Outperforming the Radiologist.
J Magn Reson Imaging. 2022 Oct;56(4):1068-1076. doi: 10.1002/jmri.28111. Epub 2022 Feb 15.
9
Language model-based labeling of German thoracic radiology reports.
Rofo. 2025 Jan;197(1):55-64. doi: 10.1055/a-2287-5054. Epub 2024 Apr 25.
10
Natural Language-based Machine Learning Models for the Annotation of Clinical Radiology Reports.
Radiology. 2018 May;287(2):570-580. doi: 10.1148/radiol.2018171093. Epub 2018 Jan 30.

引用本文的文献

4
AI and Neurology.
Neurol Res Pract. 2025 Feb 17;7(1):11. doi: 10.1186/s42466-025-00367-2.
8
Breast tumor segmentation using neural cellular automata and shape guided segmentation in mammography images.
PLoS One. 2024 Oct 1;19(10):e0309421. doi: 10.1371/journal.pone.0309421. eCollection 2024.

本文引用的文献

2
PadChest: A large chest x-ray image dataset with multi-label annotated reports.
Med Image Anal. 2020 Dec;66:101797. doi: 10.1016/j.media.2020.101797. Epub 2020 Aug 20.
3
Deep learning with noisy labels: Exploring techniques and remedies in medical image analysis.
Med Image Anal. 2020 Oct;65:101759. doi: 10.1016/j.media.2020.101759. Epub 2020 Jun 20.
4
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
5
The present and future of deep learning in radiology.
Eur J Radiol. 2019 May;114:14-24. doi: 10.1016/j.ejrad.2019.02.038. Epub 2019 Mar 2.
7
Natural language processing and machine learning algorithm to identify brain MRI reports with acute ischemic stroke.
PLoS One. 2019 Feb 28;14(2):e0212778. doi: 10.1371/journal.pone.0212778. eCollection 2019.
8
Automated Triaging of Adult Chest Radiographs with Deep Artificial Neural Networks.
Radiology. 2019 Apr;291(1):196-202. doi: 10.1148/radiol.2018180921. Epub 2019 Jan 22.
9
Artificial intelligence in radiology.
Nat Rev Cancer. 2018 Aug;18(8):500-510. doi: 10.1038/s41568-018-0016-5.
10
Deep Learning in Radiology.
Acad Radiol. 2018 Nov;25(11):1472-1480. doi: 10.1016/j.acra.2018.02.018. Epub 2018 Mar 30.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验