Morid Mohammad Amin, Borjali Alireza, Del Fiol Guilherme
Department of Information Systems and Analytics, Leavey School of Business, Santa Clara University, Santa Clara, CA, USA.
Department of Orthopaedic Surgery, Harvard Medical School, Boston, MA, USA; Department of Orthopaedic Surgery, Harris Orthopaedics Laboratory, Massachusetts General Hospital, Boston, MA, USA.
Comput Biol Med. 2021 Jan;128:104115. doi: 10.1016/j.compbiomed.2020.104115. Epub 2020 Nov 13.
Employing transfer learning (TL) with convolutional neural networks (CNNs), well-trained on non-medical ImageNet dataset, has shown promising results for medical image analysis in recent years. We aimed to conduct a scoping review to identify these studies and summarize their characteristics in terms of the problem description, input, methodology, and outcome.
To identify relevant studies, MEDLINE, IEEE, and ACM digital library were searched for studies published between June 1st 2012 and January 2nd, 2020. Two investigators independently reviewed articles to determine eligibility and to extract data according to a study protocol defined a priori.
After screening of 8421 articles, 102 met the inclusion criteria. Of 22 anatomical areas, eye (18%), breast (14%), and brain (12%) were the most commonly studied. Data augmentation was performed in 72% of fine-tuning TL studies versus 15% of the feature-extracting TL studies. Inception models were the most commonly used in breast related studies (50%), while VGGNet was the common in eye (44%), skin (50%) and tooth (57%) studies. AlexNet for brain (42%) and DenseNet for lung studies (38%) were the most frequently used models. Inception models were the most frequently used for studies that analyzed ultrasound (55%), endoscopy (57%), and skeletal system X-rays (57%). VGGNet was the most common for fundus (42%) and optical coherence tomography images (50%). AlexNet was the most frequent model for brain MRIs (36%) and breast X-Rays (50%). 35% of the studies compared their model with other well-trained CNN models and 33% of them provided visualization for interpretation.
This study identified the most prevalent tracks of implementation in the literature for data preparation, methodology selection and output evaluation for various medical image analysis tasks. Also, we identified several critical research gaps existing in the TL studies on medical image analysis. The findings of this scoping review can be used in future TL studies to guide the selection of appropriate research approaches, as well as identify research gaps and opportunities for innovation.
近年来,采用在非医学ImageNet数据集上训练良好的卷积神经网络(CNN)进行迁移学习(TL),在医学图像分析方面已显示出有前景的结果。我们旨在进行一项范围综述,以识别这些研究,并从问题描述、输入、方法和结果方面总结其特征。
为识别相关研究,检索了MEDLINE、IEEE和ACM数字图书馆中2012年6月1日至2020年1月2日发表的研究。两名研究者独立审查文章,以根据事先定义的研究方案确定其合格性并提取数据。
在筛选8421篇文章后,102篇符合纳入标准。在22个解剖区域中,眼睛(18%)、乳房(14%)和大脑(12%)是研究最频繁的区域。72%的微调TL研究进行了数据增强,而特征提取TL研究中这一比例为15%。Inception模型在与乳房相关的研究中使用最为频繁(50%),而VGGNet在眼睛(44%)、皮肤(50%)和牙齿(57%)研究中最为常用。用于大脑研究的AlexNet(42%)和用于肺部研究的DenseNet(38%)是最常使用的模型。Inception模型在分析超声(55%)、内窥镜检查(57%)和骨骼系统X射线(57%)的研究中使用最为频繁。VGGNet在眼底(42%)和光学相干断层扫描图像(50%)研究中最为常见。AlexNet在脑部MRI(36%)和乳房X射线(50%)研究中是最常使用的模型。35%的研究将其模型与其他训练良好的CNN模型进行了比较,33%的研究提供了可视化结果用于解释。
本研究确定了文献中针对各种医学图像分析任务在数据准备、方法选择和输出评估方面最普遍的实施路径。此外,我们还确定了医学图像分析TL研究中存在的几个关键研究空白。这一范围综述的结果可用于未来的TL研究,以指导合适研究方法的选择,以及识别研究空白和创新机会。