用于协作图像分类的多路径x-D递归神经网络

Multi-path x-D Recurrent Neural Networks for Collaborative Image Classification.

作者信息

Gao Riqiang, Huo Yuankai, Bao Shunxing, Tang Yucheng, Antic Sanja L, Epstein Emily S, Deppen Steve, Paulson Alexis B, Sandler Kim L, Massion Pierre P, Landman Bennett A

机构信息

Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA 37235, Vanderbilt University Medical Center, Nashville, TN, USA 37235.

出版信息

Neurocomputing (Amst). 2020 Jul 15;397:48-59. doi: 10.1016/j.neucom.2020.02.033. Epub 2020 Feb 15.

DOI:10.1016/j.neucom.2020.02.033

PMID:32863584

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7454345/

Abstract

With the rapid development of image acquisition and storage, multiple images per class are commonly available for computer vision tasks (e.g., face recognition, object detection, medical imaging, etc.). Recently, the recurrent neural network (RNN) has been widely integrated with convolutional neural networks (CNN) to perform image classification on ordered (sequential) data. In this paper, by permutating multiple images as multiple dummy orders, we generalize the ordered "RNN+CNN" design (longitudinal) to a novel unordered fashion, called Multi-path x-D Recurrent Neural Network (MxDRNN) for image classification. To the best of our knowledge, few (if any) existing studies have deployed the RNN framework to unordered intra-class images to leverage classification performance. Specifically, multiple learning paths are introduced in the MxDRNN to extract discriminative features by permutating input dummy orders. Eight datasets from five different fields (MNIST, 3D-MNIST, CIFAR, VGGFace2, and lung screening computed tomography) are included to evaluate the performance of our method. The proposed MxDRNN improves the baseline performance by a large margin across the different application fields (e.g., accuracy from 46.40% to 76.54% in VGGFace2 test pose set, AUC from 0.7418 to 0.8162 in NLST lung dataset). Additionally, empirical experiments show the MxDRNN is more robust to category-irrelevant attributes (e.g., expression, pose in face images), which may introduce difficulties for image classification and algorithm generalizability. The code is publicly available.

摘要

随着图像采集和存储的快速发展，对于计算机视觉任务（如人脸识别、目标检测、医学成像等），每个类别通常都有多个图像可用。最近，循环神经网络（RNN）已广泛与卷积神经网络（CNN）集成，以对有序（序列）数据进行图像分类。在本文中，通过将多个图像排列为多个虚拟顺序，我们将有序的“RNN+CNN”设计（纵向）推广为一种新颖的无序方式，称为用于图像分类的多路径x-D循环神经网络（MxDRNN）。据我们所知，很少（如果有的话）现有研究将RNN框架应用于无序的类内图像以提升分类性能。具体而言，MxDRNN中引入了多条学习路径，通过排列输入的虚拟顺序来提取判别性特征。我们纳入了来自五个不同领域的八个数据集（MNIST、3D-MNIST、CIFAR、VGGFace2和肺部筛查计算机断层扫描）来评估我们方法的性能。所提出的MxDRNN在不同应用领域中大幅提高了基线性能（例如，在VGGFace2测试姿态集中准确率从46.40%提高到76.54%，在NLST肺部数据集中AUC从0.7418提高到0.8162）。此外，实证实验表明MxDRNN对与类别无关的属性（如面部图像中的表情、姿态）更具鲁棒性，这些属性可能给图像分类和算法通用性带来困难。代码已公开可用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a1/7454345/6d9b4161b2d8/nihms-1568810-f0001.jpg

相似文献

Multi-path x-D Recurrent Neural Networks for Collaborative Image Classification.用于协作图像分类的多路径x-D递归神经网络

Neurocomputing (Amst). 2020 Jul 15;397:48-59. doi: 10.1016/j.neucom.2020.02.033. Epub 2020 Feb 15.

CNN-RNN Network Integration for the Diagnosis of COVID-19 Using Chest X-ray and CT Images.基于胸部 X 射线和 CT 图像的 COVID-19 诊断的 CNN-RNN 网络集成。

Sensors (Basel). 2023 Jan 25;23(3):1356. doi: 10.3390/s23031356.

Automatic bladder segmentation from CT images using deep CNN and 3D fully connected CRF-RNN.利用深度卷积神经网络和 3D 全连接条件随机场循环神经网络自动进行 CT 图像的膀胱分割。

Int J Comput Assist Radiol Surg. 2018 Jul;13(7):967-975. doi: 10.1007/s11548-018-1733-7. Epub 2018 Mar 19.

Application of high resolution computed tomography image assisted classification model of middle ear diseases based on 3D-convolutional neural network.基于 3D 卷积神经网络的中耳疾病高分辨率 CT 图像辅助分类模型的应用。

Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2022 Aug 28;47(8):1037-1048. doi: 10.11817/j.issn.1672-7347.2022.210704.

RNN-based longitudinal analysis for diagnosis of Alzheimer's disease.基于 RNN 的阿尔茨海默病纵向分析诊断。

Comput Med Imaging Graph. 2019 Apr;73:1-10. doi: 10.1016/j.compmedimag.2019.01.005. Epub 2019 Jan 26.

Uncertainty handling in convolutional neural networks.卷积神经网络中的不确定性处理。

Neural Comput Appl. 2022;34(19):16753-16769. doi: 10.1007/s00521-022-07313-2. Epub 2022 Jun 18.

Classification of Alzheimer's Disease by Combination of Convolutional and Recurrent Neural Networks Using FDG-PET Images.基于氟代脱氧葡萄糖正电子发射断层扫描（FDG-PET）图像，利用卷积神经网络和循环神经网络相结合的方法对阿尔茨海默病进行分类

Front Neuroinform. 2018 Jun 19;12:35. doi: 10.3389/fninf.2018.00035. eCollection 2018.

Automatic segmentation of OCT retinal boundaries using recurrent neural networks and graph search.使用递归神经网络和图搜索自动分割光学相干断层扫描（OCT）视网膜边界

Biomed Opt Express. 2018 Oct 26;9(11):5759-5777. doi: 10.1364/BOE.9.005759. eCollection 2018 Nov 1.

Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.使用卷积神经网络和代数几何进行手术工具的检测、分割和三维姿态估计。

Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.

Automated AJCC (7th edition) staging of non-small cell lung cancer (NSCLC) using deep convolutional neural network (CNN) and recurrent neural network (RNN).使用深度卷积神经网络（CNN）和循环神经网络（RNN）对非小细胞肺癌（NSCLC）进行自动AJCC（第7版）分期

Health Inf Sci Syst. 2019 Jul 30;7(1):14. doi: 10.1007/s13755-019-0077-1. eCollection 2019 Dec.

引用本文的文献

Reducing uncertainty in cancer risk estimation for patients with indeterminate pulmonary nodules using an integrated deep learning model.利用集成深度学习模型降低不确定度肺结节患者的癌症风险评估。

Comput Biol Med. 2022 Nov;150:106113. doi: 10.1016/j.compbiomed.2022.106113. Epub 2022 Sep 29.

The impact of the lung EDRN-CVC on Phase 1, 2, & 3 biomarker validation studies.肺 EDRN-CVC 对 1 期、2 期和 3 期生物标志物验证研究的影响。

Cancer Biomark. 2022;33(4):449-465. doi: 10.3233/CBM-210382.

Cancer Risk Estimation Combining Lung Screening CT with Clinical Data Elements.结合肺部筛查CT与临床数据元素的癌症风险评估

Radiol Artif Intell. 2021 Oct 13;3(6):e210032. doi: 10.1148/ryai.2021210032. eCollection 2021 Nov.

Deep Multi-path Network Integrating Incomplete Biomarker and Chest CT Data for Evaluating Lung Cancer Risk.深度多路径网络集成不完整生物标志物和胸部CT数据以评估肺癌风险

Proc SPIE Int Soc Opt Eng. 2021 Feb;11596. doi: 10.1117/12.2580730. Epub 2021 Feb 15.

Time-distanced gates in long short-term memory networks.长短期记忆网络中的时间距离门控

Med Image Anal. 2020 Oct;65:101785. doi: 10.1016/j.media.2020.101785. Epub 2020 Jul 18.

本文引用的文献

ArcFace: Additive Angular Margin Loss for Deep Face Recognition.ArcFace：用于深度人脸识别的附加角度间隔损失。

IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):5962-5979. doi: 10.1109/TPAMI.2021.3087709. Epub 2022 Sep 14.

Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition.层次化深度点击特征预测在细粒度图像识别中的应用。

IEEE Trans Pattern Anal Mach Intell. 2022 Feb;44(2):563-578. doi: 10.1109/TPAMI.2019.2932058. Epub 2022 Jan 7.

Clinical-grade computational pathology using weakly supervised deep learning on whole slide images.基于全切片图像的弱监督深度学习的临床级计算病理学。

Nat Med. 2019 Aug;25(8):1301-1309. doi: 10.1038/s41591-019-0508-1. Epub 2019 Jul 15.

End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography.基于低剂量 CT 的三维深度学习肺癌全流程筛查。

Nat Med. 2019 Jun;25(6):954-961. doi: 10.1038/s41591-019-0447-x. Epub 2019 May 20.

Spatial Pyramid-Enhanced NetVLAD With Weighted Triplet Loss for Place Recognition.用于地点识别的带加权三元组损失的空间金字塔增强NetVLAD

IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):661-674. doi: 10.1109/TNNLS.2019.2908982. Epub 2019 Apr 26.

Deep Learning Predicts Lung Cancer Treatment Response from Serial Medical Imaging.深度学习从连续医学成像预测肺癌治疗反应。

Clin Cancer Res. 2019 Jun 1;25(11):3266-3275. doi: 10.1158/1078-0432.CCR-18-2495. Epub 2019 Apr 22.

Evaluate the Malignancy of Pulmonary Nodules Using the 3-D Deep Leaky Noisy-OR Network.利用三维深度渗漏噪声 OR 网络评估肺结节的恶性程度。

IEEE Trans Neural Netw Learn Syst. 2019 Nov;30(11):3484-3495. doi: 10.1109/TNNLS.2019.2892409. Epub 2019 Feb 14.

Local Deep-Feature Alignment for Unsupervised Dimension Reduction.用于无监督降维的局部深度特征对齐

IEEE Trans Image Process. 2018 Feb 22. doi: 10.1109/TIP.2018.2804218.

Beyond Bilinear: Generalized Multimodal Factorized High-Order Pooling for Visual Question Answering.超越双线性：用于视觉问答的广义多模态因式分解高阶池化

IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):5947-5959. doi: 10.1109/TNNLS.2018.2817340. Epub 2018 Apr 9.

Multilevel Contextual 3-D CNNs for False Positive Reduction in Pulmonary Nodule Detection.用于减少肺结节检测中假阳性的多级上下文3D卷积神经网络

IEEE Trans Biomed Eng. 2017 Jul;64(7):1558-1567. doi: 10.1109/TBME.2016.2613502. Epub 2016 Sep 26.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验