IEEE Trans Cybern. 2020 Oct;50(10):4256-4267. doi: 10.1109/TCYB.2019.2933224. Epub 2019 Aug 23.
Real-world inverse synthetic aperture radar (ISAR) object recognition is a critical and challenging problem in computer vision tasks. In this article, an efficient real-world ISAR object recognition method is proposed, namely, real-world ISAR object recognition (RIOR), based on deep multimodal relation learning (DMRL). It cannot only handle the complex multimodal recognition problem efficiently but also exploit the relations among the features, attributes, labels, and classes with semantic knowledge: 1) an adaptive multimodal mechanism (AMM) is proposed in convolutional neural network (CNN) to substantially promote the CNN sampling and transformation capability and significantly raise the output feature map resolutions by keeping almost all of the information; 2) deep attribute relation graph learning (DARGL) is proposed to jointly estimate the large numbers of heterogeneous attributes and collaboratively explore the relations among the features, attributes, labels, and classes with common knowledge graphs; and 3) relational-regularized convolutional sparse learning (RCSL) is proposed to further achieve good translation invariance and improve the accuracy and speed of the entire system. Extensive qualitative and quantitative experiments are performed on two real-world ISAR datasets, demonstrating that RIOR outperforms the state-of-the-art methods while running quickly.
基于深度多模态关系学习的真实场景逆合成孔径雷达(ISAR)目标识别
真实场景逆合成孔径雷达(ISAR)目标识别是计算机视觉任务中的一个关键且具有挑战性的问题。在本文中,我们提出了一种有效的真实场景 ISAR 目标识别方法,即基于深度多模态关系学习(DMRL)的真实场景 ISAR 目标识别(RIOR)。它不仅可以有效地处理复杂的多模态识别问题,还可以利用特征、属性、标签和类之间的关系以及语义知识:1)在卷积神经网络(CNN)中提出了自适应多模态机制(AMM),通过保持几乎所有信息,显著提高了 CNN 的采样和变换能力,并显著提高了输出特征图的分辨率;2)提出了深度属性关系图学习(DARGL),以联合估计大量异构属性,并共同探索具有公共知识图的特征、属性、标签和类之间的关系;3)提出了关系正则化卷积稀疏学习(RCSL),以进一步实现良好的平移不变性,并提高整个系统的准确性和速度。在两个真实的 ISAR 数据集上进行了广泛的定性和定量实验,结果表明 RIOR 优于最先进的方法,同时运行速度也很快。