Li Chenglong, Xiang Zhiqiang, Tang Jin, Luo Bin, Wang Futian
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):5019-5031. doi: 10.1109/TNNLS.2021.3067107. Epub 2022 Aug 31.
Existing RGBT tracking methods usually localize a target object with a bounding box, in which the trackers are often affected by the inclusion of background clutter. To address this issue, this article presents a novel algorithm, called noise-robust cross-modal ranking, to suppress background effects in target bounding boxes for RGBT tracking. In particular, we handle the noise interference in cross-modal fusion and seed labels from the following two aspects. First, the soft cross-modality consistency is proposed to allow the sparse inconsistency in fusing different modalities, aiming to take both collaboration and heterogeneity of different modalities into account for more effective fusion. Second, the optimal seed learning is designed to handle label noises of ranking seeds caused by some problems, such as irregular object shape and occlusion. In addition, to deploy the complementarity and maintain the structural information of different features within each modality, we perform an individual ranking for each feature and employ a cross-feature consistency to pursue their collaboration. A unified optimization framework with an efficient convergence speed is developed to solve the proposed model. Extensive experiments demonstrate the effectiveness and efficiency of the proposed approach comparing with state-of-the-art tracking methods on GTOT and RGBT234 benchmark data sets.
现有的RGB-T跟踪方法通常使用边界框来定位目标对象,其中跟踪器经常受到背景杂波的影响。为了解决这个问题,本文提出了一种名为噪声鲁棒跨模态排序的新算法,以抑制RGB-T跟踪中目标边界框的背景影响。具体来说,我们从以下两个方面处理跨模态融合和种子标签中的噪声干扰。首先,提出软跨模态一致性,允许在融合不同模态时存在稀疏不一致性,旨在兼顾不同模态的协作性和异质性以实现更有效的融合。其次,设计最优种子学习来处理由不规则物体形状和遮挡等问题导致的排序种子标签噪声。此外,为了发挥互补性并保持每个模态内不同特征的结构信息,我们对每个特征进行单独排序,并采用跨特征一致性来促进它们的协作。开发了一个具有高效收敛速度的统一优化框架来求解所提出的模型。大量实验表明,与GTOT和RGBT234基准数据集上的现有跟踪方法相比,所提方法具有有效性和高效性。