Zhang Zheng, Yuan Xu, Zhu Lei, Song Jingkuan, Nie Liqiang
IEEE Trans Image Process. 2024;33:2558-2571. doi: 10.1109/TIP.2024.3378918. Epub 2024 Apr 3.
Despite remarkable successes in unimodal learning tasks, backdoor attacks against cross-modal learning are still underexplored due to the limited generalization and inferior stealthiness when involving multiple modalities. Notably, since works in this area mainly inherit ideas from unimodal visual attacks, they struggle with dealing with diverse cross-modal attack circumstances and manipulating imperceptible trigger samples, which hinders their practicability in real-world applications. In this paper, we introduce a novel bilateral backdoor to fill in the missing pieces of the puzzle in the cross-modal backdoor and propose a generalized invisible backdoor framework against cross-modal learning (BadCM). Specifically, a cross-modal mining scheme is developed to capture the modality-invariant components as target poisoning areas, where well-designed trigger patterns injected into these regions can be efficiently recognized by the victim models. This strategy is adapted to different image-text cross-modal models, making our framework available to various attack scenarios. Furthermore, for generating poisoned samples of high stealthiness, we conceive modality-specific generators for visual and linguistic modalities that facilitate hiding explicit trigger patterns in modality-invariant regions. To the best of our knowledge, BadCM is the first invisible backdoor method deliberately designed for diverse cross-modal attacks within one unified framework. Comprehensive experimental evaluations on two typical applications, i.e., cross-modal retrieval and VQA, demonstrate the effectiveness and generalization of our method under multiple kinds of attack scenarios. Moreover, we show that BadCM can robustly evade existing backdoor defenses. Our code is available at https://github.com/xandery-geek/BadCM.
尽管在单模态学习任务中取得了显著成功,但由于在涉及多种模态时泛化能力有限且隐蔽性较差,针对跨模态学习的后门攻击仍未得到充分探索。值得注意的是,由于该领域的工作主要继承了单模态视觉攻击的思想,它们在处理各种跨模态攻击情况和操纵难以察觉的触发样本方面存在困难,这阻碍了它们在实际应用中的实用性。在本文中,我们引入了一种新颖的双边后门来填补跨模态后门难题中缺失的部分,并提出了一种针对跨模态学习的广义隐形后门框架(BadCM)。具体而言,我们开发了一种跨模态挖掘方案,以捕获模态不变的组件作为目标中毒区域,在这些区域注入精心设计的触发模式可以被受害模型有效识别。这种策略适用于不同的图像 - 文本跨模态模型,使我们的框架适用于各种攻击场景。此外,为了生成具有高隐蔽性的中毒样本,我们为视觉和语言模态设计了特定模态的生成器,便于在模态不变区域隐藏明确的触发模式。据我们所知,BadCM是第一个在统一框架内为各种跨模态攻击精心设计的隐形后门方法。在两个典型应用,即跨模态检索和视觉问答上进行的综合实验评估,证明了我们的方法在多种攻击场景下的有效性和泛化性。此外,我们表明BadCM可以稳健地规避现有的后门防御。我们的代码可在https://github.com/xandery-geek/BadCM获取。