
CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning.

Authors

Li Yukun, Pang Guansong, Suo Wei, Jing Chenchen, Xi Yuling, Liu Lingqiao, Chen Hao, Liang Guoqiang, Wang Peng

Publication

IEEE Trans Neural Netw Learn Syst. 2025 Aug;36(8):15137-15151. doi: 10.1109/TNNLS.2025.3547882.

Abstract

This article investigates the problem of continual learning (CL) of vision-language models (VLMs) in open domains, where models are required to perform continual updating and inference on a stream of datasets from diverse seen and unseen domains with novel classes. Such a capability is crucial for various applications in open environments, e.g., AI assistants, autonomous driving systems, and robotics. Current CL studies mostly focus on closed-set scenarios in a single domain with known classes. Large pretrained VLMs such as CLIP have showcased exceptional zero-shot recognition capabilities, and several recent studies have leveraged the unique characteristics of VLMs to mitigate catastrophic forgetting in CL. However, they primarily focus on closed-set CL in a single-domain dataset. Open-domain CL of large VLMs is significantly more challenging due to 1) large class correlations and domain gaps across the datasets and 2) the forgetting of zero-shot knowledge in the pretrained VLMs and the knowledge learned from the newly adapted datasets. In this work, we introduce a novel approach, termed CoLeCLIP, which learns an open-domain CL model based on CLIP. It addresses these challenges through joint learning of a set of task prompts and a cross-domain class vocabulary. Extensive experiments on 11 domain datasets show that CoLeCLIP achieves new state-of-the-art performance for open-domain CL under both task- and class-incremental learning (CIL) settings.
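The abstract only names the two components of CoLeCLIP. As a rough illustration of the idea, below is a minimal, hypothetical PyTorch sketch of how a per-task prompt pool and a shared cross-domain class vocabulary could sit alongside a frozen CLIP-style backbone. All names and details here (PromptPool, ClassVocabulary, embed_dim, the momentum update, the stand-in random features) are illustrative assumptions, not the paper's actual method or code.

import torch
import torch.nn as nn
import torch.nn.functional as F


class PromptPool(nn.Module):
    # One small learnable prompt per task; prompts from earlier tasks are frozen
    # when a new task starts, so previously adapted knowledge is not overwritten.
    def __init__(self, embed_dim: int, prompt_len: int = 4):
        super().__init__()
        self.embed_dim = embed_dim
        self.prompt_len = prompt_len
        self.prompts = nn.ParameterList()

    def add_task(self) -> None:
        for p in self.prompts:  # freeze prompts of previous tasks
            p.requires_grad_(False)
        self.prompts.append(
            nn.Parameter(torch.randn(self.prompt_len, self.embed_dim) * 0.02)
        )

    def forward(self, task_id: int) -> torch.Tensor:
        return self.prompts[task_id]


class ClassVocabulary:
    # Cross-domain vocabulary: one unit-norm text embedding per class name,
    # shared across tasks so a class seen in several domains is stored once.
    def __init__(self):
        self.embeddings: dict[str, torch.Tensor] = {}

    def update(self, class_name: str, text_feat: torch.Tensor, momentum: float = 0.9) -> None:
        text_feat = F.normalize(text_feat.detach(), dim=-1)
        if class_name in self.embeddings:  # blend with the stored embedding
            mixed = momentum * self.embeddings[class_name] + (1 - momentum) * text_feat
            self.embeddings[class_name] = F.normalize(mixed, dim=-1)
        else:
            self.embeddings[class_name] = text_feat

    def classify(self, image_feat: torch.Tensor, temperature: float = 0.01):
        # Cosine-similarity classification over every class seen so far.
        names = list(self.embeddings)
        weights = torch.stack([self.embeddings[n] for n in names])  # (C, D)
        logits = F.normalize(image_feat, dim=-1) @ weights.t() / temperature
        return names, logits


# Illustrative usage: a frozen CLIP-like text/image encoder would normally
# produce the features below; random tensors stand in for them here.
vocab = ClassVocabulary()
prompts = PromptPool(embed_dim=512)
prompts.add_task()                          # begin task 0
vocab.update("zebra", torch.randn(512))     # cache class text embeddings
vocab.update("forest", torch.randn(512))
names, logits = vocab.classify(torch.randn(512))

The point of the sketch is only the division of labor the abstract describes: task-specific adaptation lives in lightweight prompts while class knowledge accumulates in a vocabulary shared across domains, leaving the pretrained CLIP weights untouched.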

