Alarcão Soraia M, Mendonça Vânia, Maruta Carolina, Fonseca Manuel J
LASIGE, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal.
INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal.
Multimed Tools Appl. 2023;82(8):11619-11661. doi: 10.1007/s11042-022-13119-0. Epub 2022 Aug 20.
One of the main challenges in CBIR systems is to choose discriminative and compact features, among dozens, to represent the images under comparison. Over the years, a great effort has been made to combine multiple features, mainly using early, late, and hierarchical fusion techniques. Unveiling the perfect combination of features is highly domain-specific and dependent on the type of image. Thus, the process of designing a CBIR system for new datasets or domains involves a huge experimentation overhead, leading to multiple fine-tuned CBIR systems. It would be desirable to dynamically find the best combination of CBIR systems without needing to go through such extensive experimentation and without requiring previous domain knowledge. In this paper, we propose ExpertosLF, a model-agnostic interpretable late fusion technique based on online learning with expert advice, which dynamically combines CBIR systems without knowing a priori which ones are the best for a given domain. At each query, ExpertosLF takes advantage of user's feedback to determine each CBIR contribution in the ensemble for the following queries. ExpertosLF produces an interpretable ensemble that is independent of the dataset and domain. Moreover, ExpertosLF is designed to be modular, and scalable. Experiments on 13 benchmark datasets from the Biomedical, Real, and Sketch domains revealed that: (i) ExpertosLF surpasses the performance of state of the art late-fusion techniques; (ii) it successfully and quickly converges to the performance of the best CBIR sets across domains without any previous domain knowledge (in most cases, fewer than 25 queries need to receive human feedback).
基于内容的图像检索(CBIR)系统的主要挑战之一是,在数十种特征中选择具有区分性且紧凑的特征,以表示待比较的图像。多年来,人们付出了巨大努力来组合多种特征,主要采用早期、晚期和分层融合技术。揭示特征的完美组合高度依赖于特定领域,并且取决于图像类型。因此,为新数据集或领域设计CBIR系统的过程涉及大量的实验开销,从而产生多个经过微调的CBIR系统。期望能够动态地找到CBIR系统的最佳组合,而无需进行如此广泛的实验,也无需先前的领域知识。在本文中,我们提出了ExpertosLF,这是一种基于专家建议的在线学习的模型无关可解释晚期融合技术,它可以在不知道哪些系统对于给定领域是最佳的先验知识的情况下动态地组合CBIR系统。在每次查询时,ExpertosLF利用用户反馈来确定每个CBIR系统在集成中对后续查询的贡献。ExpertosLF生成一个与数据集和领域无关的可解释集成。此外,ExpertosLF被设计为模块化且可扩展的。在来自生物医学、真实和草图领域的13个基准数据集上进行的实验表明:(i)ExpertosLF超过了现有晚期融合技术的性能;(ii)它成功且迅速地收敛到跨领域最佳CBIR集的性能,而无需任何先前的领域知识(在大多数情况下,少于25次查询就需要接收人工反馈)。