达尔文 GSE：为知识产权数据集建立更好的图像检索系统。

DarwinGSE: Towards better image retrieval systems for intellectual property datasets.

机构信息

Techframe-Information Systems, SA, São Domingos de Rana, Portugal.

Smart Cities Research Center, Polytechnic Institute of Tomar, Tomar, Portugal.

出版信息

PLoS One. 2024 Jul 1;19(7):e0304915. doi: 10.1371/journal.pone.0304915. eCollection 2024.

DOI:10.1371/journal.pone.0304915

PMID:38950045

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11216576/

Abstract

A trademark's image is usually the first type of indirect contact between a consumer and a product or a service. Companies rely on graphical trademarks as a symbol of quality and instant recognition, seeking to protect them from copyright infringements. A popular defense mechanism is graphical searching, where an image is compared to a large database to find potential conflicts with similar trademarks. Despite not being a new subject, image retrieval state-of-the-art lacks reliable solutions in the Industrial Property (IP) sector, where datasets are practically unrestricted in content, with abstract images for which modeling human perception is a challenging task. Existing Content-based Image Retrieval (CBIR) systems still present several problems, particularly in terms of efficiency and reliability. In this paper, we propose a new CBIR system that overcomes these major limitations. It follows a modular methodology, composed of a set of individual components tasked with the retrieval, maintenance and gradual optimization of trademark image searching, working on large-scale, unlabeled datasets. Its generalization capacity is achieved using multiple feature descriptions, weighted separately, and combined to represent a single similarity score. Images are evaluated for general features, edge maps, and regions of interest, using a method based on Watershedding K-Means segments. We propose an image recovery process that relies on a new similarity measure between all feature descriptions. New trademark images are added every day to ensure up-to-date results. The proposed system showcases a timely retrieval speed, with 95% of searches having a 10 second presentation speed and a mean average precision of 93.7%, supporting its applicability to real-word IP protection scenarios.

摘要

商标的形象通常是消费者与产品或服务的第一类间接接触。公司依赖图形商标作为质量和即时识别的象征，寻求保护它们免受版权侵权。一种流行的防御机制是图形搜索，即将图像与大型数据库进行比较，以查找与类似商标的潜在冲突。尽管不是一个新主题，但图像检索的最新技术在工业产权 (IP) 领域缺乏可靠的解决方案，其中数据集在内容上几乎不受限制，具有抽象图像，对其进行建模以模拟人类感知是一项具有挑战性的任务。现有的基于内容的图像检索 (CBIR) 系统仍然存在几个问题，特别是在效率和可靠性方面。在本文中，我们提出了一种新的 CBIR 系统，可以克服这些主要限制。它遵循一种模块化方法，由一组单独的组件组成，负责检索、维护和逐步优化商标图像搜索，处理大规模、无标签数据集。其泛化能力是通过使用多个特征描述来实现的，这些特征描述分别加权并组合起来表示单个相似性得分。使用基于分水岭 K-Means 段的方法对图像进行通用特征、边缘图和感兴趣区域的评估。我们提出了一种图像恢复过程，该过程依赖于所有特征描述之间的新相似性度量。每天都会添加新的商标图像，以确保结果是最新的。所提出的系统展示了及时的检索速度，95%的搜索具有 10 秒的呈现速度，平均准确率为 93.7%，支持其在实际 IP 保护场景中的应用。