

Deep image features sensing with multilevel fusion for complex convolution neural networks & cross domain benchmarks.

Author Information

Shabir Aiza, Ahmed Khawaja Tehseen, Mahmood Arif, Garay Helena, Prado González Luis Eduardo, Ashraf Imran

Affiliations

Institute of Computer Science and Information Technology, The Women University Multan, Multan, Pakistan.

Department of Computer Science, Bahauddin Zakariya University, Multan, Pakistan.

Publication Information

PLoS One. 2025 Mar 18;20(3):e0317863. doi: 10.1371/journal.pone.0317863. eCollection 2025.

Abstract

Efficient image retrieval from a variety of datasets is crucial in today's digital world. In Content-Based Image Retrieval (CBIR), visual properties are represented using primitive image signatures, and feature vectors are employed to classify images into predefined categories. This research presents a unique suppression-based feature identification technique that locates interest points by computing a productive sum of pixel derivatives, using the differentials to obtain corner scores. Scale-space interpolation is applied to define interest points by combining color features from spatially ordered, L2-normalized coefficients with shape and object information. Object-based feature vectors are formed from high-variance coefficients to reduce complexity and are converted into a bag-of-visual-words (BoVW) representation for effective retrieval and ranking. The presented method encompasses feature vectors for information synthesis and improves the discriminative strength of the retrieval system by extracting deep image features, including primitive, spatial, and overlaid features, through multilayer fusion of Convolutional Neural Networks (CNNs). Extensive experimentation is performed on standard image benchmark datasets, including ALOT, Cifar-10, Corel-10k, Tropical Fruits, and Zubud. These datasets cover a wide range of categories, including shape, color, texture, spatial, and complicated objects. Experimental results demonstrate considerable improvements in precision and recall, average retrieval precision and recall, and mean average precision and recall across various image semantic groups within versatile datasets. The fusion of traditional feature extraction methods with multilevel CNNs advances image sensing and retrieval systems, promising more accurate and efficient image retrieval solutions.
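The abstract's interest-point step (corner scores computed from sums of pixel derivatives) is not specified in detail. A minimal sketch of a standard derivative-based corner score, the Harris measure, is given below for illustration; it assumes a grayscale NumPy image, and the function name and parameters are illustrative, not the authors' implementation:

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def harris_corner_scores(img, k=0.04, win=3):
    """Corner scores from pixel derivatives (Harris-style measure).

    Illustrative only: the paper's suppression-based detector is not
    fully specified in the abstract. This shows the common idea of
    scoring interest points from windowed sums of products of image
    derivatives (the differentials mentioned in the text).
    """
    img = img.astype(np.float64)
    # Central-difference pixel derivatives.
    Ix = np.gradient(img, axis=1)
    Iy = np.gradient(img, axis=0)

    pad = win // 2
    def window_sum(a):
        # Sum each derivative product over a local win x win window.
        a = np.pad(a, pad, mode="edge")
        return sliding_window_view(a, (win, win)).sum(axis=(-1, -2))

    Sxx = window_sum(Ix * Ix)
    Syy = window_sum(Iy * Iy)
    Sxy = window_sum(Ix * Iy)

    # Harris response per pixel: det(M) - k * trace(M)^2,
    # where M = [[Sxx, Sxy], [Sxy, Syy]].
    det = Sxx * Syy - Sxy ** 2
    trace = Sxx + Syy
    return det - k * trace ** 2
```

High responses mark corner-like interest points; the paper's pipeline would then suppress weak responses and feed the surviving points into scale-space interpolation and BoVW encoding.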


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d72c/11918433/152798611138/pone.0317863.g001.jpg
