使用深度学习驱动的基于内容的图像检索（CBIR）优化视觉数据检索，以改善人机交互。

Optimizing visual data retrieval using deep learning driven CBIR for improved human machine interaction.

作者信息

P Arulmozhi, R Gopi

机构信息

Faculty of Information Technology, Dhanalakshmi Srinivasan Engineering College, Perambalur, Tamilnadu, India.

Faculty of Computer Science and Engineering, Dhanalakshmi Srinivasan Engineering College, Perambalur, Tamilnadu, India.

出版信息

Sci Rep. 2025 Jul 2;15(1):23169. doi: 10.1038/s41598-025-05478-z.

DOI:10.1038/s41598-025-05478-z

PMID:40603973

Abstract

Content-based image retrieval (CBIR) systems have formidable obstacles in connecting human comprehension with machine-driven feature extraction due to the exponential expansion of visual data across many areas. Robust performance across varied datasets is challenging for traditional CBIR methods due to their reliance on hand-crafted features and inflexible structures. This study presents a deep adaptive attention network (DAAN) for CBIR that combines multi-scale feature extraction and hybrid neural architectures to solve these problems and improve the speed and accuracy of visual retrieval. The DAAN architecture integrates transformer-based models for capturing picture contextual connections with deep neural network (DNN) to extract spatial features. A new adaptive multi-level attention module (AMLA) that guarantees accurate feature weighting improves the system's ability to detect minute visual material changes. Findings show that DAAN-CBIR outperforms existing approaches with high mean average precision (map), retrieval speed, and reduced training time. These developments prove its efficacy in various fields, including e-commerce, digital information preservation, medical imaging diagnostics, and personalized media recommendations.

摘要

基于内容的图像检索（CBIR）系统在将人类理解与机器驱动的特征提取相联系方面面临巨大障碍，这是由于视觉数据在许多领域呈指数级增长。传统的CBIR方法由于依赖手工制作的特征和不灵活的结构，在不同数据集上实现稳健性能具有挑战性。本研究提出了一种用于CBIR的深度自适应注意力网络（DAAN），它结合了多尺度特征提取和混合神经架构来解决这些问题，并提高视觉检索的速度和准确性。DAAN架构将基于Transformer的模型与深度神经网络（DNN）集成，以捕获图片上下文连接并提取空间特征。一种新的自适应多级注意力模块（AMLA）可确保准确的特征加权，提高了系统检测微小视觉材料变化的能力。研究结果表明，DAAN-CBIR在平均精度均值（map）、检索速度和减少训练时间方面优于现有方法。这些进展证明了它在电子商务、数字信息保存、医学成像诊断和个性化媒体推荐等各个领域的有效性。

相似文献

Optimizing visual data retrieval using deep learning driven CBIR for improved human machine interaction.

Sci Rep. 2025 Jul 2;15(1):23169. doi: 10.1038/s41598-025-05478-z.

DNA-CBIR: DNA Translation Inspired Codon Pattern-Based Deep Image Feature Extraction for Content-Based Image Retrieval.

IEEE Trans Nanobioscience. 2025 Jul;24(3):318-330. doi: 10.1109/TNB.2025.3540102.

CBAM VGG16: An efficient driver distraction classification using CBAM embedded VGG16 architecture.

Comput Biol Med. 2024 Sep;180:108945. doi: 10.1016/j.compbiomed.2024.108945. Epub 2024 Aug 1.

TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.

Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.

A novel deep learning framework for retinal disease detection leveraging contextual and local features cues from retinal images.

Med Biol Eng Comput. 2025 Feb 7. doi: 10.1007/s11517-025-03314-0.

A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.

Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.

A fake news detection model using the integration of multimodal attention mechanism and residual convolutional network.

Sci Rep. 2025 Jul 1;15(1):20544. doi: 10.1038/s41598-025-05702-w.

Multiclass skin lesion classification and localziation from dermoscopic images using a novel network-level fused deep architecture and explainable artificial intelligence.

BMC Med Inform Decis Mak. 2025 Jul 1;25(1):215. doi: 10.1186/s12911-025-03051-2.

DASNet a dual branch multi level attention sheep counting network.

Sci Rep. 2025 Jul 2;15(1):23228. doi: 10.1038/s41598-025-97929-w.

Skin-CAD: Explainable deep learning classification of skin cancer from dermoscopic images by feature selection of dual high-level CNNs features and transfer learning.

Comput Biol Med. 2024 Aug;178:108798. doi: 10.1016/j.compbiomed.2024.108798. Epub 2024 Jun 25.

本文引用的文献

Fibrosis and inflammatory activity diagnosis of chronic hepatitis C based on extreme learning machine.

Sci Rep. 2025 Jan 2;15(1):11. doi: 10.1038/s41598-024-84695-4.

Content-Based Image Retrieval and Image Classification System for Early Prediction of Bladder Cancer.

Diagnostics (Basel). 2024 Nov 22;14(23):2637. doi: 10.3390/diagnostics14232637.

Zero-Shot Sketch-Based Image Retrieval Using StyleGen and Stacked Siamese Neural Networks.

J Imaging. 2024 Mar 27;10(4):79. doi: 10.3390/jimaging10040079.

Developing Deep LSTMs With Later Temporal Attention for Predicting COVID-19 Severity, Clinical Outcome, and Antibody Level by Screening Serological Indicators Over Time.

IEEE J Biomed Health Inform. 2024 Jul;28(7):4204-4215. doi: 10.1109/JBHI.2024.3384333. Epub 2024 Jul 2.

Enhancing the Super-Resolution of Medical Images: Introducing the Deep Residual Feature Distillation Channel Attention Network for Optimized Performance and Efficiency.

Bioengineering (Basel). 2023 Nov 19;10(11):1332. doi: 10.3390/bioengineering10111332.

Content-Based Image Retrieval for Traditional Indonesian Woven Fabric Images Using a Modified Convolutional Neural Network Method.

J Imaging. 2023 Aug 18;9(8):165. doi: 10.3390/jimaging9080165.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用深度学习驱动的基于内容的图像检索（CBIR）优化视觉数据检索，以改善人机交互。

Optimizing visual data retrieval using deep learning driven CBIR for improved human machine interaction.

作者信息

P Arulmozhi, R Gopi

机构信息

Faculty of Information Technology, Dhanalakshmi Srinivasan Engineering College, Perambalur, Tamilnadu, India.

Faculty of Computer Science and Engineering, Dhanalakshmi Srinivasan Engineering College, Perambalur, Tamilnadu, India.

出版信息

Sci Rep. 2025 Jul 2;15(1):23169. doi: 10.1038/s41598-025-05478-z.

DOI:10.1038/s41598-025-05478-z

PMID:40603973

Abstract

摘要

使用深度学习驱动的基于内容的图像检索（CBIR）优化视觉数据检索，以改善人机交互。

Optimizing visual data retrieval using deep learning driven CBIR for improved human machine interaction.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

使用深度学习驱动的基于内容的图像检索（CBIR）优化视觉数据检索，以改善人机交互。

Optimizing visual data retrieval using deep learning driven CBIR for improved human machine interaction.

作者信息

机构信息

出版信息

相似文献

本文引用的文献