融合卷积与稀疏编码以学习低维判别性图像表示

Integrating Convolution and Sparse Coding for Learning Low-Dimensional Discriminative Image Representations.

作者信息

Wei Xian, Liu Yingjie, Tang Xuan, Yu Shui, Chen Mingsong

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Jul;36(7):12483-12496. doi: 10.1109/TNNLS.2024.3453374.

DOI:10.1109/TNNLS.2024.3453374

Abstract

This work investigates the problem of efficiently learning discriminative low-dimensional (LD) representations of multiclass image objects. We propose a generic end-to-end approach that jointly optimizes sparse dictionary and convolutions for learning LOW-dimensional discriminative image representations, named SparConvLow, taking advantage of convolutional neural networks (CNNs), dictionary learning, and orthogonal projections. The whole learning process can be summarized as follows. First, a CNN module is employed to extract high-dimensional (HD) preliminary convolutional features. Second, to avoid the high computational cost of direct sparse coding on HD CNN features, we learn sparse representation (SR) over a task-driven dictionary in the space with the feature being orthogonally projected. We then exploit the discriminative projection on SR. The whole learning process is consistently treated as an end-to-end joint optimization problem of trace quotient maximization. The cost function is well-defined on the product of the CNN parameters space, the Stiefel manifold, the Oblique manifold, and the Grassmann manifold. By using the explicit gradient delivery, the cost function is optimized via a geometrical stochastic gradient descent (SGD) algorithm along with the chain rule and the backpropagation. The experimental results show that the proposed method can achieve a highly competitive performance with the state-of-the-art (SOTA) image classification, object categorization, and face recognition methods, under both supervised and semi-supervised settings. The code is available at https://github.com/MVPR-Group/SparConvLow.

摘要

这项工作研究了高效学习多类图像对象的判别性低维（LD）表示的问题。我们提出了一种通用的端到端方法，该方法联合优化稀疏字典和卷积，以学习低维判别性图像表示，名为SparConvLow，利用了卷积神经网络（CNN）、字典学习和正交投影。整个学习过程可总结如下。首先，使用一个CNN模块来提取高维（HD）初步卷积特征。其次，为避免对HD CNN特征进行直接稀疏编码的高计算成本，我们在通过正交投影特征的空间中，在任务驱动的字典上学习稀疏表示（SR）。然后，我们对SR进行判别性投影。整个学习过程始终被视为迹商最大化的端到端联合优化问题。成本函数在CNN参数空间、斯蒂费尔流形、斜流形和格拉斯曼流形的乘积上有明确定义。通过使用显式梯度传递，成本函数通过几何随机梯度下降（SGD）算法以及链式法则和反向传播进行优化。实验结果表明，在有监督和半监督设置下，该方法与当前最先进的（SOTA）图像分类、目标分类和人脸识别方法相比，能实现极具竞争力的性能。代码可在https://github.com/MVPR-Group/SparConvLow获取。

相似文献

Integrating Convolution and Sparse Coding for Learning Low-Dimensional Discriminative Image Representations.融合卷积与稀疏编码以学习低维判别性图像表示

IEEE Trans Neural Netw Learn Syst. 2025 Jul;36(7):12483-12496. doi: 10.1109/TNNLS.2024.3453374.

Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究

Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.

Short-Term Memory Impairment短期记忆障碍

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

"I Don't Understand Their Sense of Belonging": Exploring How Nonbinary Autistic Adults Experience Gender.“我不理解他们的归属感”：探索非二元性别的自闭症成年人如何体验性别。

Autism Adulthood. 2024 Dec 2;6(4):462-473. doi: 10.1089/aut.2023.0071. eCollection 2024 Dec.

The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》

Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.

A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。

Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.

Sparse-view spectral CT reconstruction via a coupled subspace representation and score-based generative model.基于耦合子空间表示和基于分数的生成模型的稀疏视图光谱CT重建

Quant Imaging Med Surg. 2025 Jun 6;15(6):5474-5495. doi: 10.21037/qims-24-2226. Epub 2025 May 28.

Sexual Harassment and Prevention Training性骚扰与预防培训

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

融合卷积与稀疏编码以学习低维判别性图像表示

Integrating Convolution and Sparse Coding for Learning Low-Dimensional Discriminative Image Representations.

作者信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献