• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于3D形状分类与检索的二阶谱变换模块

Second-order Spectral Transform Block for 3D Shape Classification and Retrieval.

作者信息

Yu Ruixuan, Sun Jian, Li Huibin

出版信息

IEEE Trans Image Process. 2020 Jan 23. doi: 10.1109/TIP.2020.2967579.

DOI:10.1109/TIP.2020.2967579
PMID:31995486
Abstract

In this paper, we propose a novel network block, dubbed as second-order spectral transform block, for 3D shape retrieval and classification. This network block generalizes the second-order pooling to 3D surface by designing a learnable non-linear transform on the spectrum of the pooled descriptor. The proposed block consists of following two components. First, the second-order average (SO-Avr) and max-pooling (SOMax) operations are designed on 3D surface to aggregate local descriptors, which are shown to be more discriminative than the popular average-pooling or max-pooling. Second, a learnable spectral transform parameterized by mixture of power function is proposed to perform non-linear feature mapping in the space of pooled descriptors, i.e., manifold of symmetric positive definite matrix for SO-Avr, and space of symmetric matrix for SOMax. The proposed block can be plugged into existing network architectures to aggregate local shape descriptors for boosting their performance. We apply it to a shallow network for nonrigid 3D shape analysis and to existing networks for rigid shape analysis, where it improves the first-tier retrieval accuracy by 7.2% on SHREC'14 Real dataset and achieves state-of-the-art classification accuracy on ModelNet40. As an extension, we apply our block to 2D image classification, showing its superiority compared with traditional second-order pooling methods. We also provide theoretical and experimental analysis on stability of the proposed second-order spectral transform block.

摘要

在本文中,我们提出了一种新颖的网络模块,称为二阶谱变换模块,用于三维形状检索和分类。该网络模块通过在池化描述符的频谱上设计一个可学习的非线性变换,将二阶池化推广到三维曲面。所提出的模块由以下两个部分组成。首先,在三维曲面上设计二阶平均(SO-Avr)和最大池化(SOMax)操作来聚合局部描述符,结果表明这些操作比常用的平均池化或最大池化更具判别力。其次,提出了一种由幂函数混合参数化的可学习谱变换,用于在池化描述符空间中执行非线性特征映射,即对于SO-Avr在对称正定矩阵流形中,对于SOMax在对称矩阵空间中。所提出的模块可以插入到现有的网络架构中,以聚合局部形状描述符来提升其性能。我们将其应用于一个用于非刚性三维形状分析的浅层网络以及用于刚性形状分析的现有网络,在SHREC'14真实数据集上,它将一级检索准确率提高了7.2%,并在ModelNet40上实现了当前最优的分类准确率。作为扩展,我们将我们的模块应用于二维图像分类,展示了其相较于传统二阶池化方法的优越性。我们还对所提出的二阶谱变换模块的稳定性进行了理论和实验分析。

相似文献

1
Second-order Spectral Transform Block for 3D Shape Classification and Retrieval.用于3D形状分类与检索的二阶谱变换模块
IEEE Trans Image Process. 2020 Jan 23. doi: 10.1109/TIP.2020.2967579.
2
Point2SpatialCapsule: Aggregating Features and Spatial Relationships of Local Regions on Point Clouds using Spatial-aware Capsules.点到空间胶囊:使用空间感知胶囊聚合点云上局部区域的特征和空间关系
IEEE Trans Image Process. 2020 Sep 7;PP. doi: 10.1109/TIP.2020.3019925.
3
Learning View-Based Graph Convolutional Network for Multi-View 3D Shape Analysis.基于学习的视图图卷积网络的多视图三维形状分析。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7525-7541. doi: 10.1109/TPAMI.2022.3221785. Epub 2023 May 5.
4
Deep Nonlinear Metric Learning for 3-D Shape Retrieval.三维形状检索的深度非线性度量学习。
IEEE Trans Cybern. 2018 Jan;48(1):412-422. doi: 10.1109/TCYB.2016.2638924. Epub 2016 Dec 28.
5
Progressive Shape-Distribution-Encoder for Learning 3D Shape Representation.渐进式形状分布编码器,用于学习 3D 形状表示。
IEEE Trans Image Process. 2017 Mar;26(3):1231-1242. doi: 10.1109/TIP.2017.2651408. Epub 2017 Jan 10.
6
DeepShape: Deep-Learned Shape Descriptor for 3D Shape Retrieval.DeepShape:用于三维形状检索的深度学习形状描述符。
IEEE Trans Pattern Anal Mach Intell. 2017 Jul;39(7):1335-1345. doi: 10.1109/TPAMI.2016.2596722. Epub 2016 Jul 29.
7
Fine-Tuning CNN Image Retrieval with No Human Annotation.无人工标注微调卷积神经网络图像检索。
IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1655-1668. doi: 10.1109/TPAMI.2018.2846566. Epub 2018 Jun 12.
8
A New Breakpoint to Classify 3D Voxels in MRI: A Space Transform Strategy with 3t2FTS-v2 and Its Application for ResNet50-Based Categorization of Brain Tumors.一种用于在磁共振成像中对三维体素进行分类的新断点:一种基于3t2FTS-v2的空间变换策略及其在基于ResNet50的脑肿瘤分类中的应用。
Bioengineering (Basel). 2023 May 23;10(6):629. doi: 10.3390/bioengineering10060629.
9
Multi-view SoftPool attention convolutional networks for 3D model classification.用于3D模型分类的多视图软池注意力卷积网络
Front Neurorobot. 2022 Nov 16;16:1029968. doi: 10.3389/fnbot.2022.1029968. eCollection 2022.
10
Combine EfficientNet and CNN for 3D model classification.结合高效神经网络(EfficientNet)和卷积神经网络(CNN)进行三维模型分类。
Math Biosci Eng. 2023 Mar 13;20(5):9062-9079. doi: 10.3934/mbe.2023398.

引用本文的文献

1
Scale-invariant Mexican Hat wavelet descriptor for non-rigid shape similarity measurement.用于非刚体形状相似度测量的尺度不变墨西哥帽小波描述符。
Sci Rep. 2023 Feb 13;13(1):2518. doi: 10.1038/s41598-023-29047-4.