• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于图像检索与重建的离散余弦变换(DCT)启发式特征变换

DCT Inspired Feature Transform for Image Retrieval and Reconstruction.

出版信息

IEEE Trans Image Process. 2016 Sep;25(9):4406-4420. doi: 10.1109/TIP.2016.2590323. Epub 2016 Jul 11.

DOI:10.1109/TIP.2016.2590323
PMID:27416596
Abstract

Scale invariant feature transform (SIFT) is effective for representing images in computer vision tasks, as one of the most resistant feature descriptions to common image deformations. However, two issues should be addressed: first, feature description based on gradient accumulation is not compact and contains redundancies; second, multiple orientations are often extracted from one local region and therefore produce multiple descriptions, which is not good for memory efficiency. To resolve these two issues, this paper introduces a novel method to determine the dominant orientation for multiple-orientation cases, named discrete cosine transform (DCT) intrinsic orientation, and a new DCT inspired feature transform (DIFT). In each local region, it first computes a unique DCT intrinsic orientation via DCT matrix and rotates the region accordingly, and then describes the rotated region with partial DCT matrix coefficients to produce an optimized low-dimensional descriptor. We test the accuracy and robustness of DIFT on real image matching. Afterward, extensive applications performed on public benchmarks for visual retrieval show that using DCT intrinsic orientation achieves performance on a par with SIFT, but with only 60% of its features; replacing the SIFT description with DIFT reduces dimensions from 128 to 32 and improves precision. Image reconstruction resulting from DIFT is presented to show another of its advantages over SIFT.

摘要

尺度不变特征变换(SIFT)在计算机视觉任务中对图像表示很有效,是对常见图像变形最具抗性的特征描述之一。然而,有两个问题需要解决:第一,基于梯度累积的特征描述不紧凑且包含冗余;第二,通常会从一个局部区域提取多个方向,因此会产生多个描述,这对内存效率不利。为了解决这两个问题,本文引入了一种新方法来确定多方向情况下的主导方向,即离散余弦变换(DCT)固有方向,以及一种受DCT启发的新特征变换(DIFT)。在每个局部区域中,它首先通过DCT矩阵计算唯一的DCT固有方向并相应地旋转该区域,然后用部分DCT矩阵系数描述旋转后的区域以生成优化的低维描述符。我们在真实图像匹配中测试了DIFT的准确性和鲁棒性。随后,在用于视觉检索的公共基准上进行的广泛应用表明,使用DCT固有方向的性能与SIFT相当,但特征数量仅为其60%;用DIFT替换SIFT描述可将维度从128降至32并提高精度。展示了由DIFT进行的图像重建,以说明它相对于SIFT的另一个优势。

相似文献

1
DCT Inspired Feature Transform for Image Retrieval and Reconstruction.用于图像检索与重建的离散余弦变换(DCT)启发式特征变换
IEEE Trans Image Process. 2016 Sep;25(9):4406-4420. doi: 10.1109/TIP.2016.2590323. Epub 2016 Jul 11.
2
A Hybrid Robust Image Watermarking Method Based on DWT-DCT and SIFT for Copyright Protection.一种基于离散小波变换-离散余弦变换和尺度不变特征变换的混合鲁棒图像水印方法用于版权保护。
J Imaging. 2021 Oct 19;7(10):218. doi: 10.3390/jimaging7100218.
3
Edge-SIFT: discriminative binary descriptor for scalable partial-duplicate mobile search.边缘 SIFT:可扩展的部分重复移动搜索的判别二进制描述符。
IEEE Trans Image Process. 2013 Jul;22(7):2889-902. doi: 10.1109/TIP.2013.2251650. Epub 2013 Mar 7.
4
MBR-SIFT: A mirror reflected invariant feature descriptor using a binary representation for image matching.MBR-SIFT:一种使用二进制表示进行图像匹配的镜像反射不变特征描述符。
PLoS One. 2017 May 18;12(5):e0178090. doi: 10.1371/journal.pone.0178090. eCollection 2017.
5
On the suitability of SIFT technique to deal with image modifications specific to confocal scanning laser microscopy.SIFT 技术对于处理共聚焦扫描激光显微镜特定的图像修改的适用性。
Microsc Microanal. 2010 Oct;16(5):515-30. doi: 10.1017/S1431927610000371. Epub 2010 Aug 5.
6
Transparent composite model for DCT coefficients: design and analysis.DCT 系数的透明复合模型:设计与分析。
IEEE Trans Image Process. 2014 Mar;23(3):1303-16. doi: 10.1109/TIP.2014.2300818.
7
Case-based fracture image retrieval.基于案例的骨折图像检索。
Int J Comput Assist Radiol Surg. 2012 May;7(3):401-11. doi: 10.1007/s11548-011-0643-8. Epub 2011 Jul 29.
8
A short feature vector for image matching: The Log-Polar Magnitude feature descriptor.用于图像匹配的短特征向量:对数极坐标幅度特征描述符。
PLoS One. 2017 Nov 30;12(11):e0188496. doi: 10.1371/journal.pone.0188496. eCollection 2017.
9
Cross-indexing of binary SIFT codes for large-scale image search.二进制 SIFT 代码的交叉索引在大规模图像搜索中的应用。
IEEE Trans Image Process. 2014 May;23(5):2047-57. doi: 10.1109/TIP.2014.2312283.
10
USB: ultrashort binary descriptor for fast visual matching and retrieval.USB:超短二进制描述符,用于快速视觉匹配和检索。
IEEE Trans Image Process. 2014 Aug;23(8):3671-83. doi: 10.1109/TIP.2014.2330794. Epub 2014 Jun 12.