SIFT 流：跨越场景的密集对应及其应用。

SIFT flow: dense correspondence across scenes and its applications.

机构信息

Microsoft Research New England, Microsoft Corp., One Memorial Drive, Cambridge, MA 02142, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2011 May;33(5):978-94. doi: 10.1109/TPAMI.2010.147.

DOI:10.1109/TPAMI.2010.147

PMID:20714019

Abstract

While image alignment has been studied in different areas of computer vision for decades, aligning images depicting different scenes remains a challenging problem. Analogous to optical flow, where an image is aligned to its temporally adjacent frame, we propose SIFT flow, a method to align an image to its nearest neighbors in a large image corpus containing a variety of scenes. The SIFT flow algorithm consists of matching densely sampled, pixelwise SIFT features between two images while preserving spatial discontinuities. The SIFT features allow robust matching across different scene/object appearances, whereas the discontinuity-preserving spatial model allows matching of objects located at different parts of the scene. Experiments show that the proposed approach robustly aligns complex scene pairs containing significant spatial differences. Based on SIFT flow, we propose an alignment-based large database framework for image analysis and synthesis, where image information is transferred from the nearest neighbors to a query image according to the dense scene correspondence. This framework is demonstrated through concrete applications such as motion field prediction from a single image, motion synthesis via object transfer, satellite image registration, and face recognition.

摘要

虽然图像配准在计算机视觉的不同领域已经研究了几十年，但对齐描绘不同场景的图像仍然是一个具有挑战性的问题。类似于光流，将一个图像与它的时间相邻帧对齐，我们提出了 SIFT 流，一种将图像与其在包含各种场景的大型图像语料库中的最近邻对齐的方法。SIFT 流算法包括在两幅图像之间匹配密集采样的、逐像素的 SIFT 特征，同时保持空间不连续性。SIFT 特征允许在不同的场景/对象外观之间进行稳健匹配，而保持空间不连续性的空间模型允许匹配位于场景不同部分的对象。实验表明，所提出的方法能够稳健地对齐包含显著空间差异的复杂场景对。基于 SIFT 流，我们提出了一种基于对齐的大型数据库框架，用于图像分析和合成，根据密集的场景对应关系，将图像信息从最近邻传递到查询图像。该框架通过具体应用进行了演示，例如从单张图像预测运动场、通过对象传输进行运动合成、卫星图像配准和人脸识别。

相似文献

SIFT flow: dense correspondence across scenes and its applications.

IEEE Trans Pattern Anal Mach Intell. 2011 May;33(5):978-94. doi: 10.1109/TPAMI.2010.147.

Video registration using dynamic textures.

IEEE Trans Pattern Anal Mach Intell. 2011 Jan;33(1):158-71. doi: 10.1109/TPAMI.2010.61.

Observing human-object interactions: using spatial and functional compatibility for recognition.

IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1775-89. doi: 10.1109/TPAMI.2009.83.

Detecting abandoned objects with a moving camera.

IEEE Trans Image Process. 2010 Aug;19(8):2201-10. doi: 10.1109/TIP.2010.2045714. Epub 2010 Apr 5.

Nonparametric Scene Parsing via Label Transfer.

IEEE Trans Pattern Anal Mach Intell. 2011 Dec;33(12):2368-82. doi: 10.1109/TPAMI.2011.131. Epub 2011 Jun 30.

Alignment of continuous video onto 3D point clouds.

IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1305-18. doi: 10.1109/TPAMI.2005.152.

Robust object matching for persistent tracking with heterogeneous features.

IEEE Trans Pattern Anal Mach Intell. 2007 May;29(5):824-39. doi: 10.1109/TPAMI.2007.1052.

Registration of challenging image pairs: initialization, estimation, and decision.

IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):1973-89. doi: 10.1109/TPAMI.2007.1116.

Error analysis of robust optical flow estimation by least median of squares methods for the varying illumination model.

IEEE Trans Pattern Anal Mach Intell. 2006 Sep;28(9):1418-35. doi: 10.1109/TPAMI.2006.185.

Robust multiperson tracking from a mobile platform.

IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1831-46. doi: 10.1109/TPAMI.2009.109.

引用本文的文献

Efficient cell-wide mapping of mitochondria in electron microscopic volumes using webKnossos.

Cell Rep Methods. 2025 Feb 24;5(2):100989. doi: 10.1016/j.crmeth.2025.100989.

Tracing the Chromatin: From 3C to Live-Cell Imaging.

Chem Biomed Imaging. 2024 Jun 25;2(10):659-682. doi: 10.1021/cbmi.4c00033. eCollection 2024 Oct 28.

A Depth Awareness and Learnable Feature Fusion Network for Enhanced Geometric Perception in Semantic Correspondence.

Sensors (Basel). 2024 Oct 17;24(20):6680. doi: 10.3390/s24206680.

SC-AOF: A Sliding Camera and Asymmetric Optical-Flow-Based Blending Method for Image Stitching.

Sensors (Basel). 2024 Jun 21;24(13):4035. doi: 10.3390/s24134035.

Discriminative context-aware network for camouflaged object detection.

Front Artif Intell. 2024 Mar 27;7:1347898. doi: 10.3389/frai.2024.1347898. eCollection 2024.

Object-Oriented and Visual-Based Localization in Urban Environments.

Sensors (Basel). 2024 Mar 21;24(6):2014. doi: 10.3390/s24062014.

Determining dense velocity fields for fluid images based on affine motion.

PeerJ Comput Sci. 2024 Feb 16;10:e1810. doi: 10.7717/peerj-cs.1810. eCollection 2024.

Ref-MEF: Reference-Guided Flexible Gated Image Reconstruction Network for Multi-Exposure Image Fusion.

Entropy (Basel). 2024 Feb 3;26(2):139. doi: 10.3390/e26020139.

A revised conceptual framework for mouse vomeronasal pumping and stimulus sampling.

Curr Biol. 2024 Mar 25;34(6):1206-1221.e6. doi: 10.1016/j.cub.2024.01.036. Epub 2024 Feb 5.

LMFD: lightweight multi-feature descriptors for image stitching.

Sci Rep. 2023 Nov 30;13(1):21162. doi: 10.1038/s41598-023-48432-7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

SIFT 流：跨越场景的密集对应及其应用。

SIFT flow: dense correspondence across scenes and its applications.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献