• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于标签转移的非参数场景解析。

Nonparametric Scene Parsing via Label Transfer.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2011 Dec;33(12):2368-82. doi: 10.1109/TPAMI.2011.131. Epub 2011 Jun 30.

DOI:10.1109/TPAMI.2011.131
PMID:21709305
Abstract

While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scenes, and objects. In this paper, we propose a novel, nonparametric approach for object recognition and scene parsing using a new technology we name label transfer. For an input image, our system first retrieves its nearest neighbors from a large database containing fully annotated images. Then, the system establishes dense correspondences between the input image and each of the nearest neighbors using the dense SIFT flow algorithm [28], which aligns two images based on local image structures. Finally, based on the dense scene correspondences obtained from SIFT flow, our system warps the existing annotations and integrates multiple cues in a Markov random field framework to segment and recognize the query image. Promising experimental results have been achieved by our nonparametric scene parsing system on challenging databases. Compared to existing object recognition approaches that require training classifiers or appearance models for each object category, our system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.

摘要

虽然最近在对象识别和图像理解方面已经有了很多工作,但重点一直是仔细为图像、场景和对象建立数学模型。在本文中,我们提出了一种新颖的、基于非参数的方法,用于使用我们称之为标签传输的新技术进行对象识别和场景解析。对于输入图像,我们的系统首先从包含完全注释图像的大型数据库中检索其最近邻。然后,系统使用密集 SIFT 流算法[28]在输入图像和每个最近邻之间建立密集对应关系,该算法基于局部图像结构对齐两幅图像。最后,根据从 SIFT 流获得的密集场景对应关系,我们的系统对现有注释进行变形,并在马尔可夫随机场框架中集成多个线索,以分割和识别查询图像。我们的非参数场景解析系统在具有挑战性的数据库上取得了令人鼓舞的实验结果。与需要为每个对象类别训练分类器或外观模型的现有对象识别方法相比,我们的系统易于实现,参数少,并在检索/对齐过程中自然地嵌入上下文信息。

相似文献

1
Nonparametric Scene Parsing via Label Transfer.基于标签转移的非参数场景解析。
IEEE Trans Pattern Anal Mach Intell. 2011 Dec;33(12):2368-82. doi: 10.1109/TPAMI.2011.131. Epub 2011 Jun 30.
2
SIFT flow: dense correspondence across scenes and its applications.SIFT 流:跨越场景的密集对应及其应用。
IEEE Trans Pattern Anal Mach Intell. 2011 May;33(5):978-94. doi: 10.1109/TPAMI.2010.147.
3
Scene Parsing From an MAP Perspective.基于 MAP 的场景解析。
IEEE Trans Cybern. 2015 Sep;45(9):1876-86. doi: 10.1109/TCYB.2014.2361489. Epub 2014 Nov 4.
4
A tree-based context model for object recognition.基于树的上下文模型的目标识别。
IEEE Trans Pattern Anal Mach Intell. 2012 Feb;34(2):240-52. doi: 10.1109/TPAMI.2011.119.
5
Monocular visual scene understanding: understanding multi-object traffic scenes.单目视觉场景理解:理解多目标交通场景。
IEEE Trans Pattern Anal Mach Intell. 2013 Apr;35(4):882-97. doi: 10.1109/TPAMI.2012.174.
6
3-D object recognition using 2-D views.使用二维视图进行三维物体识别。
IEEE Trans Image Process. 2008 Nov;17(11):2236-55. doi: 10.1109/TIP.2008.2003404.
7
Three-dimensional model-based object recognition and segmentation in cluttered scenes.基于三维模型的杂乱场景中的目标识别与分割
IEEE Trans Pattern Anal Mach Intell. 2006 Oct;28(10):1584-601. doi: 10.1109/TPAMI.2006.213.
8
80 million tiny images: a large data set for nonparametric object and scene recognition.八千万张小图片:用于非参数化物体与场景识别的大型数据集。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1958-70. doi: 10.1109/TPAMI.2008.128.
9
Hierarchical object parsing from structured noisy point clouds.基于结构化噪声点云的分层目标解析。
IEEE Trans Pattern Anal Mach Intell. 2013 Jul;35(7):1649-59. doi: 10.1109/TPAMI.2012.262.
10
Temporal Pixel-Level Semantic Understanding Through the VSPW Dataset.通过VSPW数据集实现时间像素级语义理解
IEEE Trans Pattern Anal Mach Intell. 2023 Sep;45(9):11297-11308. doi: 10.1109/TPAMI.2023.3266023. Epub 2023 Aug 7.

引用本文的文献

1
A fast vanishing point detection method based on row space features suitable for real driving scenarios.一种基于行空间特征的快速消失点检测方法,适用于真实驾驶场景。
Sci Rep. 2023 Feb 22;13(1):3088. doi: 10.1038/s41598-023-30152-7.
2
Deep learning: survey of environmental and camera impacts on internet of things images.深度学习:环境与摄像头对物联网图像影响的综述
Artif Intell Rev. 2023 Feb 6:1-34. doi: 10.1007/s10462-023-10405-7.
3
Integrated 3d flow-based multi-atlas brain structure segmentation.基于集成 3D 流的多图谱脑结构分割。
PLoS One. 2022 Aug 15;17(8):e0270339. doi: 10.1371/journal.pone.0270339. eCollection 2022.
4
: Dual-resolution Semantic Segmentation with Rare Class-Oriented Superpixel Prior.基于面向稀有类别的超像素先验的双分辨率语义分割
Multimed Tools Appl. 2021 Jan;80(2):1687-1706. doi: 10.1007/s11042-020-09691-y. Epub 2020 Sep 9.
5
A Multi-Level Approach to Waste Object Segmentation.一种用于废物对象分割的多层次方法。
Sensors (Basel). 2020 Jul 8;20(14):3816. doi: 10.3390/s20143816.
6
Consistent Semantic Annotation of Outdoor Datasets via 2D/3D Label Transfer.通过 2D/3D 标签传递实现户外数据集的一致语义标注。
Sensors (Basel). 2018 Jul 12;18(7):2249. doi: 10.3390/s18072249.
7
Semantic Image Segmentation with Contextual Hierarchical Models.基于上下文层次模型的语义图像分割
IEEE Trans Pattern Anal Mach Intell. 2016 May;38(5):951-64. doi: 10.1109/TPAMI.2015.2473846. Epub 2015 Aug 27.
8
Recognizing Materials using Perceptually Inspired Features.利用感知启发特征识别材料。
Int J Comput Vis. 2013 Jul 1;103(3):348-371. doi: 10.1007/s11263-013-0609-0.
9
Spatial Bias in Multi-Atlas Based Segmentation.基于多图谱分割中的空间偏差。
Conf Comput Vis Pattern Recognit Workshops. 2012 Jun 24;2012:909-916. doi: 10.1109/CVPR.2012.6247765.
10
Multi-Atlas Segmentation with Joint Label Fusion.基于联合标签融合的多图谱分割
IEEE Trans Pattern Anal Mach Intell. 2013 Mar;35(3):611-23. doi: 10.1109/TPAMI.2012.143. Epub 2012 Jun 26.