• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于形状语法和强化学习的立面解析。

Parsing facades with shape grammars and reinforcement learning.

机构信息

Ecole Centrale Paris, Grande Voie des Vignes 92290, Chatenay-Malabry, France.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2013 Jul;35(7):1744-56. doi: 10.1109/TPAMI.2012.252.

DOI:10.1109/TPAMI.2012.252
PMID:23682000
Abstract

In this paper, we use shape grammars (SGs) for facade parsing, which amounts to segmenting 2D building facades into balconies, walls, windows, and doors in an architecturally meaningful manner. The main thrust of our work is the introduction of reinforcement learning (RL) techniques to deal with the computational complexity of the problem. RL provides us with techniques such as Q-learning and state aggregation which we exploit to efficiently solve facade parsing. We initially phrase the 1D parsing problem in terms of a Markov Decision Process, paving the way for the application of RL-based tools. We then develop novel techniques for the 2D shape parsing problem that take into account the specificities of the facade parsing problem. Specifically, we use state aggregation to enforce the symmetry of facade floors and demonstrate how to use RL to exploit bottom-up, image-based guidance during optimization. We provide systematic results on the Paris building dataset and obtain state-of-the-art results in a fraction of the time required by previous methods. We validate our method under diverse imaging conditions and make our software and results available online.

摘要

在本文中,我们使用形状语法 (SG) 进行立面解析,即将二维建筑立面以建筑上有意义的方式分割为阳台、墙壁、窗户和门。我们工作的主要重点是引入强化学习 (RL) 技术来解决问题的计算复杂性。RL 为我们提供了 Q-learning 和状态聚合等技术,我们利用这些技术来有效地解决立面解析问题。我们最初将 1D 解析问题表述为马尔可夫决策过程,为应用基于 RL 的工具铺平了道路。然后,我们为 2D 形状解析问题开发了新颖的技术,这些技术考虑到了立面解析问题的特殊性。具体来说,我们使用状态聚合来强制实施立面楼层的对称性,并展示如何使用 RL 在优化过程中利用基于图像的自下而上的指导。我们在巴黎建筑数据集上提供了系统的结果,并在以前方法所需时间的一小部分内获得了最先进的结果。我们在不同的成像条件下验证了我们的方法,并在线提供我们的软件和结果。

相似文献

1
Parsing facades with shape grammars and reinforcement learning.基于形状语法和强化学习的立面解析。
IEEE Trans Pattern Anal Mach Intell. 2013 Jul;35(7):1744-56. doi: 10.1109/TPAMI.2012.252.
2
Image-based modeling of unwrappable façades.基于图像的不可展开立面建模。
IEEE Trans Vis Comput Graph. 2013 Oct;19(10):1720-31. doi: 10.1109/TVCG.2013.68.
3
Progressive Feature Learning for Facade Parsing With Occlusions.
IEEE Trans Image Process. 2022;31:2081-2093. doi: 10.1109/TIP.2022.3152004. Epub 2022 Feb 28.
4
A Kronecker Product Model for Repeated Pattern Detection on 2D Urban Images.二维城市图像上重复模式检测的 Kronecker 积模型。
IEEE Trans Pattern Anal Mach Intell. 2019 Sep;41(9):2266-2272. doi: 10.1109/TPAMI.2018.2858795. Epub 2018 Jul 23.
5
Research on the Top-Down Parsing Method for Context-Sensitive Graph Grammars.上下文敏感图文法的自顶向下解析方法研究
PLoS One. 2015 Nov 30;10(11):e0142776. doi: 10.1371/journal.pone.0142776. eCollection 2015.
6
Hierarchical object parsing from structured noisy point clouds.基于结构化噪声点云的分层目标解析。
IEEE Trans Pattern Anal Mach Intell. 2013 Jul;35(7):1649-59. doi: 10.1109/TPAMI.2012.262.
7
Online reinforcement learning for dynamic multimedia systems.在线强化学习在动态多媒体系统中的应用。
IEEE Trans Image Process. 2010 Feb;19(2):290-305. doi: 10.1109/TIP.2009.2035228. Epub 2009 Oct 30.
8
Image dataset: Year-long hourly façade photos of a university building.图像数据集:一所大学建筑长达一年的每小时外立面照片。
Data Brief. 2024 Aug 2;56:110798. doi: 10.1016/j.dib.2024.110798. eCollection 2024 Oct.
9
Efficient 2D and 3D Facade Segmentation Using Auto-Context.基于自上下文的高效二维和三维立面分割。
IEEE Trans Pattern Anal Mach Intell. 2018 May;40(5):1273-1280. doi: 10.1109/TPAMI.2017.2696526. Epub 2017 Apr 24.
10
Model-based reinforcement learning for partially observable games with sampling-based state estimation.基于模型的强化学习在基于采样状态估计的部分可观测博弈中的应用
Neural Comput. 2007 Nov;19(11):3051-87. doi: 10.1162/neco.2007.19.11.3051.

引用本文的文献

1
Kangba Region of Sichuan based on swin transformer visual model research on the identification of facades of ethnic buildings.基于Swin变压器视觉模型的四川康巴地区民族建筑外立面识别研究
Sci Rep. 2024 Nov 20;14(1):28742. doi: 10.1038/s41598-024-78774-9.