• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于网格锚点的图像裁剪:一个新的基准和一个高效的模型。

Grid Anchor Based Image Cropping: A New Benchmark and An Efficient Model.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1304-1319. doi: 10.1109/TPAMI.2020.3024207. Epub 2022 Feb 3.

DOI:10.1109/TPAMI.2020.3024207
PMID:32931429
Abstract

Image cropping aims to improve the composition as well as aesthetic quality of an image by removing extraneous content from it. Most of the existing image cropping databases provide only one or several human-annotated bounding boxes as the groundtruths, which can hardly reflect the non-uniqueness and flexibility of image cropping in practice. The employed evaluation metrics such as intersection-over-union cannot reliably reflect the real performance of a cropping model, either. This work revisits the problem of image cropping, and presents a grid anchor based formulation by considering the special properties and requirements (e.g., local redundancy, content preservation, aspect ratio) of image cropping. Our formulation reduces the searching space of candidate crops from millions to no more than ninety. Consequently, a grid anchor based cropping benchmark is constructed, where all crops of each image are annotated and more reliable evaluation metrics are defined. To meet the practical demands of robust performance and high efficiency, we also design an effective and lightweight cropping model. By simultaneously considering the region of interest and region of discard, and leveraging multi-scale information, our model can robustly output visually pleasing crops for images of different scenes. With less than 2.5M parameters, our model runs at a speed of 200 FPS on one single GTX 1080Ti GPU and 12 FPS on one i7-6800K CPU. The code is available at: https://github.com/HuiZeng/Grid-Anchor-based-Image-Cropping-Pytorch.

摘要

图像裁剪旨在通过从图像中去除多余的内容来改善图像的构图和美学质量。大多数现有的图像裁剪数据库仅提供一个或几个人工注释的边界框作为groundtruths,这很难反映实际裁剪中边界框的非唯一性和灵活性。所采用的评价指标(如交并比)也不能可靠地反映裁剪模型的实际性能。本工作重新审视了图像裁剪问题,并提出了一种基于网格锚点的公式,同时考虑了图像裁剪的特殊性质和要求(例如,局部冗余、内容保留、纵横比)。我们的公式将候选裁剪的搜索空间从数百万减少到不超过九十。因此,构建了一个基于网格锚点的裁剪基准,其中每个图像的所有裁剪都进行了注释,并定义了更可靠的评价指标。为了满足鲁棒性能和高效率的实际需求,我们还设计了一种有效且轻量级的裁剪模型。通过同时考虑感兴趣区域和丢弃区域,并利用多尺度信息,我们的模型可以为不同场景的图像稳健地输出视觉上令人愉悦的裁剪。我们的模型参数少于 250 万,在单个 GTX 1080Ti GPU 上的运行速度为 200 FPS,在单个 i7-6800K CPU 上的运行速度为 12 FPS。代码可在:https://github.com/HuiZeng/Grid-Anchor-based-Image-Cropping-Pytorch 找到。

相似文献

1
Grid Anchor Based Image Cropping: A New Benchmark and An Efficient Model.基于网格锚点的图像裁剪:一个新的基准和一个高效的模型。
IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1304-1319. doi: 10.1109/TPAMI.2020.3024207. Epub 2022 Feb 3.
2
A Deep Network Solution for Attention and Aesthetics Aware Photo Cropping.一种用于注意力和美学感知照片裁剪的深度网络解决方案。
IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1531-1544. doi: 10.1109/TPAMI.2018.2840724. Epub 2018 May 25.
3
The region of interest localization for glaucoma analysis from retinal fundus image using deep learning.利用深度学习进行视网膜眼底图像的青光眼分析的感兴趣区域定位。
Comput Methods Programs Biomed. 2018 Oct;165:25-35. doi: 10.1016/j.cmpb.2018.08.003. Epub 2018 Aug 8.
4
SCPM-Net: An anchor-free 3D lung nodule detection network using sphere representation and center points matching.SCPM-Net:一种使用球体表示和中心点匹配的无锚点3D肺结节检测网络。
Med Image Anal. 2022 Jan;75:102287. doi: 10.1016/j.media.2021.102287. Epub 2021 Oct 22.
5
A benchmark bone marrow aspirate smear dataset and a multi-scale cell detection model for the diagnosis of hematological disorders.一个用于血液系统疾病诊断的基准骨髓穿刺涂片数据集和多尺度细胞检测模型。
Comput Med Imaging Graph. 2021 Jun;90:101912. doi: 10.1016/j.compmedimag.2021.101912. Epub 2021 Apr 2.
6
Benchmarking Scientific Image Forgery Detectors.基准科学图像伪造检测器。
Sci Eng Ethics. 2022 Aug 9;28(4):35. doi: 10.1007/s11948-022-00391-4.
7
How to model and simulate the effects of cropping systems on population dynamics and gene flow at the landscape level: example of oilseed rape volunteers and their role for co-existence of GM and non-GM crops.如何在景观层面模拟和仿真种植系统对种群动态及基因流动的影响:以油菜自生苗为例及其在转基因作物和非转基因作物共存中的作用
Environ Sci Pollut Res Int. 2009 May;16(3):348-60. doi: 10.1007/s11356-008-0080-6. Epub 2008 Dec 9.
8
Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping.快速A3RL:用于图像裁剪的美学感知对抗强化学习
IEEE Trans Image Process. 2019 May 7. doi: 10.1109/TIP.2019.2914360.
9
Benchmark on Indexing Algorithms for Accelerating Molecular Similarity Search.用于加速分子相似性搜索的索引算法基准测试。
J Chem Inf Model. 2020 Dec 28;60(12):6167-6184. doi: 10.1021/acs.jcim.0c00393. Epub 2020 Oct 23.
10
Single Image Super-Resolution Quality Assessment: A Real-World Dataset, Subjective Studies, and an Objective Metric.单图像超分辨率质量评估:真实世界数据集、主观研究和客观指标。
IEEE Trans Image Process. 2022;31:2279-2294. doi: 10.1109/TIP.2022.3154588. Epub 2022 Mar 15.