• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习的主动学习在遥感场景分类中的应用

Bioinspired Scene Classification by Deep Active Learning With Remote Sensing Applications.

出版信息

IEEE Trans Cybern. 2022 Jul;52(7):5682-5694. doi: 10.1109/TCYB.2020.2981480. Epub 2022 Jul 4.

DOI:10.1109/TCYB.2020.2981480
PMID:33635802
Abstract

Accurately classifying sceneries with different spatial configurations is an indispensable technique in computer vision and intelligent systems, for example, scene parsing, robot motion planning, and autonomous driving. Remarkable performance has been achieved by the deep recognition models in the past decade. As far as we know, however, these deep architectures are incapable of explicitly encoding the human visual perception, that is, the sequence of gaze movements and the subsequent cognitive processes. In this article, a biologically inspired deep model is proposed for scene classification, where the human gaze behaviors are robustly discovered and represented by a unified deep active learning (UDAL) framework. More specifically, to characterize objects' components with varied sizes, an objectness measure is employed to decompose each scenery into a set of semantically aware object patches. To represent each region at a low level, a local-global feature fusion scheme is developed which optimally integrates multimodal features by automatically calculating each feature's weight. To mimic the human visual perception of various sceneries, we develop the UDAL that hierarchically represents the human gaze behavior by recognizing semantically important regions within the scenery. Importantly, UDAL combines the semantically salient region detection and the deep gaze shifting path (GSP) representation learning into a principled framework, where only the partial semantic tags are required. Meanwhile, by incorporating the sparsity penalty, the contaminated/redundant low-level regional features can be intelligently avoided. Finally, the learned deep GSP features from the entire scene images are integrated to form an image kernel machine, which is subsequently fed into a kernel SVM to classify different sceneries. Experimental evaluations on six well-known scenery sets (including remote sensing images) have shown the competitiveness of our approach.

摘要

准确地对具有不同空间配置的场景进行分类是计算机视觉和智能系统中不可或缺的技术,例如场景解析、机器人运动规划和自动驾驶。在过去的十年中,深度识别模型已经取得了显著的性能。然而,据我们所知,这些深度架构无法显式地编码人类的视觉感知,即注视运动的顺序和随后的认知过程。在本文中,我们提出了一种受生物启发的深度模型用于场景分类,其中通过统一的深度主动学习 (UDAL) 框架稳健地发现和表示人类的注视行为。更具体地说,为了用不同大小的对象特征来描述对象的组成部分,我们采用了一种对象度量方法将每个场景分解为一组语义感知的对象补丁。为了在低层次上表示每个区域,我们开发了一种局部-全局特征融合方案,通过自动计算每个特征的权重来最优地整合多模态特征。为了模拟人类对各种场景的视觉感知,我们开发了 UDAL,通过识别场景中的语义重要区域来分层表示人类的注视行为。重要的是,UDAL 将语义突出区域检测和深度注视转移路径 (GSP) 表示学习结合到一个原则框架中,其中只需要部分语义标签。同时,通过引入稀疏惩罚,可以智能地避免污染/冗余的低水平区域特征。最后,将从整个场景图像中学习到的深度 GSP 特征集成在一起形成图像核机器,然后将其输入核支持向量机 (SVM) 以对不同的场景进行分类。在六个著名的场景集(包括遥感图像)上的实验评估表明了我们方法的竞争力。

相似文献

1
Bioinspired Scene Classification by Deep Active Learning With Remote Sensing Applications.基于深度学习的主动学习在遥感场景分类中的应用
IEEE Trans Cybern. 2022 Jul;52(7):5682-5694. doi: 10.1109/TCYB.2020.2981480. Epub 2022 Jul 4.
2
Scene Categorization by Deeply Learning Gaze Behavior in a Semisupervised Context.在半监督环境下通过深度学习注视行为进行场景分类
IEEE Trans Cybern. 2021 Aug;51(8):4265-4276. doi: 10.1109/TCYB.2019.2913016. Epub 2021 Aug 4.
3
Scene Categorization Using Deeply Learned Gaze Shifting Kernel.基于深度学习的注视转移核的场景分类。
IEEE Trans Cybern. 2019 Jun;49(6):2156-2167. doi: 10.1109/TCYB.2018.2820731. Epub 2018 May 11.
4
Deep Active Learning with Contaminated Tags for Image Aesthetics Assessment.用于图像美学评估的带有污染标签的深度主动学习
IEEE Trans Image Process. 2018 Apr 18. doi: 10.1109/TIP.2018.2828326.
5
Deeply Encoding Stable Patterns From Contaminated Data for Scenery Image Recognition.从污染数据中深度编码稳定模式以进行风景图像识别。
IEEE Trans Cybern. 2021 Dec;51(12):5671-5680. doi: 10.1109/TCYB.2019.2951798. Epub 2021 Dec 22.
6
Massive-Scale Aerial Photo Categorization by Cross-Resolution Visual Perception Enhancement.大规模航空影像分类的跨分辨率视觉感知增强方法
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):4017-4030. doi: 10.1109/TNNLS.2021.3055548. Epub 2022 Aug 3.
7
Community-Aware Photo Quality Evaluation by Deeply Encoding Human Perception.基于深度学习的人类感知编码的社区感知图像质量评价
IEEE Trans Cybern. 2022 May;52(5):3136-3146. doi: 10.1109/TCYB.2019.2937319. Epub 2022 May 19.
8
Biologically Inspired Model for Visual Cognition Achieving Unsupervised Episodic and Semantic Feature Learning.受生物启发的视觉认知模型实现无监督的情景和语义特征学习。
IEEE Trans Cybern. 2016 Oct;46(10):2335-2347. doi: 10.1109/TCYB.2015.2476706. Epub 2015 Sep 18.
9
Deep Learning for Feature Extraction in Remote Sensing: A Case-Study of Aerial Scene Classification.深度学习在遥感特征提取中的应用:以航空场景分类为例。
Sensors (Basel). 2020 Jul 14;20(14):3906. doi: 10.3390/s20143906.
10
Adaptive Discriminative Regions Learning Network for Remote Sensing Scene Classification.基于自适应判别区域学习网络的遥感场景分类方法。
Sensors (Basel). 2023 Jan 10;23(2):773. doi: 10.3390/s23020773.

引用本文的文献

1
Multi scale supervised entropy weighted binary pattern for texture classification.用于纹理分类的多尺度监督熵加权二值模式
Sci Rep. 2025 Jul 18;15(1):26087. doi: 10.1038/s41598-025-11245-x.
2
The analysis of optimization in music aesthetic education under artificial intelligence.人工智能视域下音乐审美教育的优化分析
Sci Rep. 2025 Apr 4;15(1):11545. doi: 10.1038/s41598-025-96436-2.
3
Optimizing heart disease diagnosis with advanced machine learning models: a comparison of predictive performance.使用先进机器学习模型优化心脏病诊断:预测性能比较
BMC Cardiovasc Disord. 2025 Mar 22;25(1):212. doi: 10.1186/s12872-025-04627-6.
4
Research on optimization strategies of university ideological and political parenting models under the empowerment of digital intelligence.数字智能赋能下高校思想政治育人模式优化策略研究
Sci Rep. 2025 Mar 13;15(1):8680. doi: 10.1038/s41598-025-92985-8.
5
Design and implementation of a radiomic-driven intelligent dental hospital diversion system utilizing multilabel imaging data.利用多标签成像数据的基于放射组学的智能牙科医院分流系统的设计与实现
J Transl Med. 2024 Dec 20;22(1):1123. doi: 10.1186/s12967-024-05958-2.
6
Facial Image expression recognition and prediction system.面部表情识别与预测系统。
Sci Rep. 2024 Nov 12;14(1):27760. doi: 10.1038/s41598-024-79146-z.
7
A novel binary hashing for agricultural scenery classification.一种用于农业场景分类的新型二进制哈希算法。
Sci Rep. 2024 Nov 11;14(1):27602. doi: 10.1038/s41598-024-77685-z.
8
DeepTool: A deep learning framework for tool wear onset detection and remaining useful life prediction.DeepTool:一种用于刀具磨损起始检测和剩余使用寿命预测的深度学习框架。
MethodsX. 2024 Sep 19;13:102965. doi: 10.1016/j.mex.2024.102965. eCollection 2024 Dec.
9
Intelligence model on sequence-based prediction of PPI using AISSO deep concept with hyperparameter tuning process.基于 AISSO 深度概念和超参数调整过程的序列基 PPI 预测智能模型。
Sci Rep. 2024 Sep 18;14(1):21797. doi: 10.1038/s41598-024-72558-x.
10
Enhancing colorectal cancer histology diagnosis using modified deep neural networks optimizer.使用改进的深度神经网络优化器增强结直肠癌组织学诊断。
Sci Rep. 2024 Aug 22;14(1):19534. doi: 10.1038/s41598-024-69193-x.