Zhou Junwu, Ren Fuji
School of Higher Vocational and Technical College, Shanghai Dianji University, Shanghai, 201306, China.
College of Computer Sciences, Anhui University, Hefei, 230039, China.
Sci Rep. 2025 Jan 4;15(1):739. doi: 10.1038/s41598-024-84181-x.
Decoding the semantic categories of complex sceneries is fundamental to numerous artificial intelligence (AI) infrastructures. This work presents an advanced selection of multi-channel perceptual visual features for recognizing scenic images with elaborate spatial structures, focusing on developing a deep hierarchical model dedicated to learning human gaze behavior. Utilizing the BING objectness measure, we efficiently localize objects or their details across varying scales within scenes. To emulate humans observing semantically or visually significant areas within scenes, we propose a robust deep active learning (RDAL) strategy. This strategy progressively generates gaze shifting paths (GSP) and calculates deep GSP representations within a unified architecture. A notable advantage of RDAL is the robustness to label noise, which is implemented by a carefully-designed sparse penalty term. This mechanism ensures that irrelevant or misleading deep GSP features are intelligently discarded. Afterward, a novel Hessian-regularized Feature Selector (HFS) is proposed to select high-quality features from the deep GSP features, wherein (i) the spatial composition of scenic patches can be optimally maintained, and (ii) a linear SVM is learned simultaneously. Empirical evaluations across six standard scenic datasets demonstrated our method's superior performance, highlighting its exceptional ability to differentiate various sophisticated scenery categories.
解码复杂场景的语义类别是众多人工智能(AI)基础设施的基础。这项工作提出了一种先进的多通道感知视觉特征选择方法,用于识别具有精细空间结构的风景图像,重点是开发一种致力于学习人类注视行为的深度层次模型。利用BING目标性度量,我们能够在场景中跨不同尺度有效地定位物体或其细节。为了模拟人类观察场景中语义或视觉上显著的区域,我们提出了一种稳健的深度主动学习(RDAL)策略。该策略逐步生成注视转移路径(GSP),并在统一架构内计算深度GSP表示。RDAL的一个显著优点是对标签噪声具有鲁棒性,这是通过精心设计的稀疏惩罚项实现的。这种机制确保无关或误导性的深度GSP特征被智能地丢弃。之后,提出了一种新颖的黑塞正则化特征选择器(HFS),从深度GSP特征中选择高质量特征,其中(i)风景斑块的空间组成可以得到最佳保持,(ii)同时学习线性支持向量机。在六个标准风景数据集上的实证评估证明了我们方法的优越性能,突出了其区分各种复杂风景类别的卓越能力。