Ma Mingshuo, Gui Zhiming, Gao Zhenji, Wang Bin
Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China.
Integrated Natural Resources Survey Center, CGS, No. 55 Yard, Honglian South Road, Xicheng District, Beijing 100055, China.
Sensors (Basel). 2024 Jun 25;24(13):4127. doi: 10.3390/s24134127.
Rock image classification represents a challenging fine-grained image classification task characterized by subtle differences among closely related rock categories. Current contrastive learning methods prevalently utilized in fine-grained image classification restrict the model's capacity to discern critical features contrastively from image pairs, and are typically too large for deployment on mobile devices used for in situ rock identification. In this work, we introduce an innovative and compact model generation framework anchored by the design of a Feature Positioning Comparison Network (FPCN). The FPCN facilitates interaction between feature vectors from localized regions within image pairs, capturing both shared and distinctive features. Further, it accommodates the variable scales of objects depicted in images, which correspond to differing quantities of inherent object information, directing the network's attention to additional contextual details based on object size variability. Leveraging knowledge distillation, the architecture is streamlined, with a focus on nuanced information at activation boundaries to master the precise fine-grained decision boundaries, thereby enhancing the small model's accuracy. Empirical evidence demonstrates that our proposed method based on FPCN improves the classification accuracy mobile lightweight models by nearly 2% while maintaining the same time and space consumption.
岩石图像分类是一项具有挑战性的细粒度图像分类任务,其特点是密切相关的岩石类别之间存在细微差异。目前在细粒度图像分类中普遍使用的对比学习方法限制了模型从图像对中对比辨别关键特征的能力,并且通常规模过大,无法部署在用于现场岩石识别的移动设备上。在这项工作中,我们引入了一个创新的紧凑型模型生成框架,该框架以特征定位比较网络(FPCN)的设计为基础。FPCN促进了图像对中局部区域的特征向量之间的交互,捕捉共享特征和独特特征。此外,它适应图像中所描绘物体的可变尺度,这些尺度对应于不同数量的固有物体信息,并根据物体大小的变化引导网络关注额外的上下文细节。利用知识蒸馏,该架构得以简化,重点关注激活边界处的细微信息,以掌握精确的细粒度决策边界,从而提高小型模型的准确性。经验证据表明,我们基于FPCN提出的方法在保持相同时间和空间消耗的同时,将移动轻量级模型的分类准确率提高了近2%。