Paul Sneha, Patterson Zachary, Bouguila Nizar
Concordia Institute for Information Systems Engineering (CIISE), Concordia University, Montreal, QC H3G1M8, Canada.
J Imaging. 2024 Mar 15;10(3):71. doi: 10.3390/jimaging10030071.
The application of large field-of-view (FoV) cameras equipped with fish-eye lenses brings notable advantages to various real-world computer vision applications, including autonomous driving. While deep learning has proven successful in conventional computer vision applications using regular perspective images, its potential in fish-eye camera contexts remains largely unexplored due to limited datasets for fully supervised learning. Semi-supervised learning comes as a potential solution to manage this challenge. In this study, we explore and benchmark two popular semi-supervised methods from the perspective image domain for fish-eye image segmentation. We further introduce FishSegSSL, a novel fish-eye image segmentation framework featuring three semi-supervised components: pseudo-label filtering, dynamic confidence thresholding, and robust strong augmentation. Evaluation on the WoodScape dataset, collected from vehicle-mounted fish-eye cameras, demonstrates that our proposed method enhances the model's performance by up to 10.49% over fully supervised methods using the same amount of labeled data. Our method also improves the existing image segmentation methods by 2.34%. To the best of our knowledge, this is the first work on semi-supervised semantic segmentation on fish-eye images. Additionally, we conduct a comprehensive ablation study and sensitivity analysis to showcase the efficacy of each proposed method in this research.
配备鱼眼镜头的大视野(FoV)相机的应用为包括自动驾驶在内的各种现实世界计算机视觉应用带来了显著优势。虽然深度学习在使用常规透视图像的传统计算机视觉应用中已被证明是成功的,但由于用于完全监督学习的数据集有限,其在鱼眼相机环境中的潜力在很大程度上仍未得到探索。半监督学习成为应对这一挑战的潜在解决方案。在本研究中,我们从透视图像领域的角度探索并对两种流行的半监督方法进行基准测试,用于鱼眼图像分割。我们进一步引入了FishSegSSL,这是一个新颖的鱼眼图像分割框架,具有三个半监督组件:伪标签过滤、动态置信度阈值处理和强大的增强。对从车载鱼眼相机收集的WoodScape数据集进行的评估表明,我们提出的方法在使用相同数量的标注数据时,比完全监督方法将模型性能提高了高达10.49%。我们的方法还将现有的图像分割方法提高了2.34%。据我们所知,这是第一项关于鱼眼图像半监督语义分割的工作。此外,我们进行了全面的消融研究和敏感性分析,以展示本研究中提出的每种方法的有效性。