• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于动态实例查询的全景图像分割方法

Panoptic Image Segmentation Method Based on Dynamic Instance Query.

作者信息

Yang Lanshi, Wang Shiguo, Teng Shuhua

机构信息

School of Computer Science and Technology, Changsha University of Science and Technology, Changsha 410076, China.

School of Electronic Information, Hunan First Normal University, Changsha 410205, China.

出版信息

Sensors (Basel). 2025 May 5;25(9):2919. doi: 10.3390/s25092919.

DOI:10.3390/s25092919
PMID:40363356
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12074491/
Abstract

Panoptic segmentation, as a key task in the field of computer vision, holds significant importance in practical applications such as autonomous driving and robot vision. Currently, among deep-learning-based panoptic segmentation methods, query-based methods have received widespread attention. However, existing methods, such as Mask2Former, typically rely on a static query mechanism. This makes it difficult for the model to adapt to changes in the number of instances in different scenes and can lead to instance loss or confusion, thus limiting performance in complex scenes. Furthermore, it is prone to insufficient feature extraction and a loss of global information. To address these problems, this paper proposes a panoptic segmentation method based on dynamic instance queries (PSM-DIQ). PSM-DIQ uses a multi-dimensional attention mechanism to enhance feature extraction, utilizes instance-activation-guided dynamic query generation to improve the ability to distinguish between different instances, and optimizes pixel-query interactions through a dual-path Transformer decoder. Experiments on the Cityscapes and MS COCO datasets show that, based on the ResNet-50 backbone, PSM-DIQ significantly outperforms the Mask2Former baseline, with PQ values improving by 1.8 and 1.7 percentage points, respectively. The experimental results verify the effectiveness of PSM-DIQ in complex scene panoptic segmentation. Finally, this work will be released as an open-source software package on GitHub (v1.0).

摘要

全景分割作为计算机视觉领域的一项关键任务,在自动驾驶和机器人视觉等实际应用中具有重要意义。目前,在基于深度学习的全景分割方法中,基于查询的方法受到了广泛关注。然而,现有的方法,如Mask2Former,通常依赖于静态查询机制。这使得模型难以适应不同场景中实例数量的变化,并可能导致实例丢失或混淆,从而限制了在复杂场景中的性能。此外,它还容易出现特征提取不足和全局信息丢失的问题。为了解决这些问题,本文提出了一种基于动态实例查询的全景分割方法(PSM-DIQ)。PSM-DIQ使用多维注意力机制来增强特征提取,利用实例激活引导的动态查询生成来提高区分不同实例的能力,并通过双路径Transformer解码器优化像素查询交互。在Cityscapes和MS COCO数据集上的实验表明,基于ResNet-50主干,PSM-DIQ显著优于Mask2Former基线,PQ值分别提高了1.8和1.7个百分点。实验结果验证了PSM-DIQ在复杂场景全景分割中的有效性。最后,这项工作将作为一个开源软件包在GitHub上发布(v1.0)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/a24c3a172cbc/sensors-25-02919-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/06b8473fbf70/sensors-25-02919-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/5827ff4cb4ab/sensors-25-02919-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/0fcd2ddc6ec8/sensors-25-02919-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/b9c1e3c085a0/sensors-25-02919-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/148c9083b6b9/sensors-25-02919-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/960675d96ced/sensors-25-02919-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/7eff85b579b8/sensors-25-02919-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/3cdcc58ea8cc/sensors-25-02919-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/a24c3a172cbc/sensors-25-02919-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/06b8473fbf70/sensors-25-02919-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/5827ff4cb4ab/sensors-25-02919-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/0fcd2ddc6ec8/sensors-25-02919-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/b9c1e3c085a0/sensors-25-02919-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/148c9083b6b9/sensors-25-02919-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/960675d96ced/sensors-25-02919-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/7eff85b579b8/sensors-25-02919-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/3cdcc58ea8cc/sensors-25-02919-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9938/12074491/a24c3a172cbc/sensors-25-02919-g009.jpg

相似文献

1
Panoptic Image Segmentation Method Based on Dynamic Instance Query.基于动态实例查询的全景图像分割方法
Sensors (Basel). 2025 May 5;25(9):2919. doi: 10.3390/s25092919.
2
Dense Pixel-Level Interpretation of Dynamic Scenes With Video Panoptic Segmentation.基于视频全景分割的动态场景密集像素级解读
IEEE Trans Image Process. 2022;31:5383-5395. doi: 10.1109/TIP.2022.3183440. Epub 2022 Aug 17.
3
Fast Panoptic Segmentation with Soft Attention Embeddings.快速全景分割的软注意嵌入。
Sensors (Basel). 2022 Jan 20;22(3):783. doi: 10.3390/s22030783.
4
Enhancing Query Formulation for Universal Image Segmentation.增强通用图像分割的查询公式制定
Sensors (Basel). 2024 Mar 14;24(6):1879. doi: 10.3390/s24061879.
5
Cascade contour-enhanced panoptic segmentation for robotic vision perception.用于机器人视觉感知的级联轮廓增强全景分割
Front Neurorobot. 2024 Oct 21;18:1489021. doi: 10.3389/fnbot.2024.1489021. eCollection 2024.
6
CompleteInst: An Efficient Instance Segmentation Network for Missed Detection Scene of Autonomous Driving.CompleteInst:一种用于自动驾驶漏检场景的高效实例分割网络。
Sensors (Basel). 2023 Nov 10;23(22):9102. doi: 10.3390/s23229102.
7
Instance and Panoptic Segmentation Using Conditional Convolutions.使用条件卷积的实例分割与全景分割
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):669-680. doi: 10.1109/TPAMI.2022.3145407. Epub 2022 Dec 5.
8
Mask-Transformer-Based Networks for Teeth Segmentation in Panoramic Radiographs.基于掩码变压器的全景X光片中牙齿分割网络
Bioengineering (Basel). 2023 Jul 17;10(7):843. doi: 10.3390/bioengineering10070843.
9
Instance Motion Tendency Learning for Video Panoptic Segmentation.用于视频全景分割的实例运动趋势学习
IEEE Trans Image Process. 2023;32:764-778. doi: 10.1109/TIP.2022.3226414. Epub 2023 Jan 18.
10
Panoptic-PartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation.全景部分Former++:用于全景部分分割的统一解耦视图
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):11087-11103. doi: 10.1109/TPAMI.2024.3453916. Epub 2024 Nov 6.

本文引用的文献

1
Cascade contour-enhanced panoptic segmentation for robotic vision perception.用于机器人视觉感知的级联轮廓增强全景分割
Front Neurorobot. 2024 Oct 21;18:1489021. doi: 10.3389/fnbot.2024.1489021. eCollection 2024.
2
IDNet: Information Decomposition Network for Fast Panoptic Segmentation.IDNet:用于快速全景分割的信息分解网络。
IEEE Trans Image Process. 2024;33:1487-1496. doi: 10.1109/TIP.2023.3234499. Epub 2024 Feb 21.
3
Image Segmentation Using Deep Learning: A Survey.基于深度学习的图像分割技术综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3523-3542. doi: 10.1109/TPAMI.2021.3059968. Epub 2022 Jun 3.