IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):8587-8601. doi: 10.1109/TPAMI.2021.3111116. Epub 2022 Oct 4.
Compared to many other dense prediction tasks, e.g., semantic segmentation, it is the arbitrary number of instances that has made instance segmentation much more challenging. In order to predict a mask for each instance, mainstream approaches either follow the "detect-then-segment" strategy (e.g., Mask R-CNN), or predict embedding vectors first then cluster pixels into individual instances. In this paper, we view the task of instance segmentation from a completely new perspective by introducing the notion of "instance categories", which assigns categories to each pixel within an instance according to the instance's location. With this notion, we propose segmenting objects by locations (SOLO), a simple, direct, and fast framework for instance segmentation with strong performance. We derive a few SOLO variants (e.g., Vanilla SOLO, Decoupled SOLO, Dynamic SOLO) following the basic principle. Our method directly maps a raw input image to the desired object categories and instance masks, eliminating the need for the grouping post-processing or the bounding box detection. Our approach achieves state-of-the-art results for instance segmentation in terms of both speed and accuracy, while being considerably simpler than the existing methods. Besides instance segmentation, our method yields state-of-the-art results in object detection (from our mask byproduct) and panoptic segmentation. We further demonstrate the flexibility and high-quality segmentation of SOLO by extending it to perform one-stage instance-level image matting. Code is available at: https://git.io/AdelaiDet.
与许多其他密集预测任务(例如语义分割)相比,正是实例的任意数量使得实例分割变得更加具有挑战性。为了预测每个实例的蒙版,主流方法要么遵循“检测后分割”策略(例如 Mask R-CNN),要么首先预测嵌入向量,然后将像素聚类到各个实例中。在本文中,我们通过引入“实例类别”的概念,从一个全新的角度看待实例分割任务,根据实例的位置为每个实例内的像素分配类别。有了这个概念,我们提出了通过位置分割对象(SOLO),这是一种简单、直接、快速的实例分割框架,具有强大的性能。我们根据基本原理推导出了几个 SOLO 变体(例如,Vanilla SOLO、Decoupled SOLO、Dynamic SOLO)。我们的方法直接将原始输入图像映射到所需的对象类别和实例蒙版,无需进行分组后处理或边界框检测。我们的方法在速度和准确性方面都实现了实例分割的最新水平,同时比现有方法简单得多。除了实例分割,我们的方法在对象检测(来自我们的蒙版副产品)和全景分割方面也取得了最新的结果。我们通过将其扩展到执行单级实例级图像遮罩来进一步证明 SOLO 的灵活性和高质量分割。代码可在:https://git.io/AdelaiDet 获得。