EPSNet: Efficient Panoptic Segmentation Network with Cross-layer
  Attention Fusion
        - URL: http://arxiv.org/abs/2003.10142v3
- Date: Thu, 24 Dec 2020 05:15:41 GMT
- Title: EPSNet: Efficient Panoptic Segmentation Network with Cross-layer
  Attention Fusion
- Authors: Chia-Yuan Chang, Shuo-En Chang, Pei-Yung Hsiao, and Li-Chen Fu
- Abstract summary: We propose an Efficient Panoptic Network (EPSNet) to tackle the panoptic segmentation tasks with fast inference speed.
Basically, EPSNet generates masks based on simple linear combination of prototype masks and mask coefficients.
To enhance the quality of shared prototypes, we adopt a module called "cross-layer attention fusion module"
- Score: 5.815742965809424
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Panoptic segmentation is a scene parsing task which unifies semantic
segmentation and instance segmentation into one single task. However, the
current state-of-the-art studies did not take too much concern on inference
time. In this work, we propose an Efficient Panoptic Segmentation Network
(EPSNet) to tackle the panoptic segmentation tasks with fast inference speed.
Basically, EPSNet generates masks based on simple linear combination of
prototype masks and mask coefficients. The light-weight network branches for
instance segmentation and semantic segmentation only need to predict mask
coefficients and produce masks with the shared prototypes predicted by
prototype network branch. Furthermore, to enhance the quality of shared
prototypes, we adopt a module called "cross-layer attention fusion module",
which aggregates the multi-scale features with attention mechanism helping them
capture the long-range dependencies between each other. To validate the
proposed work, we have conducted various experiments on the challenging COCO
panoptic dataset, which achieve highly promising performance with significantly
faster inference speed (53ms on GPU).
 
      
        Related papers
        - Bridge the Points: Graph-based Few-shot Segment Anything Semantically [79.1519244940518]
 Recent advancements in pre-training techniques have enhanced the capabilities of vision foundation models.
Recent studies extend the SAM to Few-shot Semantic segmentation (FSS)
We propose a simple yet effective approach based on graph analysis.
 arXiv  Detail & Related papers  (2024-10-09T15:02:28Z)
- The revenge of BiSeNet: Efficient Multi-Task Image Segmentation [6.172605433695617]
 BiSeNetFormer is a novel architecture for efficient multi-task image segmentation.
By seamlessly supporting multiple tasks, BiSeNetFormer offers a versatile solution for multi-task segmentation.
Our results indicate that BiSeNetFormer represents a significant advancement towards fast, efficient, and multi-task segmentation networks.
 arXiv  Detail & Related papers  (2024-04-15T08:32:18Z)
- Cross-CBAM: A Lightweight network for Scene Segmentation [2.064612766965483]
 We present the Cross-CBAM network, a novel lightweight network for real-time semantic segmentation.
In experiments on the Cityscapes dataset and Camvid dataset, we achieve 73.4% mIoU with a speed of 240.9FPS and 77.2% mIoU with a speed of 88.6FPS on NVIDIA GTX 1080Ti.
 arXiv  Detail & Related papers  (2023-06-04T09:03:05Z)
- Beyond the Prototype: Divide-and-conquer Proxies for Few-shot
  Segmentation [63.910211095033596]
 Few-shot segmentation aims to segment unseen-class objects given only a handful of densely labeled samples.
We propose a simple yet versatile framework in the spirit of divide-and-conquer.
Our proposed approach, named divide-and-conquer proxies (DCP), allows for the development of appropriate and reliable information.
 arXiv  Detail & Related papers  (2022-04-21T06:21:14Z)
- Semantic Attention and Scale Complementary Network for Instance
  Segmentation in Remote Sensing Images [54.08240004593062]
 We propose an end-to-end multi-category instance segmentation model, which consists of a Semantic Attention (SEA) module and a Scale Complementary Mask Branch (SCMB)
SEA module contains a simple fully convolutional semantic segmentation branch with extra supervision to strengthen the activation of interest instances on the feature map.
SCMB extends the original single mask branch to trident mask branches and introduces complementary mask supervision at different scales.
 arXiv  Detail & Related papers  (2021-07-25T08:53:59Z)
- SE-PSNet: Silhouette-based Enhancement Feature for Panoptic Segmentation
  Network [5.353718408751182]
 We propose a solution to tackle the panoptic segmentation task.
The structure combines the bottom-up method and the top-down method.
The network mainly pays attention to the quality of the mask.
 arXiv  Detail & Related papers  (2021-07-11T17:20:32Z)
- Prototypical Cross-Attention Networks for Multiple Object Tracking and
  Segmentation [95.74244714914052]
 Multiple object tracking and segmentation requires detecting, tracking, and segmenting objects belonging to a set of given classes.
We propose Prototypical Cross-Attention Network (PCAN), capable of leveraging rich-temporal information online.
PCAN outperforms current video instance tracking and segmentation competition winners on Youtube-VIS and BDD100K datasets.
 arXiv  Detail & Related papers  (2021-06-22T17:57:24Z)
- CRNet: Cross-Reference Networks for Few-Shot Segmentation [59.85183776573642]
 Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.
With a cross-reference mechanism, our network can better find the co-occurrent objects in the two images.
Experiments on the PASCAL VOC 2012 dataset show that our network achieves state-of-the-art performance.
 arXiv  Detail & Related papers  (2020-03-24T04:55:43Z)
- PointINS: Point-based Instance Segmentation [117.38579097923052]
 Mask representation in instance segmentation with Point-of-Interest (PoI) features is challenging because learning a high-dimensional mask feature for each instance requires a heavy computing burden.
We propose an instance-aware convolution, which decomposes this mask representation learning task into two tractable modules.
Along with instance-aware convolution, we propose PointINS, a simple and practical instance segmentation approach.
 arXiv  Detail & Related papers  (2020-03-13T08:24:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.