Bidirectional Graph Reasoning Network for Panoptic Segmentation
- URL: http://arxiv.org/abs/2004.06272v1
- Date: Tue, 14 Apr 2020 02:32:10 GMT
- Title: Bidirectional Graph Reasoning Network for Panoptic Segmentation
- Authors: Yangxin Wu, Gengwei Zhang, Yiming Gao, Xiajun Deng, Ke Gong, Xiaodan
Liang, Liang Lin
- Abstract summary: We introduce a Bidirectional Graph Reasoning Network (BGRNet) to mine the intra-modular and intermodular relations within and between foreground things and background stuff classes.
BGRNet first constructs image-specific graphs in both instance and semantic segmentation branches that enable flexible reasoning at the proposal level and class level.
- Score: 126.06251745669107
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent researches on panoptic segmentation resort to a single end-to-end
network to combine the tasks of instance segmentation and semantic
segmentation. However, prior models only unified the two related tasks at the
architectural level via a multi-branch scheme or revealed the underlying
correlation between them by unidirectional feature fusion, which disregards the
explicit semantic and co-occurrence relations among objects and background.
Inspired by the fact that context information is critical to recognize and
localize the objects, and inclusive object details are significant to parse the
background scene, we thus investigate on explicitly modeling the correlations
between object and background to achieve a holistic understanding of an image
in the panoptic segmentation task. We introduce a Bidirectional Graph Reasoning
Network (BGRNet), which incorporates graph structure into the conventional
panoptic segmentation network to mine the intra-modular and intermodular
relations within and between foreground things and background stuff classes. In
particular, BGRNet first constructs image-specific graphs in both instance and
semantic segmentation branches that enable flexible reasoning at the proposal
level and class level, respectively. To establish the correlations between
separate branches and fully leverage the complementary relations between things
and stuff, we propose a Bidirectional Graph Connection Module to diffuse
information across branches in a learnable fashion. Experimental results
demonstrate the superiority of our BGRNet that achieves the new
state-of-the-art performance on challenging COCO and ADE20K panoptic
segmentation benchmarks.
Related papers
- Towards Deeply Unified Depth-aware Panoptic Segmentation with
Bi-directional Guidance Learning [63.63516124646916]
We propose a deeply unified framework for depth-aware panoptic segmentation.
We propose a bi-directional guidance learning approach to facilitate cross-task feature learning.
Our method sets the new state of the art for depth-aware panoptic segmentation on both Cityscapes-DVPS and SemKITTI-DVPS datasets.
arXiv Detail & Related papers (2023-07-27T11:28:33Z) - ConstGCN: Constrained Transmission-based Graph Convolutional Networks
for Document-level Relation Extraction [24.970508961370548]
Document-level relation extraction with graph neural networks faces a fundamental graph construction gap between training and inference.
We propose $textbfConstGCN$, a novel graph convolutional network which performs knowledge-based information propagation between entities.
Experimental results show that our method outperforms the previous state-of-the-art (SOTA) approaches on the DocRE dataset.
arXiv Detail & Related papers (2022-10-08T07:36:04Z) - Open-world Semantic Segmentation via Contrasting and Clustering
Vision-Language Embedding [95.78002228538841]
We propose a new open-world semantic segmentation pipeline that makes the first attempt to learn to segment semantic objects of various open-world categories without any efforts on dense annotations.
Our method can directly segment objects of arbitrary categories, outperforming zero-shot segmentation methods that require data labeling on three benchmark datasets.
arXiv Detail & Related papers (2022-07-18T09:20:04Z) - Tackling the Challenges in Scene Graph Generation with Local-to-Global
Interactions [4.726777092009554]
We seek new insights into the underlying challenges of the Scene Graph Generation (SGG) task.
Motivated by the analysis, we design a novel SGG framework, Local-to-Global Interaction Networks (LOGIN)
Our framework enables predicting the scene graph in a local-to-global manner by design, leveraging the possible complementariness.
arXiv Detail & Related papers (2021-06-16T03:58:21Z) - SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from
Monocular images [94.36401543589523]
We introduce the concept of semantic objectness to exploit the geometric relationship of these two tasks.
We then propose a Semantic Object and Depth Estimation Network (SOSD-Net) based on the objectness assumption.
To the best of our knowledge, SOSD-Net is the first network that exploits the geometry constraint for simultaneous monocular depth estimation and semantic segmentation.
arXiv Detail & Related papers (2021-01-19T02:41:03Z) - CoADNet: Collaborative Aggregation-and-Distribution Networks for
Co-Salient Object Detection [91.91911418421086]
Co-Salient Object Detection (CoSOD) aims at discovering salient objects that repeatedly appear in a given query group containing two or more relevant images.
One challenging issue is how to effectively capture co-saliency cues by modeling and exploiting inter-image relationships.
We present an end-to-end collaborative aggregation-and-distribution network (CoADNet) to capture both salient and repetitive visual patterns from multiple images.
arXiv Detail & Related papers (2020-11-10T04:28:11Z) - Cascaded Human-Object Interaction Recognition [175.60439054047043]
We introduce a cascade architecture for a multi-stage, coarse-to-fine HOI understanding.
At each stage, an instance localization network progressively refines HOI proposals and feeds them into an interaction recognition network.
With our carefully-designed human-centric relation features, these two modules work collaboratively towards effective interaction understanding.
arXiv Detail & Related papers (2020-03-09T17:05:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.