Denoised Non-Local Neural Network for Semantic Segmentation
- URL: http://arxiv.org/abs/2110.14200v1
- Date: Wed, 27 Oct 2021 06:16:31 GMT
- Title: Denoised Non-Local Neural Network for Semantic Segmentation
- Authors: Qi Song, Jie Li, Hao Guo, Rui Huang
- Abstract summary: We propose a Denoised Non-Local Network (Denoised NL) to eliminate the inter-class and intra-class noises respectively.
Our proposed NL can achieve the state-of-the-art performance of 83.5% and 46.69% mIoU on Cityscapes and ADE20K, respectively.
- Score: 18.84185406522064
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The non-local network has become a widely used technique for semantic
segmentation, which computes an attention map to measure the relationships of
each pixel pair. However, most of the current popular non-local models tend to
ignore the phenomenon that the calculated attention map appears to be very
noisy, containing inter-class and intra-class inconsistencies, which lowers the
accuracy and reliability of the non-local methods. In this paper, we
figuratively denote these inconsistencies as attention noises and explore the
solutions to denoise them. Specifically, we inventively propose a Denoised
Non-Local Network (Denoised NL), which consists of two primary modules, i.e.,
the Global Rectifying (GR) block and the Local Retention (LR) block, to
eliminate the inter-class and intra-class noises respectively. First, GR adopts
the class-level predictions to capture a binary map to distinguish whether the
selected two pixels belong to the same category. Second, LR captures the
ignored local dependencies and further uses them to rectify the unwanted
hollows in the attention map. The experimental results on two challenging
semantic segmentation datasets demonstrate the superior performance of our
model. Without any external training data, our proposed Denoised NL can achieve
the state-of-the-art performance of 83.5\% and 46.69\% mIoU on Cityscapes and
ADE20K, respectively.
Related papers
- Improving Weakly-Supervised Object Localization Using Adversarial Erasing and Pseudo Label [7.400926717561454]
This paper investigates a framework for weakly-supervised object localization.
It aims to train a neural network capable of predicting both the object class and its location using only images and their image-level class labels.
arXiv Detail & Related papers (2024-04-15T06:02:09Z) - Learning Enriched Features via Selective State Spaces Model for Efficient Image Deblurring [0.0]
Image deblurring aims to restore a high-quality image from its corresponding blurred.
We propose an efficient image deblurring network that leverages selective state spaces model to aggregate enriched and accurate features.
Experimental results demonstrate that the proposed method outperforms state-of-the-art approaches on widely used benchmarks.
arXiv Detail & Related papers (2024-03-29T10:40:41Z) - Background Activation Suppression for Weakly Supervised Object
Localization and Semantic Segmentation [84.62067728093358]
Weakly supervised object localization and semantic segmentation aim to localize objects using only image-level labels.
New paradigm has emerged by generating a foreground prediction map to achieve pixel-level localization.
This paper presents two astonishing experimental observations on the object localization learning process.
arXiv Detail & Related papers (2023-09-22T15:44:10Z) - Rethinking the Localization in Weakly Supervised Object Localization [51.29084037301646]
Weakly supervised object localization (WSOL) is one of the most popular and challenging tasks in computer vision.
Recent dividing WSOL into two parts (class-agnostic object localization and object classification) has become the state-of-the-art pipeline for this task.
We propose to replace SCR with a binary-class detector (BCD) for localizing multiple objects, where the detector is trained by discriminating the foreground and background.
arXiv Detail & Related papers (2023-08-11T14:38:51Z) - Learning Confident Classifiers in the Presence of Label Noise [5.829762367794509]
This paper proposes a probabilistic model for noisy observations that allows us to build a confident classification and segmentation models.
Our experiments show that our algorithm outperforms state-of-the-art solutions for the considered classification and segmentation problems.
arXiv Detail & Related papers (2023-01-02T04:27:25Z) - Divide and Contrast: Source-free Domain Adaptation via Adaptive
Contrastive Learning [122.62311703151215]
Divide and Contrast (DaC) aims to connect the good ends of both worlds while bypassing their limitations.
DaC divides the target data into source-like and target-specific samples, where either group of samples is treated with tailored goals.
We further align the source-like domain with the target-specific samples using a memory bank-based Maximum Mean Discrepancy (MMD) loss to reduce the distribution mismatch.
arXiv Detail & Related papers (2022-11-12T09:21:49Z) - Adaptive Local-Component-aware Graph Convolutional Network for One-shot
Skeleton-based Action Recognition [54.23513799338309]
We present an Adaptive Local-Component-aware Graph Convolutional Network for skeleton-based action recognition.
Our method provides a stronger representation than the global embedding and helps our model reach state-of-the-art.
arXiv Detail & Related papers (2022-09-21T02:33:07Z) - Semi-supervised Domain Adaptive Structure Learning [72.01544419893628]
Semi-supervised domain adaptation (SSDA) is a challenging problem requiring methods to overcome both 1) overfitting towards poorly annotated data and 2) distribution shift across domains.
We introduce an adaptive structure learning method to regularize the cooperation of SSL and DA.
arXiv Detail & Related papers (2021-12-12T06:11:16Z) - DMN4: Few-shot Learning via Discriminative Mutual Nearest Neighbor
Neural Network [40.18613107849442]
Few-shot learning aims to classify images under low-data regimes.
Recent work has achieved promising performances by using deep descriptors.
We propose a Mutual Nearest Neighbor (MNN) relation to explicitly select the query descriptors that are most relevant to each task.
arXiv Detail & Related papers (2021-03-15T06:57:09Z) - Region-based Non-local Operation for Video Classification [11.746833714322154]
This paper presents region-based non-local (RNL) operations as a family of self-attention mechanisms.
By combining a channel attention module with the proposed RNL, we design an attention chain, which can be integrated into the off-the-shelf CNNs for end-to-end training.
The experimental results of our method outperform other attention mechanisms, and we achieve state-of-the-art performance on the Something-Something V1 dataset.
arXiv Detail & Related papers (2020-07-17T14:57:05Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.