Label Decoupling Framework for Salient Object Detection
- URL: http://arxiv.org/abs/2008.11048v1
- Date: Tue, 25 Aug 2020 14:23:38 GMT
- Title: Label Decoupling Framework for Salient Object Detection
- Authors: Jun Wei, Shuhui Wang, Zhe Wu, Chi Su, Qingming Huang, Qi Tian
- Abstract summary: Recent methods mainly focus on aggregating multi-level features from convolutional network (FCN) and introducing edge information as auxiliary supervision.
We propose a label decoupling framework (LDF) which consists of a label decoupling procedure and a feature interaction network (FIN)
Experiments on six benchmark datasets demonstrate that LDF outperforms state-of-the-art approaches on different evaluation metrics.
- Score: 157.96262922808245
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: To get more accurate saliency maps, recent methods mainly focus on
aggregating multi-level features from fully convolutional network (FCN) and
introducing edge information as auxiliary supervision. Though remarkable
progress has been achieved, we observe that the closer the pixel is to the
edge, the more difficult it is to be predicted, because edge pixels have a very
imbalance distribution. To address this problem, we propose a label decoupling
framework (LDF) which consists of a label decoupling (LD) procedure and a
feature interaction network (FIN). LD explicitly decomposes the original
saliency map into body map and detail map, where body map concentrates on
center areas of objects and detail map focuses on regions around edges. Detail
map works better because it involves much more pixels than traditional edge
supervision. Different from saliency map, body map discards edge pixels and
only pays attention to center areas. This successfully avoids the distraction
from edge pixels during training. Therefore, we employ two branches in FIN to
deal with body map and detail map respectively. Feature interaction (FI) is
designed to fuse the two complementary branches to predict the saliency map,
which is then used to refine the two branches again. This iterative refinement
is helpful for learning better representations and more precise saliency maps.
Comprehensive experiments on six benchmark datasets demonstrate that LDF
outperforms state-of-the-art approaches on different evaluation metrics.
Related papers
- Image-based Geolocalization by Ground-to-2.5D Map Matching [21.21416396311102]
Methods often utilize cross-view localization techniques to match ground-view query images with 2D maps.
We propose a new approach to learning representative embeddings from multi-modal data.
By encoding crucial geometric cues, our method learns discriminative location embeddings for matching panoramic images and maps.
arXiv Detail & Related papers (2023-08-11T08:00:30Z) - NeMO: Neural Map Growing System for Spatiotemporal Fusion in
Bird's-Eye-View and BDD-Map Benchmark [9.430779563669908]
Vision-centric Bird's-Eye View representation is essential for autonomous driving systems.
This work outlines a new paradigm, named NeMO, for generating local maps through the utilization of a readable and writable big map.
With an assumption that the feature distribution of all BEV grids follows an identical pattern, we adopt a shared-weight neural network for all grids to update the big map.
arXiv Detail & Related papers (2023-06-07T15:46:15Z) - Sharp Eyes: A Salient Object Detector Working The Same Way as Human
Visual Characteristics [3.222802562733787]
We propose a sharp eyes network (SENet) that first seperates the object from scene, and then finely segments it.
The proposed method aims to utilize the expanded objects to guide the network obtain complete prediction.
arXiv Detail & Related papers (2023-01-18T11:00:45Z) - Learning Implicit Feature Alignment Function for Semantic Segmentation [51.36809814890326]
Implicit Feature Alignment function (IFA) is inspired by the rapidly expanding topic of implicit neural representations.
We show that IFA implicitly aligns the feature maps at different levels and is capable of producing segmentation maps in arbitrary resolutions.
Our method can be combined with improvement on various architectures, and it achieves state-of-the-art accuracy trade-off on common benchmarks.
arXiv Detail & Related papers (2022-06-17T09:40:14Z) - Disentangle Saliency Detection into Cascaded Detail Modeling and Body
Filling [68.73040261040539]
We propose to decompose the saliency detection task into two cascaded sub-tasks, emphi.e., detail modeling and body filling.
Specifically, the detail modeling focuses on capturing the object edges by supervision of explicitly decomposed detail label.
The body filling learns the body part which will be filled into the detail map to generate more accurate saliency map.
arXiv Detail & Related papers (2022-02-08T19:33:02Z) - Structure-Consistent Weakly Supervised Salient Object Detection with
Local Saliency Coherence [14.79639149658596]
We propose a one-round end-to-end training approach for weakly supervised salient object detection via scribble annotations.
Our method achieves a new state-of-the-art performance on six benchmarks.
arXiv Detail & Related papers (2020-12-08T12:49:40Z) - Rethinking Localization Map: Towards Accurate Object Perception with
Self-Enhancement Maps [78.2581910688094]
This work introduces a novel self-enhancement method to harvest accurate object localization maps and object boundaries with only category labels as supervision.
In particular, the proposed Self-Enhancement Maps achieve the state-of-the-art localization accuracy of 54.88% on ILSVRC.
arXiv Detail & Related papers (2020-06-09T12:35:55Z) - Cross-layer Feature Pyramid Network for Salient Object Detection [102.20031050972429]
We propose a novel Cross-layer Feature Pyramid Network to improve the progressive fusion in salient object detection.
The distributed features per layer own both semantics and salient details from all other layers simultaneously, and suffer reduced loss of important information.
arXiv Detail & Related papers (2020-02-25T14:06:27Z) - Saliency Enhancement using Gradient Domain Edges Merging [65.90255950853674]
We develop a method to merge the edges with the saliency maps to improve the performance of the saliency.
This leads to our proposed saliency enhancement using edges (SEE) with an average improvement of at least 3.4 times higher on the DUT-OMRON dataset.
The SEE algorithm is split into 2 parts, SEE-Pre for preprocessing and SEE-Post pour postprocessing.
arXiv Detail & Related papers (2020-02-11T14:04:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.