Disentangle Saliency Detection into Cascaded Detail Modeling and Body
Filling
- URL: http://arxiv.org/abs/2202.04112v1
- Date: Tue, 8 Feb 2022 19:33:02 GMT
- Title: Disentangle Saliency Detection into Cascaded Detail Modeling and Body
Filling
- Authors: Yue Song, Hao Tang, Nicu Sebe, Wei Wang
- Abstract summary: We propose to decompose the saliency detection task into two cascaded sub-tasks, emphi.e., detail modeling and body filling.
Specifically, the detail modeling focuses on capturing the object edges by supervision of explicitly decomposed detail label.
The body filling learns the body part which will be filled into the detail map to generate more accurate saliency map.
- Score: 68.73040261040539
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Salient object detection has been long studied to identify the most visually
attractive objects in images/videos. Recently, a growing amount of approaches
have been proposed all of which rely on the contour/edge information to improve
detection performance. The edge labels are either put into the loss directly or
used as extra supervision. The edge and body can also be learned separately and
then fused afterward. Both methods either lead to high prediction errors near
the edge or cannot be trained in an end-to-end manner. Another problem is that
existing methods may fail to detect objects of various sizes due to the lack of
efficient and effective feature fusion mechanisms. In this work, we propose to
decompose the saliency detection task into two cascaded sub-tasks, \emph{i.e.},
detail modeling and body filling. Specifically, the detail modeling focuses on
capturing the object edges by supervision of explicitly decomposed detail label
that consists of the pixels that are nested on the edge and near the edge. Then
the body filling learns the body part which will be filled into the detail map
to generate more accurate saliency map. To effectively fuse the features and
handle objects at different scales, we have also proposed two novel multi-scale
detail attention and body attention blocks for precise detail and body
modeling. Experimental results show that our method achieves state-of-the-art
performances on six public datasets.
Related papers
- Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection [24.00828999360765]
This paper addresses the challenge of robotic grasping of general objects.
The proposed model first runs by proposing a number of most likely grasp points in the scene.
Around each grasp point, a module is designed to infer any voxel in its neighborhood to be either void or occupied by some object.
The model further estimates 6-DoF grasp poses utilizing the local occupancy-enhanced object shape information.
arXiv Detail & Related papers (2024-07-22T16:22:28Z) - Few-shot Oriented Object Detection with Memorable Contrastive Learning in Remote Sensing Images [11.217630579076237]
Few-shot object detection (FSOD) has garnered significant research attention in the field of remote sensing.
We propose a novel FSOD method for remote sensing images called Few-shot Oriented object detection with Memorable Contrastive learning (FOMC)
Specifically, we employ oriented bounding boxes instead of traditional horizontal bounding boxes to learn a better feature representation for arbitrary-oriented aerial objects.
arXiv Detail & Related papers (2024-03-20T08:15:18Z) - Joint Perceptual Learning for Enhancement and Object Detection in
Underwater Scenarios [41.34564703212461]
We propose a bilevel optimization formulation for jointly learning underwater object detection and image enhancement.
Our method outputs visually favoring images and higher detection accuracy.
arXiv Detail & Related papers (2023-07-07T11:54:06Z) - Sharp Eyes: A Salient Object Detector Working The Same Way as Human
Visual Characteristics [3.222802562733787]
We propose a sharp eyes network (SENet) that first seperates the object from scene, and then finely segments it.
The proposed method aims to utilize the expanded objects to guide the network obtain complete prediction.
arXiv Detail & Related papers (2023-01-18T11:00:45Z) - High-resolution Iterative Feedback Network for Camouflaged Object
Detection [128.893782016078]
Spotting camouflaged objects that are visually assimilated into the background is tricky for object detection algorithms.
We aim to extract the high-resolution texture details to avoid the detail degradation that causes blurred vision in edges and boundaries.
We introduce a novel HitNet to refine the low-resolution representations by high-resolution features in an iterative feedback manner.
arXiv Detail & Related papers (2022-03-22T11:20:21Z) - Ensembling object detectors for image and video data analysis [98.26061123111647]
We propose a method for ensembling the outputs of multiple object detectors for improving detection performance and precision of bounding boxes on image data.
We extend it to video data by proposing a two-stage tracking-based scheme for detection refinement.
arXiv Detail & Related papers (2021-02-09T12:38:16Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z) - Refined Plane Segmentation for Cuboid-Shaped Objects by Leveraging Edge
Detection [63.942632088208505]
We propose a post-processing algorithm to align the segmented plane masks with edges detected in the image.
This allows us to increase the accuracy of state-of-the-art approaches, while limiting ourselves to cuboid-shaped objects.
arXiv Detail & Related papers (2020-03-28T18:51:43Z) - Weakly-Supervised Salient Object Detection via Scribble Annotations [54.40518383782725]
We propose a weakly-supervised salient object detection model to learn saliency from scribble labels.
We present a new metric, termed saliency structure measure, to measure the structure alignment of the predicted saliency maps.
Our method not only outperforms existing weakly-supervised/unsupervised methods, but also is on par with several fully-supervised state-of-the-art models.
arXiv Detail & Related papers (2020-03-17T12:59:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.