Weakly-Supervised Salient Object Detection Using Point Supervison
- URL: http://arxiv.org/abs/2203.11652v1
- Date: Tue, 22 Mar 2022 12:16:05 GMT
- Title: Weakly-Supervised Salient Object Detection Using Point Supervison
- Authors: Shuyong Gao, Wei Zhang, Yan Wang, Qianyu Guo, Chenglong Zhang, Yangji
He, Wenqiang Zhang
- Abstract summary: Current state-of-the-art saliency detection models rely heavily on large datasets of accurate pixel-wise annotations.
We propose a novel weakly-supervised salient object detection method using point supervision.
Our method outperforms the previous state-of-the-art methods trained with the stronger supervision.
- Score: 17.88596733603456
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Current state-of-the-art saliency detection models rely heavily on large
datasets of accurate pixel-wise annotations, but manually labeling pixels is
time-consuming and labor-intensive. There are some weakly supervised methods
developed for alleviating the problem, such as image label, bounding box label,
and scribble label, while point label still has not been explored in this
field. In this paper, we propose a novel weakly-supervised salient object
detection method using point supervision. To infer the saliency map, we first
design an adaptive masked flood filling algorithm to generate pseudo labels.
Then we develop a transformer-based point-supervised saliency detection model
to produce the first round of saliency maps. However, due to the sparseness of
the label, the weakly supervised model tends to degenerate into a general
foreground detection model. To address this issue, we propose a Non-Salient
Suppression (NSS) method to optimize the erroneous saliency maps generated in
the first round and leverage them for the second round of training. Moreover,
we build a new point-supervised dataset (P-DUTS) by relabeling the DUTS
dataset. In P-DUTS, there is only one labeled point for each salient object.
Comprehensive experiments on five largest benchmark datasets demonstrate our
method outperforms the previous state-of-the-art methods trained with the
stronger supervision and even surpass several fully supervised state-of-the-art
models. The code is available at: https://github.com/shuyonggao/PSOD.
Related papers
- Sparse Generation: Making Pseudo Labels Sparse for weakly supervision with points [2.2241974678268903]
We consider the generation of weakly supervised pseudo labels as the result of model's sparse output.
We propose a method called Sparse Generation to make pseudo labels sparse.
arXiv Detail & Related papers (2024-03-28T10:42:49Z) - LabelFormer: Object Trajectory Refinement for Offboard Perception from
LiDAR Point Clouds [37.87496475959941]
"Auto-labelling" offboard perception models are trained to automatically generate annotations from raw LiDAR point clouds.
We propose LabelFormer, a simple, efficient, and effective trajectory-level refinement approach.
Our approach first encodes each frame's observations separately, then exploits self-attention to reason about the trajectory with full temporal context.
arXiv Detail & Related papers (2023-11-02T17:56:06Z) - Learning Remote Sensing Object Detection with Single Point Supervision [17.12725535531483]
Pointly Supervised Object Detection (PSOD) has attracted considerable interests due to its lower labeling cost as compared to box-level supervised object detection.
We make the first attempt to achieve RS object detection with single point supervision, and propose a PSOD method tailored for RS images.
Our method can achieve significantly better performance as compared to state-of-the-art image-level and point-level supervised detection methods, and reduce the performance gap between PSOD and box-level supervised object detection.
arXiv Detail & Related papers (2023-05-23T15:06:04Z) - SOOD: Towards Semi-Supervised Oriented Object Detection [57.05141794402972]
This paper proposes a novel Semi-supervised Oriented Object Detection model, termed SOOD, built upon the mainstream pseudo-labeling framework.
Our experiments show that when trained with the two proposed losses, SOOD surpasses the state-of-the-art SSOD methods under various settings on the DOTA-v1.5 benchmark.
arXiv Detail & Related papers (2023-04-10T11:10:42Z) - Augment and Criticize: Exploring Informative Samples for Semi-Supervised
Monocular 3D Object Detection [64.65563422852568]
We improve the challenging monocular 3D object detection problem with a general semi-supervised framework.
We introduce a novel, simple, yet effective Augment and Criticize' framework that explores abundant informative samples from unlabeled data.
The two new detectors, dubbed 3DSeMo_DLE and 3DSeMo_FLEX, achieve state-of-the-art results with remarkable improvements for over 3.5% AP_3D/BEV (Easy) on KITTI.
arXiv Detail & Related papers (2023-03-20T16:28:15Z) - Weakly Supervised Monocular 3D Object Detection using Multi-View
Projection and Direction Consistency [78.76508318592552]
Monocular 3D object detection has become a mainstream approach in automatic driving for its easy application.
Most current methods still rely on 3D point cloud data for labeling the ground truths used in the training phase.
We propose a new weakly supervised monocular 3D objection detection method, which can train the model with only 2D labels marked on images.
arXiv Detail & Related papers (2023-03-15T15:14:00Z) - Unsupervised Object Detection with LiDAR Clues [70.73881791310495]
We present the first practical method for unsupervised object detection with the aid of LiDAR clues.
In our approach, candidate object segments based on 3D point clouds are firstly generated.
Then, an iterative segment labeling process is conducted to assign segment labels and to train a segment labeling network.
The labeling process is carefully designed so as to mitigate the issue of long-tailed and open-ended distribution.
arXiv Detail & Related papers (2020-11-25T18:59:54Z) - Weakly-Supervised Salient Object Detection via Scribble Annotations [54.40518383782725]
We propose a weakly-supervised salient object detection model to learn saliency from scribble labels.
We present a new metric, termed saliency structure measure, to measure the structure alignment of the predicted saliency maps.
Our method not only outperforms existing weakly-supervised/unsupervised methods, but also is on par with several fully-supervised state-of-the-art models.
arXiv Detail & Related papers (2020-03-17T12:59:50Z) - EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement [53.69674636044927]
We present EHSOD, an end-to-end hybrid-supervised object detection system.
It can be trained in one shot on both fully and weakly-annotated data.
It achieves comparable results on multiple object detection benchmarks with only 30% fully-annotated data.
arXiv Detail & Related papers (2020-02-18T08:04:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.