A Novel Video Salient Object Detection Method via Semi-supervised Motion
Quality Perception
- URL: http://arxiv.org/abs/2008.02966v1
- Date: Fri, 7 Aug 2020 02:58:51 GMT
- Title: A Novel Video Salient Object Detection Method via Semi-supervised Motion
Quality Perception
- Authors: Chenglizhao Chen, Jia Song, Chong Peng, Guodong Wang, Yuming Fang
- Abstract summary: This paper proposes a universal learning scheme to get a further 3% performance improvement for all state-of-the-art (SOTA) methods.
We resort to the "motion quality"---a brand new concept--to select a sub-group of video frames from the original testing set to construct a new training set.
The selected frames in this new training set should all contain high-quality motions, in which the salient objects will have large probability to be successfully detected by the "target SOTA method"
- Score: 52.40934043694379
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Previous video salient object detection (VSOD) approaches have mainly focused
on designing fancy networks to achieve their performance improvements. However,
with the slow-down in development of deep learning techniques recently, it may
become more and more difficult to anticipate another breakthrough via fancy
networks solely. To this end, this paper proposes a universal learning scheme
to get a further 3\% performance improvement for all state-of-the-art (SOTA)
methods. The major highlight of our method is that we resort the "motion
quality"---a brand new concept, to select a sub-group of video frames from the
original testing set to construct a new training set. The selected frames in
this new training set should all contain high-quality motions, in which the
salient objects will have large probability to be successfully detected by the
"target SOTA method"---the one we want to improve. Consequently, we can achieve
a significant performance improvement by using this new training set to start a
new round of network training. During this new round training, the VSOD results
of the target SOTA method will be applied as the pseudo training objectives.
Our novel learning scheme is simple yet effective, and its semi-supervised
methodology may have large potential to inspire the VSOD community in the
future.
Related papers
- A Simple-but-effective Baseline for Training-free Class-Agnostic
Counting [30.792198686654075]
Class-Agnostic Counting (CAC) seeks to accurately count objects in a given image with only a few reference examples.
Recent efforts have shown that it's possible to accomplish this without training by utilizing pre-existing foundation models.
We present a training-free solution that effectively bridges this performance gap, serving as a strong baseline.
arXiv Detail & Related papers (2024-03-03T07:19:50Z) - Reinforcement Learning with Action-Free Pre-Training from Videos [95.25074614579646]
We introduce a framework that learns representations useful for understanding the dynamics via generative pre-training on videos.
Our framework significantly improves both final performances and sample-efficiency of vision-based reinforcement learning.
arXiv Detail & Related papers (2022-03-25T19:44:09Z) - Label, Verify, Correct: A Simple Few Shot Object Detection Method [93.84801062680786]
We introduce a simple pseudo-labelling method to source high-quality pseudo-annotations from a training set.
We present two novel methods to improve the precision of the pseudo-labelling process.
Our method achieves state-of-the-art or second-best performance compared to existing approaches.
arXiv Detail & Related papers (2021-12-10T18:59:06Z) - Activation to Saliency: Forming High-Quality Labels for Unsupervised
Salient Object Detection [54.92703325989853]
We propose a two-stage Activation-to-Saliency (A2S) framework that effectively generates high-quality saliency cues.
No human annotations are involved in our framework during the whole training process.
Our framework reports significant performance compared with existing USOD methods.
arXiv Detail & Related papers (2021-12-07T11:54:06Z) - A Deeper Look at Salient Object Detection: Bi-stream Network with a
Small Training Dataset [62.26677215668959]
We provide a feasible way to construct a novel small-scale training set, which only contains 4K images.
We propose a novel bi-stream network to take full advantage of our proposed small training set.
arXiv Detail & Related papers (2020-08-07T01:24:33Z) - Auto-Rectify Network for Unsupervised Indoor Depth Estimation [119.82412041164372]
We establish that the complex ego-motions exhibited in handheld settings are a critical obstacle for learning depth.
We propose a data pre-processing method that rectifies training images by removing their relative rotations for effective learning.
Our results outperform the previous unsupervised SOTA method by a large margin on the challenging NYUv2 dataset.
arXiv Detail & Related papers (2020-06-04T08:59:17Z) - Incremental Few-Shot Object Detection for Robotics [15.082365880914896]
Class-Incremental Few-Shot Object Detection (CI-FSOD) framework enables deep object detection network to perform effective continual learning from just few-shot samples.
Our framework is simple yet effective and outperforms the previous SOTA with a significant margin of 2.4 points in AP performance.
arXiv Detail & Related papers (2020-05-06T08:05:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.