Unsupervised Learning of Accurate Siamese Tracking
- URL: http://arxiv.org/abs/2204.01475v1
- Date: Mon, 4 Apr 2022 13:39:43 GMT
- Title: Unsupervised Learning of Accurate Siamese Tracking
- Authors: Qiuhong Shen, Lei Qiao, Jinyang Guo, Peixia Li, Xin Li, Bo Li, Weitao
Feng, Weihao Gan, Wei Wu, Wanli Ouyang
- Abstract summary: We present a novel unsupervised tracking framework, in which we can learn temporal correspondence both on the classification branch and regression branch.
Our tracker outperforms preceding unsupervised methods by a substantial margin, performing on par with supervised methods on large-scale datasets such as TrackingNet and LaSOT.
- Score: 68.58171095173056
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Unsupervised learning has been popular in various computer vision tasks,
including visual object tracking. However, prior unsupervised tracking
approaches rely heavily on spatial supervision from template-search pairs and
are still unable to track objects with strong variation over a long time span.
As unlimited self-supervision signals can be obtained by tracking a video along
a cycle in time, we investigate evolving a Siamese tracker by tracking videos
forward-backward. We present a novel unsupervised tracking framework, in which
we can learn temporal correspondence both on the classification branch and
regression branch. Specifically, to propagate reliable template feature in the
forward propagation process so that the tracker can be trained in the cycle, we
first propose a consistency propagation transformation. We then identify an
ill-posed penalty problem in conventional cycle training in backward
propagation process. Thus, a differentiable region mask is proposed to select
features as well as to implicitly penalize tracking errors on intermediate
frames. Moreover, since noisy labels may degrade training, we propose a
mask-guided loss reweighting strategy to assign dynamic weights based on the
quality of pseudo labels. In extensive experiments, our tracker outperforms
preceding unsupervised methods by a substantial margin, performing on par with
supervised methods on large-scale datasets such as TrackingNet and LaSOT. Code
is available at https://github.com/FlorinShum/ULAST.
Related papers
- Online Deep Clustering with Video Track Consistency [85.8868194550978]
We propose an unsupervised clustering-based approach to learn visual features from video object tracks.
We show that exploiting an unsupervised class-agnostic, yet noisy, track generator yields to better accuracy compared to relying on costly and precise track annotations.
arXiv Detail & Related papers (2022-06-07T08:11:00Z) - Learning to Track Objects from Unlabeled Videos [63.149201681380305]
In this paper, we propose to learn an Unsupervised Single Object Tracker (USOT) from scratch.
To narrow the gap between unsupervised trackers and supervised counterparts, we propose an effective unsupervised learning approach composed of three stages.
Experiments show that the proposed USOT learned from unlabeled videos performs well over the state-of-the-art unsupervised trackers by large margins.
arXiv Detail & Related papers (2021-08-28T22:10:06Z) - Self-supervised Object Tracking with Cycle-consistent Siamese Networks [55.040249900677225]
We exploit an end-to-end Siamese network in a cycle-consistent self-supervised framework for object tracking.
We propose to integrate a Siamese region proposal and mask regression network in our tracking framework so that a fast and more accurate tracker can be learned without the annotation of each frame.
arXiv Detail & Related papers (2020-08-03T04:10:38Z) - Unsupervised Deep Representation Learning for Real-Time Tracking [137.69689503237893]
We propose an unsupervised learning method for visual tracking.
The motivation of our unsupervised learning is that a robust tracker should be effective in bidirectional tracking.
We build our framework on a Siamese correlation filter network, and propose a multi-frame validation scheme and a cost-sensitive loss to facilitate unsupervised learning.
arXiv Detail & Related papers (2020-07-22T08:23:12Z) - Cascaded Regression Tracking: Towards Online Hard Distractor
Discrimination [202.2562153608092]
We propose a cascaded regression tracker with two sequential stages.
In the first stage, we filter out abundant easily-identified negative candidates.
In the second stage, a discrete sampling based ridge regression is designed to double-check the remaining ambiguous hard samples.
arXiv Detail & Related papers (2020-06-18T07:48:01Z) - AFAT: Adaptive Failure-Aware Tracker for Robust Visual Object Tracking [46.82222972389531]
Siamese approaches have achieved promising performance in visual object tracking recently.
Siamese paradigm uses one-shot learning to model the online tracking task, which impedes online adaptation in the tracking process.
We propose a failure-aware system, based on convolutional and LSTM modules in the decision stage, enabling online reporting of potential tracking failures.
arXiv Detail & Related papers (2020-05-27T23:21:12Z) - Unsupervised Multiple Person Tracking using AutoEncoder-Based Lifted
Multicuts [11.72025865314187]
We present an unsupervised multiple object tracking approach based on minimum visual features and lifted multicuts.
We show that, despite being trained without using the provided annotations, our model provides competitive results on the challenging MOT Benchmark for pedestrian tracking.
arXiv Detail & Related papers (2020-02-04T09:42:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.