Fully Convolutional Online Tracking
- URL: http://arxiv.org/abs/2004.07109v5
- Date: Sun, 26 Sep 2021 09:13:16 GMT
- Title: Fully Convolutional Online Tracking
- Authors: Yutao Cui, Cheng Jiang, Limin Wang, Gangshan Wu
- Abstract summary: We present a fully convolutional online tracking framework, coined as FCOT, for both classification and regression branches.
Our key contribution is to introduce an online regression model generator (RMG) for initializing weights of the target filter with online samples.
Thanks to the unique design of RMG, our FCOT can not only be more effective in handling target variation along temporal dimension thus generating more precise results.
- Score: 47.78513247048846
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Online learning has turned out to be effective for improving tracking
performance. However, it could be simply applied for classification branch, but
still remains challenging to adapt to regression branch due to its complex
design and intrinsic requirement for high-quality online samples. To tackle
this issue, we present the fully convolutional online tracking framework,
coined as FCOT, and focus on enabling online learning for both classification
and regression branches by using a target filter based tracking paradigm. Our
key contribution is to introduce an online regression model generator (RMG) for
initializing weights of the target filter with online samples and then
optimizing this target filter weights based on the groundtruth samples at the
first frame. Based on the online RGM, we devise a simple anchor-free tracker
(FCOT), composed of a feature backbone, an up-sampling decoder, a multi-scale
classification branch, and a multi-scale regression branch. Thanks to the
unique design of RMG, our FCOT can not only be more effective in handling
target variation along temporal dimension thus generating more precise results,
but also overcome the issue of error accumulation during the tracking
procedure. In addition, due to its simplicity in design, our FCOT could be
trained and deployed in a fully convolutional manner with a real-time running
speed. The proposed FCOT achieves the state-of-the-art performance on seven
benchmarks, including VOT2018, LaSOT, TrackingNet, GOT-10k, OTB100, UAV123, and
NFS. Code and models of our FCOT have been released at:
\url{https://github.com/MCG-NJU/FCOT}.
Related papers
- Exploring Dynamic Transformer for Efficient Object Tracking [58.120191254379854]
We propose DyTrack, a dynamic transformer framework for efficient tracking.
DyTrack automatically learns to configure proper reasoning routes for various inputs, gaining better utilization of the available computational budget.
Experiments on multiple benchmarks demonstrate that DyTrack achieves promising speed-precision trade-offs with only a single model.
arXiv Detail & Related papers (2024-03-26T12:31:58Z) - 3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking [15.330384668966806]
State-of-the-art 3D multi-object tracking (MOT) approaches typically rely on non-learned model-based algorithms such as Kalman Filter.
We propose 3DMOTFormer, a learned geometry-based 3D MOT framework building upon the transformer architecture.
Our approach achieves 71.2% and 68.2% AMOTA on the nuScenes validation and test split, respectively.
arXiv Detail & Related papers (2023-08-12T19:19:58Z) - Efficient Few-Shot Object Detection via Knowledge Inheritance [62.36414544915032]
Few-shot object detection (FSOD) aims at learning a generic detector that can adapt to unseen tasks with scarce training samples.
We present an efficient pretrain-transfer framework (PTF) baseline with no computational increment.
We also propose an adaptive length re-scaling (ALR) strategy to alleviate the vector length inconsistency between the predicted novel weights and the pretrained base weights.
arXiv Detail & Related papers (2022-03-23T06:24:31Z) - Recursive Least-Squares Estimator-Aided Online Learning for Visual
Tracking [58.14267480293575]
We propose a simple yet effective online learning approach for few-shot online adaptation without requiring offline training.
It allows an in-built memory retention mechanism for the model to remember the knowledge about the object seen before.
We evaluate our approach based on two networks in the online learning families for tracking, i.e., multi-layer perceptrons in RT-MDNet and convolutional neural networks in DiMP.
arXiv Detail & Related papers (2021-12-28T06:51:18Z) - SiamRCR: Reciprocal Classification and Regression for Visual Object
Tracking [47.647615772027606]
We propose a novel siamese tracking algorithm called SiamRCR, addressing this problem with a simple, light and effective solution.
It builds reciprocal links between classification and regression branches, which can dynamically re-weight their losses for each positive sample.
In addition, we add a localization branch to predict the localization accuracy, so that it can work as the replacement of the regression assistance link during inference.
arXiv Detail & Related papers (2021-05-24T12:21:25Z) - Target Transformed Regression for Accurate Tracking [30.516462193231888]
This paper repurposes a Transformer-alike regression branch, termed as Target Transformed Regression (TREG) for accurate anchor-free tracking.
The core to our TREG is to model pair-wise relation between elements in target template and search region, and use the resulted target enhanced visual representation for accurate bounding box regression.
In addition, we devise a simple online template update mechanism to select reliable templates, increasing the robustness for appearance variations and geometric deformations of target in time.
arXiv Detail & Related papers (2021-04-01T11:25:23Z) - Cascaded Regression Tracking: Towards Online Hard Distractor
Discrimination [202.2562153608092]
We propose a cascaded regression tracker with two sequential stages.
In the first stage, we filter out abundant easily-identified negative candidates.
In the second stage, a discrete sampling based ridge regression is designed to double-check the remaining ambiguous hard samples.
arXiv Detail & Related papers (2020-06-18T07:48:01Z) - Dynamic R-CNN: Towards High Quality Object Detection via Dynamic
Training [70.2914594796002]
We propose Dynamic R-CNN to adjust the label assignment criteria and the shape of regression loss function.
Our method improves upon ResNet-50-FPN baseline with 1.9% AP and 5.5% AP$_90$ on the MS dataset with no extra overhead.
arXiv Detail & Related papers (2020-04-13T15:20:25Z) - AutoTrack: Towards High-Performance Visual Tracking for UAV with
Automatic Spatio-Temporal Regularization [19.379240684856423]
Most existing trackers based on discriminative correlation filters (DCF) try to introduce predefined regularization term to improve the learning of target objects.
In this work, a novel approach is proposed to online and automatically adaptively learn-temporal regularization term.
Experiments on four UAV benchmarks have proven the superiority of our method compared to the state-of-the-art CPU- and GPU-based trackers.
arXiv Detail & Related papers (2020-03-29T05:02:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.