CRACT: Cascaded Regression-Align-Classification for Robust Visual
Tracking
- URL: http://arxiv.org/abs/2011.12483v1
- Date: Wed, 25 Nov 2020 02:18:33 GMT
- Title: CRACT: Cascaded Regression-Align-Classification for Robust Visual
Tracking
- Authors: Heng Fan, Haibin Ling
- Abstract summary: We introduce an improved proposal refinement module, Cascaded Regression-Align- Classification (CRAC)
CRAC yields new state-of-the-art performances on many benchmarks.
In experiments on seven benchmarks including OTB-2015, UAV123, NfS, VOT-2018, TrackingNet, GOT-10k and LaSOT, our CRACT exhibits very promising results in comparison with state-of-the-art competitors.
- Score: 97.84109669027225
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: High quality object proposals are crucial in visual tracking algorithms that
utilize region proposal network (RPN). Refinement of these proposals, typically
by box regression and classification in parallel, has been popularly adopted to
boost tracking performance. However, it still meets problems when dealing with
complex and dynamic background. Thus motivated, in this paper we introduce an
improved proposal refinement module, Cascaded Regression-Align-Classification
(CRAC), which yields new state-of-the-art performances on many benchmarks.
First, having observed that the offsets from box regression can serve as
guidance for proposal feature refinement, we design CRAC as a cascade of box
regression, feature alignment and box classification. The key is to bridge box
regression and classification via an alignment step, which leads to more
accurate features for proposal classification with improved robustness. To
address the variation in object appearance, we introduce an
identification-discrimination component for box classification, which leverages
offline reliable fine-grained template and online rich background information
to distinguish the target from background. Moreover, we present pyramid
RoIAlign that benefits CRAC by exploiting both the local and global cues of
proposals. During inference, tracking proceeds by ranking all refined proposals
and selecting the best one. In experiments on seven benchmarks including
OTB-2015, UAV123, NfS, VOT-2018, TrackingNet, GOT-10k and LaSOT, our CRACT
exhibits very promising results in comparison with state-of-the-art competitors
and runs in real-time.
Related papers
- DPAN: Dynamic Preference-based and Attribute-aware Network for Relevant
Recommendations [3.4947076558586967]
We propose a novel method called the Dynamic Preference-based and Attribute-aware Network (DPAN) for predicting Click-Through Rate (CTR) in relevant recommendations.
DPAN has been successfully deployed on our e-commerce platform serving the primary traffic for relevant recommendations.
arXiv Detail & Related papers (2023-08-21T07:26:09Z) - Learning Dynamic Compact Memory Embedding for Deformable Visual Object
Tracking [82.34356879078955]
We propose a compact memory embedding to enhance the discrimination of the segmentation-based deformable visual tracking method.
Our method outperforms the excellent segmentation-based trackers, i.e., D3S and SiamMask on DAVIS 2017 benchmark.
arXiv Detail & Related papers (2021-11-23T03:07:12Z) - Open-Set Recognition: A Good Closed-Set Classifier is All You Need [146.6814176602689]
We show that the ability of a classifier to make the 'none-of-above' decision is highly correlated with its accuracy on the closed-set classes.
We use this correlation to boost the performance of the cross-entropy OSR 'baseline' by improving its closed-set accuracy.
We also construct new benchmarks which better respect the task of detecting semantic novelty.
arXiv Detail & Related papers (2021-10-12T17:58:59Z) - SiamRCR: Reciprocal Classification and Regression for Visual Object
Tracking [47.647615772027606]
We propose a novel siamese tracking algorithm called SiamRCR, addressing this problem with a simple, light and effective solution.
It builds reciprocal links between classification and regression branches, which can dynamically re-weight their losses for each positive sample.
In addition, we add a localization branch to predict the localization accuracy, so that it can work as the replacement of the regression assistance link during inference.
arXiv Detail & Related papers (2021-05-24T12:21:25Z) - Adversarial Feature Augmentation and Normalization for Visual
Recognition [109.6834687220478]
Recent advances in computer vision take advantage of adversarial data augmentation to ameliorate the generalization ability of classification models.
Here, we present an effective and efficient alternative that advocates adversarial augmentation on intermediate feature embeddings.
We validate the proposed approach across diverse visual recognition tasks with representative backbone networks.
arXiv Detail & Related papers (2021-03-22T20:36:34Z) - Higher Performance Visual Tracking with Dual-Modal Localization [106.91097443275035]
Visual Object Tracking (VOT) has synchronous needs for both robustness and accuracy.
We propose a dual-modal framework for target localization, consisting of robust localization suppressingors via ONR and the accurate localization attending to the target center precisely via OFC.
arXiv Detail & Related papers (2021-03-18T08:47:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.