Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box
Estimation
- URL: http://arxiv.org/abs/2007.02024v2
- Date: Mon, 5 Apr 2021 10:50:13 GMT
- Title: Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box
Estimation
- Authors: Bin Yan, Dong Wang, Huchuan Lu, Xiaoyun Yang
- Abstract summary: We propose a novel, flexible and accurate refinement module called Alpha-Refine.
It exploits a precise pixel-wise correlation layer together with a spatial-aware non-local layer to fuse features and can predict three complementary outputs: bounding box, corners and mask.
We apply the proposed Alpha-Refine module to five famous and state-of-the-art base trackers: DiMP, ATOM, SiamRPN++, RTMDNet and ECO.
- Score: 87.53808756910452
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, the multiple-stage strategy has become a popular trend for
visual tracking. This strategy first utilizes a base tracker to coarsely locate
the target and then exploits a refinement module to obtain more accurate
results. However, existing refinement modules suffer from the limited
transferability and precision. In this work, we propose a novel, flexible and
accurate refinement module called Alpha-Refine, which exploits a precise
pixel-wise correlation layer together with a spatial-aware non-local layer to
fuse features and can predict three complementary outputs: bounding box,
corners and mask. To wisely choose the most adequate output, we also design a
light-weight branch selector module. We apply the proposed Alpha-Refine module
to five famous and state-of-the-art base trackers: DiMP, ATOM, SiamRPN++,
RTMDNet and ECO. The comprehensive experiments on TrackingNet, LaSOT and
VOT2018 benchmarks demonstrate that our approach significantly improves the
tracking performance in comparison with other existing refinement methods. The
source codes will be available at
https://github.com/MasterBin-IIAU/AlphaRefine.
Related papers
- Hierarchical IoU Tracking based on Interval [21.555469501789577]
Multi-Object Tracking (MOT) aims to detect and associate all targets of given classes across frames.
We propose the Hierarchical IoU Tracking framework, dubbed HIT, which achieves unified hierarchical tracking by utilizing tracklet intervals as priors.
Our method achieves promising performance on four datasets, i.e., MOT17, KITTI, DanceTrack and VisDrone.
arXiv Detail & Related papers (2024-06-19T07:03:18Z) - Staged Depthwise Correlation and Feature Fusion for Siamese Object
Tracking [0.6827423171182154]
We propose a novel staged depthwise correlation and feature fusion network, named DCFFNet, to further optimize the feature extraction for visual tracking.
We build our deep tracker upon a siamese network architecture, which is offline trained from scratch on multiple large-scale datasets.
For comprehensive evaluations of performance, we implement our tracker on the popular benchmarks, including OTB100, VOT2018 and LaSOT.
arXiv Detail & Related papers (2023-10-15T06:04:42Z) - Can SAM Boost Video Super-Resolution? [78.29033914169025]
We propose a simple yet effective module -- SAM-guidEd refinEment Module (SEEM)
This light-weight plug-in module is specifically designed to leverage the attention mechanism for the generation of semantic-aware feature.
We apply our SEEM to two representative methods, EDVR and BasicVSR, resulting in consistently improved performance with minimal implementation effort.
arXiv Detail & Related papers (2023-05-11T02:02:53Z) - Joint Feature Learning and Relation Modeling for Tracking: A One-Stream
Framework [76.70603443624012]
We propose a novel one-stream tracking (OSTrack) framework that unifies feature learning and relation modeling.
In this way, discriminative target-oriented features can be dynamically extracted by mutual guidance.
OSTrack achieves state-of-the-art performance on multiple benchmarks, in particular, it shows impressive results on the one-shot tracking benchmark GOT-10k.
arXiv Detail & Related papers (2022-03-22T18:37:11Z) - Learning Dynamic Compact Memory Embedding for Deformable Visual Object
Tracking [82.34356879078955]
We propose a compact memory embedding to enhance the discrimination of the segmentation-based deformable visual tracking method.
Our method outperforms the excellent segmentation-based trackers, i.e., D3S and SiamMask on DAVIS 2017 benchmark.
arXiv Detail & Related papers (2021-11-23T03:07:12Z) - Pyramid Correlation based Deep Hough Voting for Visual Object Tracking [16.080776515556686]
We introduce a voting-based classification-only tracking algorithm named Pyramid Correlation based Deep Hough Voting (short for PCDHV)
Specifically we innovatively construct a Pyramid Correlation module to equip the embedded feature with fine-grained local structures and global spatial contexts.
The elaborately designed Deep Hough Voting module further take over, integrating long-range dependencies of pixels to perceive corners.
arXiv Detail & Related papers (2021-10-15T10:37:00Z) - Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box
Estimation [85.22775182688798]
This work proposes a novel, flexible, and accurate refinement module called Alpha-Refine.
It can significantly improve the base trackers' box estimation quality.
Experiments on TrackingNet, LaSOT, GOT-10K, and VOT 2020 benchmarks show that our approach significantly improves the base trackers' performance with little extra latency.
arXiv Detail & Related papers (2020-12-12T13:33:25Z) - MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down
Distillation [153.56211546576978]
In this work, we propose that better soft targets with higher compatibil-ity can be generated by using a label generator.
We can employ the meta-learning technique to optimize this label generator.
The experiments are conducted on two standard classificationbenchmarks, namely CIFAR-100 and ILSVRC2012.
arXiv Detail & Related papers (2020-08-27T13:04:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.