High-Performance Long-Term Tracking with Meta-Updater
- URL: http://arxiv.org/abs/2004.00305v1
- Date: Wed, 1 Apr 2020 09:29:23 GMT
- Title: High-Performance Long-Term Tracking with Meta-Updater
- Authors: Kenan Dai, Yunhua Zhang, Dong Wang, Jianhua Li, Huchuan Lu, and
Xiaoyun Yang
- Abstract summary: Long-term visual tracking has drawn increasing attention because it is much closer to practical applications than short-term tracking.
Most top-ranked long-term trackers adopt the offline-trained Siamese architectures, thus, they cannot benefit from great progress of short-term trackers with online update.
We propose a novel offline-trained Meta-Updater to address an important but unsolved problem: Is the tracker ready for updating in the current frame?
- Score: 75.80564183653274
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Long-term visual tracking has drawn increasing attention because it is much
closer to practical applications than short-term tracking. Most top-ranked
long-term trackers adopt the offline-trained Siamese architectures, thus, they
cannot benefit from great progress of short-term trackers with online update.
However, it is quite risky to straightforwardly introduce online-update-based
trackers to solve the long-term problem, due to long-term uncertain and noisy
observations. In this work, we propose a novel offline-trained Meta-Updater to
address an important but unsolved problem: Is the tracker ready for updating in
the current frame? The proposed meta-updater can effectively integrate
geometric, discriminative, and appearance cues in a sequential manner, and then
mine the sequential information with a designed cascaded LSTM module. Our
meta-updater learns a binary output to guide the tracker's update and can be
easily embedded into different trackers. This work also introduces a long-term
tracking framework consisting of an online local tracker, an online verifier, a
SiamRPN-based re-detector, and our meta-updater. Numerous experimental results
on the VOT2018LT, VOT2019LT, OxUvALT, TLP, and LaSOT benchmarks show that our
tracker performs remarkably better than other competing algorithms. Our project
is available on the website: https://github.com/Daikenan/LTMU.
Related papers
- LITE: A Paradigm Shift in Multi-Object Tracking with Efficient ReID Feature Integration [0.3277163122167433]
Lightweight Integrated Tracking-Feature Extraction paradigm is introduced as a novel multi-object tracking (MOT) approach.
It enhances ReID-based trackers by eliminating inference, pre-processing, post-processing, and ReID model training costs.
arXiv Detail & Related papers (2024-09-06T11:05:12Z) - Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking [52.04679257903805]
Joint Detection and Embedding (JDE) trackers have demonstrated excellent performance in Multi-Object Tracking (MOT) tasks.
Our tracker, named TCBTrack, achieves state-of-the-art performance on multiple public benchmarks.
arXiv Detail & Related papers (2024-07-19T07:48:45Z) - Tracking with Human-Intent Reasoning [64.69229729784008]
This work proposes a new tracking task -- Instruction Tracking.
It involves providing implicit tracking instructions that require the trackers to perform tracking automatically in video frames.
TrackGPT is capable of performing complex reasoning-based tracking.
arXiv Detail & Related papers (2023-12-29T03:22:18Z) - CoTracker: It is Better to Track Together [70.63040730154984]
CoTracker is a transformer-based model that tracks a large number of 2D points in long video sequences.
We show that joint tracking significantly improves tracking accuracy and robustness, and allows CoTracker to track occluded points and points outside of the camera view.
arXiv Detail & Related papers (2023-07-14T21:13:04Z) - Improving Siamese Based Trackers with Light or No Training through Multiple Templates and Temporal Network [0.0]
We propose a framework with two ideas on Siamese-based trackers.
(i) Extending number of templates in a way that removes the need to retrain the network.
(ii) a lightweight temporal network with a novel architecture focusing on both local and global information.
arXiv Detail & Related papers (2022-11-24T22:07:33Z) - Context-aware Visual Tracking with Joint Meta-updating [11.226947525556813]
We propose a context-aware tracking model to optimize the tracker over the representation space, which jointly meta-update both branches by exploiting information along the whole sequence.
The proposed tracking method achieves an EAO score of 0.514 on VOT2018 with the speed of 40FPS, demonstrating its capability of improving the accuracy and robustness of the underlying tracker with little speed drop.
arXiv Detail & Related papers (2022-04-04T14:16:00Z) - Learning to Track Objects from Unlabeled Videos [63.149201681380305]
In this paper, we propose to learn an Unsupervised Single Object Tracker (USOT) from scratch.
To narrow the gap between unsupervised trackers and supervised counterparts, we propose an effective unsupervised learning approach composed of three stages.
Experiments show that the proposed USOT learned from unlabeled videos performs well over the state-of-the-art unsupervised trackers by large margins.
arXiv Detail & Related papers (2021-08-28T22:10:06Z) - LightTrack: Finding Lightweight Neural Networks for Object Tracking via
One-Shot Architecture Search [104.84999119090887]
We present LightTrack, which uses neural architecture search (NAS) to design more lightweight and efficient object trackers.
Comprehensive experiments show that our LightTrack is effective.
It can find trackers that achieve superior performance compared to handcrafted SOTA trackers, such as SiamRPN++ and Ocean.
arXiv Detail & Related papers (2021-04-29T17:55:24Z) - STMTrack: Template-free Visual Tracking with Space-time Memory Networks [42.06375415765325]
Existing trackers with template updating mechanisms rely on time-consuming numerical optimization and complex hand-designed strategies to achieve competitive performance.
We propose a novel tracking framework built on top of a space-time memory network that is competent to make full use of historical information related to the target.
Specifically, a novel memory mechanism is introduced, which stores the historical information of the target to guide the tracker to focus on the most informative regions in the current frame.
arXiv Detail & Related papers (2021-04-01T08:10:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.