Dynamic Attention guided Multi-Trajectory Analysis for Single Object
Tracking
- URL: http://arxiv.org/abs/2103.16086v1
- Date: Tue, 30 Mar 2021 05:36:31 GMT
- Title: Dynamic Attention guided Multi-Trajectory Analysis for Single Object
Tracking
- Authors: Xiao Wang, Zhe Chen, Jin Tang, Bin Luo, Yaowei Wang, Yonghong Tian,
Feng Wu
- Abstract summary: We propose to introduce more dynamics by devising a dynamic attention-guided multi-trajectory tracking strategy.
In particular, we construct dynamic appearance model that contains multiple target templates, each of which provides its own attention for locating the target in the new frame.
After spanning the whole sequence, we introduce a multi-trajectory selection network to find the best trajectory that delivers improved tracking performance.
- Score: 62.13213518417047
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Most of the existing single object trackers track the target in a unitary
local search window, making them particularly vulnerable to challenging factors
such as heavy occlusions and out-of-view movements. Despite the attempts to
further incorporate global search, prevailing mechanisms that cooperate local
and global search are relatively static, thus are still sub-optimal for
improving tracking performance. By further studying the local and global search
results, we raise a question: can we allow more dynamics for cooperating both
results? In this paper, we propose to introduce more dynamics by devising a
dynamic attention-guided multi-trajectory tracking strategy. In particular, we
construct dynamic appearance model that contains multiple target templates,
each of which provides its own attention for locating the target in the new
frame. Guided by different attention, we maintain diversified tracking results
for the target to build multi-trajectory tracking history, allowing more
candidates to represent the true target trajectory. After spanning the whole
sequence, we introduce a multi-trajectory selection network to find the best
trajectory that delivers improved tracking performance. Extensive experimental
results show that our proposed tracking strategy achieves compelling
performance on various large-scale tracking benchmarks. The project page of
this paper can be found at https://sites.google.com/view/mt-track/.
Related papers
- Multi-object Tracking by Detection and Query: an efficient end-to-end manner [23.926668750263488]
Multi-object tracking is advancing through two dominant paradigms: traditional tracking by detection and newly emerging tracking by query.
We propose the tracking-by-detection-and-query paradigm, which is achieved by a Learnable Associator.
Compared to tracking-by-query models, LAID achieves competitive tracking accuracy with notably higher training efficiency.
arXiv Detail & Related papers (2024-11-09T14:38:08Z) - Exploring Dynamic Transformer for Efficient Object Tracking [58.120191254379854]
We propose DyTrack, a dynamic transformer framework for efficient tracking.
DyTrack automatically learns to configure proper reasoning routes for various inputs, gaining better utilization of the available computational budget.
Experiments on multiple benchmarks demonstrate that DyTrack achieves promising speed-precision trade-offs with only a single model.
arXiv Detail & Related papers (2024-03-26T12:31:58Z) - Single-Shot and Multi-Shot Feature Learning for Multi-Object Tracking [55.13878429987136]
We propose a simple yet effective two-stage feature learning paradigm to jointly learn single-shot and multi-shot features for different targets.
Our method has achieved significant improvements on MOT17 and MOT20 datasets while reaching state-of-the-art performance on DanceTrack dataset.
arXiv Detail & Related papers (2023-11-17T08:17:49Z) - MotionTrack: Learning Robust Short-term and Long-term Motions for
Multi-Object Tracking [56.92165669843006]
We propose MotionTrack, which learns robust short-term and long-term motions in a unified framework to associate trajectories from a short to long range.
For dense crowds, we design a novel Interaction Module to learn interaction-aware motions from short-term trajectories, which can estimate the complex movement of each target.
For extreme occlusions, we build a novel Refind Module to learn reliable long-term motions from the target's history trajectory, which can link the interrupted trajectory with its corresponding detection.
arXiv Detail & Related papers (2023-03-18T12:38:33Z) - Towards Discriminative Representation: Multi-view Trajectory Contrastive
Learning for Online Multi-object Tracking [1.0474108328884806]
We propose a strategy, namely multi-view trajectory contrastive learning, in which each trajectory is represented as a center vector.
In the inference stage, a similarity-guided feature fusion strategy is developed for further boosting the quality of the trajectory representation.
Our method has surpassed preceding trackers and established new state-of-the-art performance.
arXiv Detail & Related papers (2022-03-27T04:53:31Z) - DSRRTracker: Dynamic Search Region Refinement for Attention-based
Siamese Multi-Object Tracking [13.104037155691644]
We propose an end-to-end MOT method, with a Gaussian filter-inspired dynamic search region refinement module.
Our method can achieve the state-of-the-art performance with reasonable speed.
arXiv Detail & Related papers (2022-03-21T04:14:06Z) - Tracking by Joint Local and Global Search: A Target-aware Attention
based Approach [63.50045332644818]
We propose a novel target-aware attention mechanism (termed TANet) to conduct joint local and global search for robust tracking.
Specifically, we extract the features of target object patch and continuous video frames, then we track and feed them into a decoder network to generate target-aware global attention maps.
In the tracking procedure, we integrate the target-aware attention with multiple trackers by exploring candidate search regions for robust tracking.
arXiv Detail & Related papers (2021-06-09T06:54:15Z) - Tracking-by-Counting: Using Network Flows on Crowd Density Maps for
Tracking Multiple Targets [96.98888948518815]
State-of-the-art multi-object tracking(MOT) methods follow the tracking-by-detection paradigm.
We propose a new MOT paradigm, tracking-by-counting, tailored for crowded scenes.
arXiv Detail & Related papers (2020-07-18T19:51:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.