Saliency-Associated Object Tracking
- URL: http://arxiv.org/abs/2108.03637v1
- Date: Sun, 8 Aug 2021 13:54:09 GMT
- Title: Saliency-Associated Object Tracking
- Authors: Zikun Zhou, Wenjie Pei, Xin Li, Hongpeng Wang, Feng Zheng, Zhenyu He
- Abstract summary: We propose to track the salient local parts of the target that are discriminative for tracking.
In particular, we propose a fine-grained saliency mining module to capture the local saliencies.
Experiments on five diverse datasets demonstrate that the proposed method performs favorably against state-of-the-art trackers.
- Score: 42.79662997199292
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Most existing trackers based on deep learning perform tracking in a holistic
strategy, which aims to learn deep representations of the whole target for
localizing the target. Such methods struggle to track targets undergoing
large appearance variations. To address this limitation, another line of
methods adopts a part-based tracking strategy, which divides the target into
equal patches and tracks all of these patches in parallel. The target state is
inferred by summarizing the tracking results of these patches. A potential
limitation of such trackers is that not all patches are equally informative for
tracking. Some patches that are not discriminative may have adverse effects. In
this paper, we propose to track the salient local parts of the target that are
discriminative for tracking. In particular, we propose a fine-grained saliency
mining module to capture the local saliencies. Further, we design a
saliency-association modeling module to associate the captured saliencies
together to learn effective correlation representations between the exemplar
and the search image for state estimation. Extensive experiments on five
diverse datasets demonstrate that the proposed method performs favorably
against state-of-the-art trackers.
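The abstract describes two components: a fine-grained saliency mining module that captures local saliencies of the target, and a saliency-association modeling module that relates them and correlates them with the search image for state estimation. As an illustration only, the sketch below shows one way such a pipeline could be wired up; the module names, top-k part selection, attention-based association, and feature dimensions are assumptions, not the paper's actual design.
```python
# Hypothetical sketch of the two modules named in the abstract; the real
# backbone, dimensions, and attention layout are not given in the abstract,
# so everything below is an illustrative assumption.
import torch
import torch.nn as nn


class SaliencyMining(nn.Module):
    """Scores every spatial location of the exemplar feature map and keeps
    the top-k most salient local parts (assumed formulation)."""

    def __init__(self, channels: int, num_parts: int = 8):
        super().__init__()
        self.score = nn.Conv2d(channels, 1, kernel_size=1)  # per-location saliency
        self.num_parts = num_parts

    def forward(self, exemplar_feat: torch.Tensor) -> torch.Tensor:
        b, c, h, w = exemplar_feat.shape
        saliency = self.score(exemplar_feat).flatten(2)       # (B, 1, H*W)
        topk = saliency.topk(self.num_parts, dim=-1).indices  # (B, 1, k)
        feats = exemplar_feat.flatten(2)                      # (B, C, H*W)
        idx = topk.expand(-1, c, -1)                          # (B, C, k)
        return feats.gather(-1, idx).transpose(1, 2)          # (B, k, C) part features


class SaliencyAssociation(nn.Module):
    """Associates the mined salient parts with each other and correlates them
    with the search-image features via attention (assumed formulation)."""

    def __init__(self, channels: int, heads: int = 4):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(channels, heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(channels, heads, batch_first=True)

    def forward(self, parts: torch.Tensor, search_feat: torch.Tensor) -> torch.Tensor:
        b, c, h, w = search_feat.shape
        parts, _ = self.self_attn(parts, parts, parts)        # relate the saliencies
        search = search_feat.flatten(2).transpose(1, 2)       # (B, H*W, C)
        corr, _ = self.cross_attn(search, parts, parts)       # correlate with search image
        return corr.transpose(1, 2).reshape(b, c, h, w)       # correlation representation


# Toy usage with random backbone features.
mining = SaliencyMining(channels=256)
assoc = SaliencyAssociation(channels=256)
exemplar = torch.randn(1, 256, 8, 8)    # exemplar (template) features
search = torch.randn(1, 256, 16, 16)    # search-region features
correlation = assoc(mining(exemplar), search)
print(correlation.shape)                # torch.Size([1, 256, 16, 16])
```
In the abstract's terms, the mined parts stand in for the local saliencies, and the cross-attended map stands in for the correlation representation that would feed a state-estimation head.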
Related papers
- RTracker: Recoverable Tracking via PN Tree Structured Memory [71.05904715104411]
We propose a recoverable tracking framework, RTracker, that uses a tree-structured memory to dynamically associate a tracker and a detector to enable self-recovery.
Specifically, we propose a Positive-Negative Tree-structured memory to chronologically store and maintain positive and negative target samples.
Our core idea is to use the support samples of positive and negative target categories to establish a relative distance-based criterion for a reliable assessment of target loss.
arXiv Detail & Related papers (2024-03-28T08:54:40Z) - CiteTracker: Correlating Image and Text for Visual Tracking [114.48653709286629]
We propose the CiteTracker to enhance target modeling and inference in visual tracking by connecting images and text.
Specifically, we develop a text generation module to convert the target image patch into a descriptive text.
We then associate the target description and the search image using an attention-based correlation module to generate the correlated features for target state reference.
arXiv Detail & Related papers (2023-08-22T09:53:12Z) - Mesh-SORT: Simple and effective of location-wise tracker [0.0]
- Mesh-SORT: Simple and effective of location-wise tracker [0.0]
In most tracking scenarios, objects tend to move and be lost within specific locations.
We propose different strategies for tracking and association that can identify and target these regions.
We present a robust strategy for dealing with lost objects, as well as a location-wise method for tracking by detection.
arXiv Detail & Related papers (2023-02-28T08:47:53Z) - Context-aware Visual Tracking with Joint Meta-updating [11.226947525556813]
We propose a context-aware tracking model that optimizes the tracker over the representation space, jointly meta-updating both branches by exploiting information along the whole sequence.
The proposed tracking method achieves an EAO score of 0.514 on VOT2018 at a speed of 40 FPS, demonstrating its capability to improve the accuracy and robustness of the underlying tracker with little speed drop.
arXiv Detail & Related papers (2022-04-04T14:16:00Z) - Tracking by Joint Local and Global Search: A Target-aware Attention
based Approach [63.50045332644818]
We propose a novel target-aware attention mechanism (termed TANet) to conduct joint local and global search for robust tracking.
Specifically, we extract the features of the target object patch and of continuous video frames, and then feed them into a decoder network to generate target-aware global attention maps.
In the tracking procedure, we integrate the target-aware attention with multiple trackers by exploring candidate search regions for robust tracking.
arXiv Detail & Related papers (2021-06-09T06:54:15Z) - Discriminative Appearance Modeling with Multi-track Pooling for
Real-time Multi-object Tracking [20.66906781151]
In multi-object tracking, the tracker maintains in its memory the appearance and motion information for each object in the scene.
Many approaches model each target in isolation and lack the ability to use all the targets in the scene to jointly update the memory.
We propose a training strategy adapted to multi-track pooling which generates hard tracking episodes online.
arXiv Detail & Related papers (2021-01-28T18:12:39Z) - Tracking-by-Counting: Using Network Flows on Crowd Density Maps for
Tracking Multiple Targets [96.98888948518815]
State-of-the-art multi-object tracking (MOT) methods follow the tracking-by-detection paradigm.
We propose a new MOT paradigm, tracking-by-counting, tailored for crowded scenes.
arXiv Detail & Related papers (2020-07-18T19:51:53Z) - Know Your Surroundings: Exploiting Scene Information for Object Tracking [181.1750279330811]
Current state-of-the-art trackers only rely on a target appearance model in order to localize the object in each frame.
We propose a novel tracking architecture which can utilize scene information for tracking.
arXiv Detail & Related papers (2020-03-24T17:59:04Z)