Graph Attention Tracking
- URL: http://arxiv.org/abs/2011.11204v1
- Date: Mon, 23 Nov 2020 04:26:45 GMT
- Title: Graph Attention Tracking
- Authors: Dongyan Guo, Yanyan Shao, Ying Cui, Zhenhua Wang, Liyan Zhang, Chunhua
Shen
- Abstract summary: We propose a simple target-aware Siamese graph attention network for general object tracking.
Experiments on challenging benchmarks including GOT-10k, UAV123, OTB-100 and LaSOT demonstrate that the proposed SiamGAT outperforms many state-of-the-art trackers.
- Score: 76.19829750144564
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Siamese network based trackers formulate the visual tracking task as a
similarity matching problem. Almost all popular Siamese trackers realize the
similarity learning via convolutional feature cross-correlation between a
target branch and a search branch. However, since the size of target feature
region needs to be pre-fixed, these cross-correlation base methods suffer from
either reserving much adverse background information or missing a great deal of
foreground information. Moreover, the global matching between the target and
search region also largely neglects the target structure and part-level
information.
In this paper, to solve the above issues, we propose a simple target-aware
Siamese graph attention network for general object tracking. We propose to
establish part-to-part correspondence between the target and the search region
with a complete bipartite graph, and apply the graph attention mechanism to
propagate target information from the template feature to the search feature.
Further, instead of using the pre-fixed region cropping for
template-feature-area selection, we investigate a target-aware area selection
mechanism to fit the size and aspect ratio variations of different objects.
Experiments on challenging benchmarks including GOT-10k, UAV123, OTB-100 and
LaSOT demonstrate that the proposed SiamGAT outperforms many state-of-the-art
trackers and achieves leading performance. Code is available at:
https://git.io/SiamGAT
Related papers
- Optimized Information Flow for Transformer Tracking [0.7199733380797579]
One-stream Transformer trackers have shown outstanding performance in challenging benchmark datasets.
We propose a novel OIFTrack framework to enhance the discriminative capability of the tracker.
arXiv Detail & Related papers (2024-02-13T03:39:15Z) - Multi-Correlation Siamese Transformer Network with Dense Connection for
3D Single Object Tracking [14.47355191520578]
Point cloud-based 3D object tracking is an important task in autonomous driving.
It remains challenging to learn the correlation between the template and search branches effectively with the sparse LIDAR point cloud data.
We present a multi-correlation Siamese Transformer network that has multiple stages and carries out feature correlation at the end of each stage.
arXiv Detail & Related papers (2023-12-18T09:33:49Z) - Target-Aware Tracking with Long-term Context Attention [8.20858704675519]
Long-term context attention (LCA) module can perform extensive information fusion on the target and its context from long-term frames.
LCA uses the target state from the previous frame to exclude the interference of similar objects and complex backgrounds.
Our tracker achieves state-of-the-art performance on multiple benchmarks, with 71.1% AUC, 89.3% NP, and 73.0% AO on LaSOT, TrackingNet, and GOT-10k.
arXiv Detail & Related papers (2023-02-27T14:40:58Z) - Revisiting Color-Event based Tracking: A Unified Network, Dataset, and
Metric [53.88188265943762]
We propose a single-stage backbone network for Color-Event Unified Tracking (CEUTrack), which achieves the above functions simultaneously.
Our proposed CEUTrack is simple, effective, and efficient, which achieves over 75 FPS and new SOTA performance.
arXiv Detail & Related papers (2022-11-20T16:01:31Z) - 3D Siamese Transformer Network for Single Object Tracking on Point
Clouds [22.48888264770609]
Siamese network based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area.
We explicitly use Transformer to form a 3D Siamese Transformer network for learning robust cross correlation between the template and the search area.
Our method achieves state-of-the-art performance on the 3D single object tracking task.
arXiv Detail & Related papers (2022-07-25T09:08:30Z) - Learning Dynamic Compact Memory Embedding for Deformable Visual Object
Tracking [82.34356879078955]
We propose a compact memory embedding to enhance the discrimination of the segmentation-based deformable visual tracking method.
Our method outperforms the excellent segmentation-based trackers, i.e., D3S and SiamMask on DAVIS 2017 benchmark.
arXiv Detail & Related papers (2021-11-23T03:07:12Z) - Tracking by Joint Local and Global Search: A Target-aware Attention
based Approach [63.50045332644818]
We propose a novel target-aware attention mechanism (termed TANet) to conduct joint local and global search for robust tracking.
Specifically, we extract the features of target object patch and continuous video frames, then we track and feed them into a decoder network to generate target-aware global attention maps.
In the tracking procedure, we integrate the target-aware attention with multiple trackers by exploring candidate search regions for robust tracking.
arXiv Detail & Related papers (2021-06-09T06:54:15Z) - Tracklets Predicting Based Adaptive Graph Tracking [51.352829280902114]
We present an accurate and end-to-end learning framework for multi-object tracking, namely textbfTPAGT.
It re-extracts the features of the tracklets in the current frame based on motion predicting, which is the key to solve the problem of features inconsistent.
arXiv Detail & Related papers (2020-10-18T16:16:49Z) - Learning Spatio-Appearance Memory Network for High-Performance Visual
Tracking [79.80401607146987]
Existing object tracking usually learns a bounding-box based template to match visual targets across frames, which cannot accurately learn a pixel-wise representation.
This paper presents a novel segmentation-based tracking architecture, which is equipped with a local-temporal memory network to learn accurate-temporal correspondence.
arXiv Detail & Related papers (2020-09-21T08:12:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.