DCF-ASN: Coarse-to-fine Real-time Visual Tracking via Discriminative
Correlation Filter and Attentional Siamese Network
- URL: http://arxiv.org/abs/2103.10607v1
- Date: Fri, 19 Mar 2021 03:01:21 GMT
- Title: DCF-ASN: Coarse-to-fine Real-time Visual Tracking via Discriminative
Correlation Filter and Attentional Siamese Network
- Authors: Xizhe Xue, Ying Li, Xiaoyue Yin, Qiang Shen
- Abstract summary: Discriminative correlation filters (DCF) and siamese networks have achieved promising performance on visual tracking tasks.
We propose a coarse-to-fine tracking framework, which roughly infers the target state via an online-updating DCF module.
The proposed DCF-ASN achieves the state-of-the-art performance while exhibiting good tracking efficiency.
- Score: 9.01402976480327
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Discriminative correlation filters (DCF) and siamese networks have achieved
promising performance on visual tracking tasks thanks to their superior
computational efficiency and reliable similarity metric learning, respectively.
However, how to effectively take advantages of powerful deep networks, while
maintaining the real-time response of DCF, remains a challenging problem.
Embedding the cross-correlation operator as a separate layer into siamese
networks is a popular choice to enhance the tracking accuracy. Being a key
component of such a network, the correlation layer is updated online together
with other parts of the network. Yet, when facing serious disturbance, fused
trackers may still drift away from the target completely due to accumulated
errors. To address these issues, we propose a coarse-to-fine tracking
framework, which roughly infers the target state via an online-updating DCF
module first and subsequently, finely locates the target through an
offline-training asymmetric siamese network (ASN). Benefitting from the
guidance of DCF and the learned channel weights obtained through exploiting the
given ground-truth template, ASN refines feature representation and implements
precise target localization. Systematic experiments on five popular tracking
datasets demonstrate that the proposed DCF-ASN achieves the state-of-the-art
performance while exhibiting good tracking efficiency.
Related papers
- AttenScribble: Attentive Similarity Learning for Scribble-Supervised
Medical Image Segmentation [5.8447004333496855]
In this paper, we present a straightforward yet effective scribble supervised learning framework.
We create a pluggable spatial self-attention module which could be attached on top of any internal feature layers of arbitrary fully convolutional network (FCN) backbone.
This attentive similarity leads to a novel regularization loss that imposes consistency between segmentation prediction and visual affinity.
arXiv Detail & Related papers (2023-12-11T18:42:18Z) - Optimization Guarantees of Unfolded ISTA and ADMM Networks With Smooth
Soft-Thresholding [57.71603937699949]
We study optimization guarantees, i.e., achieving near-zero training loss with the increase in the number of learning epochs.
We show that the threshold on the number of training samples increases with the increase in the network width.
arXiv Detail & Related papers (2023-09-12T13:03:47Z) - Deep Multi-Threshold Spiking-UNet for Image Processing [51.88730892920031]
This paper introduces the novel concept of Spiking-UNet for image processing, which combines the power of Spiking Neural Networks (SNNs) with the U-Net architecture.
To achieve an efficient Spiking-UNet, we face two primary challenges: ensuring high-fidelity information propagation through the network via spikes and formulating an effective training strategy.
Experimental results show that, on image segmentation and denoising, our Spiking-UNet achieves comparable performance to its non-spiking counterpart.
arXiv Detail & Related papers (2023-07-20T16:00:19Z) - SiamTHN: Siamese Target Highlight Network for Visual Tracking [11.111738354621595]
Siamese network based trackers treat each channel in the feature maps generated by the backbone network equally.
No structural links between the classification and regression branches in these trackers, and the two branches are optimized separately during training.
A Target Highlight Module is proposed to help the generated similarity response maps to be more focused on the target region.
arXiv Detail & Related papers (2023-03-22T04:33:02Z) - OFedQIT: Communication-Efficient Online Federated Learning via
Quantization and Intermittent Transmission [7.6058140480517356]
Online federated learning (OFL) is a promising framework to collaboratively learn a sequence of non-linear functions (or models) from distributed streaming data.
We propose a communication-efficient OFL algorithm (named OFedQIT) by means of a quantization and an intermittent transmission.
Our analysis reveals that OFedQIT successfully addresses the drawbacks of OFedAvg while maintaining superior learning accuracy.
arXiv Detail & Related papers (2022-05-13T07:46:43Z) - A Deep Value-network Based Approach for Multi-Driver Order Dispatching [55.36656442934531]
We propose a deep reinforcement learning based solution for order dispatching.
We conduct large scale online A/B tests on DiDi's ride-dispatching platform.
Results show that CVNet consistently outperforms other recently proposed dispatching methods.
arXiv Detail & Related papers (2021-06-08T16:27:04Z) - Higher Performance Visual Tracking with Dual-Modal Localization [106.91097443275035]
Visual Object Tracking (VOT) has synchronous needs for both robustness and accuracy.
We propose a dual-modal framework for target localization, consisting of robust localization suppressingors via ONR and the accurate localization attending to the target center precisely via OFC.
arXiv Detail & Related papers (2021-03-18T08:47:56Z) - Object Tracking through Residual and Dense LSTMs [67.98948222599849]
Deep learning-based trackers based on LSTMs (Long Short-Term Memory) recurrent neural networks have emerged as a powerful alternative.
DenseLSTMs outperform Residual and regular LSTM, and offer a higher resilience to nuisances.
Our case study supports the adoption of residual-based RNNs for enhancing the robustness of other trackers.
arXiv Detail & Related papers (2020-06-22T08:20:17Z) - Network Adjustment: Channel Search Guided by FLOPs Utilization Ratio [101.84651388520584]
This paper presents a new framework named network adjustment, which considers network accuracy as a function of FLOPs.
Experiments on standard image classification datasets and a wide range of base networks demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2020-04-06T15:51:00Z) - Effective Fusion of Deep Multitasking Representations for Robust Visual
Tracking [34.09763324745818]
This paper aims to evaluate the performance of twelve state-of-the-art ResNet-based FENs in a DCF-based framework.
It ranks their best feature maps and explores the generalized adoption of the best ResNet-based FEN into another DCF-based method.
The proposed method extracts deep semantic information from a fully convolutional FEN and fuses it with the best ResNet-based feature maps to strengthen the target representation.
arXiv Detail & Related papers (2020-04-03T05:33:59Z) - Object-Adaptive LSTM Network for Real-time Visual Tracking with
Adversarial Data Augmentation [31.842910084312265]
We propose a novel real-time visual tracking method, which adopts an object-adaptive LSTM network to effectively capture the video sequential dependencies and adaptively learn the object appearance variations.
Experiments on four visual tracking benchmarks demonstrate the state-of-the-art performance of our method in terms of both tracking accuracy and speed.
arXiv Detail & Related papers (2020-02-07T03:06:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.