Higher Performance Visual Tracking with Dual-Modal Localization
- URL: http://arxiv.org/abs/2103.10089v1
- Date: Thu, 18 Mar 2021 08:47:56 GMT
- Title: Higher Performance Visual Tracking with Dual-Modal Localization
- Authors: Jinghao Zhou, Bo Li, Lei Qiao, Peng Wang, Weihao Gan, Wei Wu, Junjie
Yan, Wanli Ouyang
- Abstract summary: Visual Object Tracking (VOT) has synchronous needs for both robustness and accuracy.
We propose a dual-modal framework for target localization, consisting of robust localization suppressingors via ONR and the accurate localization attending to the target center precisely via OFC.
- Score: 106.91097443275035
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Visual Object Tracking (VOT) has synchronous needs for both robustness and
accuracy. While most existing works fail to operate simultaneously on both, we
investigate in this work the problem of conflicting performance between
accuracy and robustness. We first conduct a systematic comparison among
existing methods and analyze their restrictions in terms of accuracy and
robustness. Specifically, 4 formulations-offline classification (OFC), offline
regression (OFR), online classification (ONC), and online regression (ONR)-are
considered, categorized by the existence of online update and the types of
supervision signal. To account for the problem, we resort to the idea of
ensemble and propose a dual-modal framework for target localization, consisting
of robust localization suppressing distractors via ONR and the accurate
localization attending to the target center precisely via OFC. To yield a final
representation (i.e, bounding box), we propose a simple but effective score
voting strategy to involve adjacent predictions such that the final
representation does not commit to a single location. Operating beyond the
real-time demand, our proposed method is further validated on 8
datasets-VOT2018, VOT2019, OTB2015, NFS, UAV123, LaSOT, TrackingNet, and
GOT-10k, achieving state-of-the-art performance.
Related papers
- RLOMM: An Efficient and Robust Online Map Matching Framework with Reinforcement Learning [15.616652008152098]
This paper introduces a novel framework that achieves high accuracy and efficient matching while ensuring robustness in handling diverse scenarios.
To improve efficiency, we begin by modeling the online map matching problem as an Online Markov Decision Process.
Next, to enhance the model's robustness, we design a reinforcement learning method, enabling robust handling of real-time data.
arXiv Detail & Related papers (2025-02-05T11:26:32Z) - Locally Adaptive One-Class Classifier Fusion with Dynamic $\ell$p-Norm Constraints for Robust Anomaly Detection [17.93058599783703]
We introduce a framework that dynamically adjusts fusion weights based on local data characteristics.
Our method incorporates an interior-point optimization technique that significantly improves computational efficiency.
The framework's ability to adapt to local data patterns while maintaining computational efficiency makes it particularly valuable for real-time applications.
arXiv Detail & Related papers (2024-11-10T09:57:13Z) - Logit Calibration and Feature Contrast for Robust Federated Learning on Non-IID Data [45.11652096723593]
Federated learning (FL) is a privacy-preserving distributed framework for collaborative model training on devices in edge networks.
This paper proposes FatCC, which incorporates local logit underlineCalibration and global feature underlineContrast into the vanilla federated adversarial training process from both logit and feature perspectives.
arXiv Detail & Related papers (2024-04-10T06:35:25Z) - Semi-DETR: Semi-Supervised Object Detection with Detection Transformers [105.45018934087076]
We analyze the DETR-based framework on semi-supervised object detection (SSOD)
We present Semi-DETR, the first transformer-based end-to-end semi-supervised object detector.
Our method outperforms all state-of-the-art methods by clear margins.
arXiv Detail & Related papers (2023-07-16T16:32:14Z) - Learning Localization-aware Target Confidence for Siamese Visual
Tracking [13.684278662495204]
We propose a novel tracking paradigm, called SiamLA.
Within this paradigm, a series of simple, yet effective localization-aware components are introduced.
Our SiamLA achieves state-of-the-art performance in terms of both accuracy and efficiency.
arXiv Detail & Related papers (2022-04-29T13:37:15Z) - Joint Feature Learning and Relation Modeling for Tracking: A One-Stream
Framework [76.70603443624012]
We propose a novel one-stream tracking (OSTrack) framework that unifies feature learning and relation modeling.
In this way, discriminative target-oriented features can be dynamically extracted by mutual guidance.
OSTrack achieves state-of-the-art performance on multiple benchmarks, in particular, it shows impressive results on the one-shot tracking benchmark GOT-10k.
arXiv Detail & Related papers (2022-03-22T18:37:11Z) - Open-Set Recognition: A Good Closed-Set Classifier is All You Need [146.6814176602689]
We show that the ability of a classifier to make the 'none-of-above' decision is highly correlated with its accuracy on the closed-set classes.
We use this correlation to boost the performance of the cross-entropy OSR 'baseline' by improving its closed-set accuracy.
We also construct new benchmarks which better respect the task of detecting semantic novelty.
arXiv Detail & Related papers (2021-10-12T17:58:59Z) - Inter-class Discrepancy Alignment for Face Recognition [55.578063356210144]
We propose a unified framework calledInter-class DiscrepancyAlignment(IDA)
IDA-DAO is used to align the similarity scores considering the discrepancy between the images and its neighbors.
IDA-SSE can provide convincing inter-class neighbors by introducing virtual candidate images generated with GAN.
arXiv Detail & Related papers (2021-03-02T08:20:08Z) - CRACT: Cascaded Regression-Align-Classification for Robust Visual
Tracking [97.84109669027225]
We introduce an improved proposal refinement module, Cascaded Regression-Align- Classification (CRAC)
CRAC yields new state-of-the-art performances on many benchmarks.
In experiments on seven benchmarks including OTB-2015, UAV123, NfS, VOT-2018, TrackingNet, GOT-10k and LaSOT, our CRACT exhibits very promising results in comparison with state-of-the-art competitors.
arXiv Detail & Related papers (2020-11-25T02:18:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.