Training-Set Distillation for Real-Time UAV Object Tracking
- URL: http://arxiv.org/abs/2003.05326v1
- Date: Wed, 11 Mar 2020 14:28:09 GMT
- Title: Training-Set Distillation for Real-Time UAV Object Tracking
- Authors: Fan Li, Changhong Fu, Fuling Lin, Yiming Li, Peng Lu
- Abstract summary: Correlation filter (CF) has recently exhibited promising performance in visual object tracking for unmanned aerial vehicle (UAV)
In this work, a novel time slot-based distillation approach is proposed to efficiently and effectively optimize the training-set's quality on the fly.
Comprehensive tests on two well-known UAV benchmarks prove the effectiveness of our method with real-time speed on a single CPU.
- Score: 23.04319685796588
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Correlation filter (CF) has recently exhibited promising performance in
visual object tracking for unmanned aerial vehicle (UAV). Such online learning
method heavily depends on the quality of the training-set, yet complicated
aerial scenarios like occlusion or out of view can reduce its reliability. In
this work, a novel time slot-based distillation approach is proposed to
efficiently and effectively optimize the training-set's quality on the fly. A
cooperative energy minimization function is established to score the historical
samples adaptively. To accelerate the scoring process, frames with high
confident tracking results are employed as the keyframes to divide the tracking
process into multiple time slots. After the establishment of a new slot, the
weighted fusion of the previous samples generates one key-sample, in order to
reduce the number of samples to be scored. Besides, when the current time slot
exceeds the maximum frame number, which can be scored, the sample with the
lowest score will be discarded. Consequently, the training-set can be
efficiently and reliably distilled. Comprehensive tests on two well-known UAV
benchmarks prove the effectiveness of our method with real-time speed on a
single CPU.
Related papers
- Towards Discriminative Representations with Contrastive Instances for
Real-Time UAV Tracking [5.557099240958562]
Discriminative correlation filters (DCF)-based trackers can yield high efficiency on a single CPU but with inferior precision.
Lightweight Deep learning (DL)-based trackers can achieve a good balance between efficiency and precision but performance gains are limited by the compression rate.
This paper aims to enhance the discriminative power of feature representations from a new feature-learning perspective.
arXiv Detail & Related papers (2023-08-22T13:58:45Z) - Learning Disentangled Representation with Mutual Information
Maximization for Real-Time UAV Tracking [1.0541541376305243]
This paper exploits disentangled representation with mutual information (DR-MIM) to improve precision and efficiency for UAV tracking.
Our DR-MIM tracker significantly outperforms state-of-the-art UAV tracking methods.
arXiv Detail & Related papers (2023-08-20T13:16:15Z) - Efficient Few-Shot Object Detection via Knowledge Inheritance [62.36414544915032]
Few-shot object detection (FSOD) aims at learning a generic detector that can adapt to unseen tasks with scarce training samples.
We present an efficient pretrain-transfer framework (PTF) baseline with no computational increment.
We also propose an adaptive length re-scaling (ALR) strategy to alleviate the vector length inconsistency between the predicted novel weights and the pretrained base weights.
arXiv Detail & Related papers (2022-03-23T06:24:31Z) - An Efficient Combinatorial Optimization Model Using Learning-to-Rank
Distillation [2.0137632982900207]
We present the learning-to-rank distillation-based COP framework, where a high-performance ranking policy can be distilled into a non-iterative, simple model.
Specifically, we employ the approximated ranking distillation to render a score-based ranking model learnable via gradient descent.
We demonstrate that a distilled model achieves comparable performance to its respective, high-performance RL, but also provides several times faster inferences.
arXiv Detail & Related papers (2021-12-24T10:52:47Z) - Label, Verify, Correct: A Simple Few Shot Object Detection Method [93.84801062680786]
We introduce a simple pseudo-labelling method to source high-quality pseudo-annotations from a training set.
We present two novel methods to improve the precision of the pseudo-labelling process.
Our method achieves state-of-the-art or second-best performance compared to existing approaches.
arXiv Detail & Related papers (2021-12-10T18:59:06Z) - Fast Variational AutoEncoder with Inverted Multi-Index for Collaborative
Filtering [59.349057602266]
Variational AutoEncoder (VAE) has been extended as a representative nonlinear method for collaborative filtering.
We propose to decompose the inner-product-based softmax probability based on the inverted multi-index.
FastVAE can outperform the state-of-the-art baselines in terms of both sampling quality and efficiency.
arXiv Detail & Related papers (2021-09-13T08:31:59Z) - Lite-FPN for Keypoint-based Monocular 3D Object Detection [18.03406686769539]
Keypoint-based monocular 3D object detection has made tremendous progress and achieved great speed-accuracy trade-off.
We propose a sort of lightweight feature pyramid network called Lite-FPN to achieve multi-scale feature fusion.
Our proposed method achieves significantly higher accuracy and frame rate at the same time.
arXiv Detail & Related papers (2021-05-01T14:44:31Z) - Few-shot Action Recognition with Prototype-centered Attentive Learning [88.10852114988829]
Prototype-centered Attentive Learning (PAL) model composed of two novel components.
First, a prototype-centered contrastive learning loss is introduced to complement the conventional query-centered learning objective.
Second, PAL integrates a attentive hybrid learning mechanism that can minimize the negative impacts of outliers.
arXiv Detail & Related papers (2021-01-20T11:48:12Z) - Multi-Scale Positive Sample Refinement for Few-Shot Object Detection [61.60255654558682]
Few-shot object detection (FSOD) helps detectors adapt to unseen classes with few training instances.
We propose a Multi-scale Positive Sample Refinement (MPSR) approach to enrich object scales in FSOD.
MPSR generates multi-scale positive samples as object pyramids and refines the prediction at various scales.
arXiv Detail & Related papers (2020-07-18T09:48:29Z) - Cascaded Regression Tracking: Towards Online Hard Distractor
Discrimination [202.2562153608092]
We propose a cascaded regression tracker with two sequential stages.
In the first stage, we filter out abundant easily-identified negative candidates.
In the second stage, a discrete sampling based ridge regression is designed to double-check the remaining ambiguous hard samples.
arXiv Detail & Related papers (2020-06-18T07:48:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.