Faster and Simpler Siamese Network for Single Object Tracking
- URL: http://arxiv.org/abs/2105.03049v1
- Date: Fri, 7 May 2021 03:37:19 GMT
- Title: Faster and Simpler Siamese Network for Single Object Tracking
- Authors: Shaokui Jiang, Baile Xu, Jian Zhao, Furao Shen
- Abstract summary: Single object tracking (SOT) is one of the most important tasks in computer vision.
Siamese networks have been proposed and perform better than most of the traditional methods.
Most of these methods could only meet the needs of real-time object tracking in ideal environments.
- Score: 9.365739363728983
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Single object tracking (SOT) is currently one of the most important tasks in
computer vision. With the development of the deep network and the release for a
series of large scale datasets for single object tracking, siamese networks
have been proposed and perform better than most of the traditional methods.
However, recent siamese networks get deeper and slower to obtain better
performance. Most of these methods could only meet the needs of real-time
object tracking in ideal environments. In order to achieve a better balance
between efficiency and accuracy, we propose a simpler siamese network for
single object tracking, which runs fast in poor hardware configurations while
remaining an excellent accuracy. We use a more efficient regression method to
compute the location of the tracked object in a shorter time without losing
much precision. For improving the accuracy and speeding up the training
progress, we introduce the Squeeze-and-excitation (SE) network into the feature
extractor. In this paper, we compare the proposed method with some
state-of-the-art trackers and analysis their performances. Using our method, a
siamese network could be trained with shorter time and less data. The fast
processing speed enables combining object tracking with object detection or
other tasks in real time.
Related papers
- Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking [52.04679257903805]
Joint Detection and Embedding (JDE) trackers have demonstrated excellent performance in Multi-Object Tracking (MOT) tasks.
Our tracker, named TCBTrack, achieves state-of-the-art performance on multiple public benchmarks.
arXiv Detail & Related papers (2024-07-19T07:48:45Z) - Optimizing Sensor Network Design for Multiple Coverage [0.9668407688201359]
We introduce a new objective function for the greedy (next-best-view) algorithm to design efficient and robust sensor networks.
We also introduce a Deep Learning model to accelerate the algorithm for near real-time computations.
arXiv Detail & Related papers (2024-05-15T05:13:20Z) - Exploring Dynamic Transformer for Efficient Object Tracking [58.120191254379854]
We propose DyTrack, a dynamic transformer framework for efficient tracking.
DyTrack automatically learns to configure proper reasoning routes for various inputs, gaining better utilization of the available computational budget.
Experiments on multiple benchmarks demonstrate that DyTrack achieves promising speed-precision trade-offs with only a single model.
arXiv Detail & Related papers (2024-03-26T12:31:58Z) - PNAS-MOT: Multi-Modal Object Tracking with Pareto Neural Architecture Search [64.28335667655129]
Multiple object tracking is a critical task in autonomous driving.
As tracking accuracy improves, neural networks become increasingly complex, posing challenges for their practical application in real driving scenarios due to the high level of latency.
In this paper, we explore the use of the neural architecture search (NAS) methods to search for efficient architectures for tracking, aiming for low real-time latency while maintaining relatively high accuracy.
arXiv Detail & Related papers (2024-03-23T04:18:49Z) - Transformer-based assignment decision network for multiple object
tracking [0.0]
We introduce Transformer-based Assignment Decision Network (TADN) that tackles data association without the need of explicit optimization during inference.
Our proposed approach outperforms the state-of-the-art in most evaluation metrics despite its simple nature as a tracker.
arXiv Detail & Related papers (2022-08-06T19:47:32Z) - Task-Adaptive Neural Network Retrieval with Meta-Contrastive Learning [34.27089256930098]
We propose a novel neural network retrieval method, which retrieves the most optimal pre-trained network for a given task.
We train this framework by meta-learning a cross-modal latent space with contrastive loss, to maximize the similarity between a dataset and a network.
We validate the efficacy of our method on ten real-world datasets, against existing NAS baselines.
arXiv Detail & Related papers (2021-03-02T06:30:51Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - FairMOT: On the Fairness of Detection and Re-Identification in Multiple
Object Tracking [92.48078680697311]
Multi-object tracking (MOT) is an important problem in computer vision.
We present a simple yet effective approach termed as FairMOT based on the anchor-free object detection architecture CenterNet.
The approach achieves high accuracy for both detection and tracking.
arXiv Detail & Related papers (2020-04-04T08:18:00Z) - RetinaTrack: Online Single Stage Joint Detection and Tracking [22.351109024452462]
We focus on the tracking-by-detection paradigm for autonomous driving where both tasks are mission critical.
We propose a conceptually simple and efficient joint model of detection and tracking, called RetinaTrack, which modifies the popular single stage RetinaNet approach.
arXiv Detail & Related papers (2020-03-30T23:46:29Z) - Depthwise Non-local Module for Fast Salient Object Detection Using a
Single Thread [136.2224792151324]
We propose a new deep learning algorithm for fast salient object detection.
The proposed algorithm achieves competitive accuracy and high inference efficiency simultaneously with a single CPU thread.
arXiv Detail & Related papers (2020-01-22T15:23:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.