Person Monitoring by Full Body Tracking in Uniform Crowd Environment
- URL: http://arxiv.org/abs/2209.01274v1
- Date: Fri, 2 Sep 2022 21:21:47 GMT
- Title: Person Monitoring by Full Body Tracking in Uniform Crowd Environment
- Authors: Zhibo Zhang, Omar Alremeithi, Maryam Almheiri, Marwa Albeshr,
Xiaoxiong Zhang, Sajid Javed, Naoufel Werghi
- Abstract summary: In the Middle East, uniform crowd environments are the norm which challenges state-of-the-art trackers.
In this work, we develop an annotated dataset with one specific target per video in a uniform crowd environment.
The dataset was used in evaluating and fine-tuning a state-of-the-art tracker.
- Score: 10.71804432329509
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Full body trackers are utilized for surveillance and security purposes, such
as person-tracking robots. In the Middle East, uniform crowd environments are
the norm which challenges state-of-the-art trackers. Despite tremendous
improvements in tracker technology documented in the past literature, these
trackers have not been trained using a dataset that captures these
environments. In this work, we develop an annotated dataset with one specific
target per video in a uniform crowd environment. The dataset was generated in
four different scenarios where mainly the target was moving alongside the
crowd, sometimes occluding with them, and other times the camera's view of the
target is blocked by the crowd for a short period. After the annotations, it
was used in evaluating and fine-tuning a state-of-the-art tracker. Our results
have shown that the fine-tuned tracker performed better on the evaluation
dataset based on two quantitative evaluation metrics, compared to the initial
pre-trained tracker.
Related papers
- RTracker: Recoverable Tracking via PN Tree Structured Memory [71.05904715104411]
We propose a recoverable tracking framework, RTracker, that uses a tree-structured memory to dynamically associate a tracker and a detector to enable self-recovery.
Specifically, we propose a Positive-Negative Tree-structured memory to chronologically store and maintain positive and negative target samples.
Our core idea is to use the support samples of positive and negative target categories to establish a relative distance-based criterion for a reliable assessment of target loss.
arXiv Detail & Related papers (2024-03-28T08:54:40Z) - OmniTracker: Unifying Object Tracking by Tracking-with-Detection [119.51012668709502]
OmniTracker is presented to resolve all the tracking tasks with a fully shared network architecture, model weights, and inference pipeline.
Experiments on 7 tracking datasets, including LaSOT, TrackingNet, DAVIS16-17, MOT17, MOTS20, and YTVIS19, demonstrate that OmniTracker achieves on-par or even better results than both task-specific and unified tracking models.
arXiv Detail & Related papers (2023-03-21T17:59:57Z) - SOMPT22: A Surveillance Oriented Multi-Pedestrian Tracking Dataset [5.962184741057505]
We introduce SOMPT22 dataset; a new set for multi person tracking with annotated short videos captured from static cameras located on poles with 6-8 meters in height positioned for city surveillance.
We analyze MOT trackers classified as one-shot and two-stage with respect to the way of use of detection and reID networks on this new dataset.
The experimental results of our new dataset indicate that SOTA is still far from high efficiency, and single-shot trackers are good candidates to unify fast execution and accuracy with competitive performance.
arXiv Detail & Related papers (2022-08-04T11:09:19Z) - Robot Person Following in Uniform Crowd Environment [13.708992331117281]
Person-tracking robots have many applications, such as in security, elderly care, and socializing robots.
In this work, we focus on improving the perceptivity of a robot for a person following task by developing a robust and real-time applicable object tracker.
We present a new robot person tracking system with a new RGB-D tracker, Deep Tracking with RGB-D (DTRD) that is resilient to tricky challenges introduced by the uniform crowd environment.
arXiv Detail & Related papers (2022-05-21T10:20:14Z) - Unified Transformer Tracker for Object Tracking [58.65901124158068]
We present the Unified Transformer Tracker (UTT) to address tracking problems in different scenarios with one paradigm.
A track transformer is developed in our UTT to track the target in both Single Object Tracking (SOT) and Multiple Object Tracking (MOT)
arXiv Detail & Related papers (2022-03-29T01:38:49Z) - Benchmarking Deep Trackers on Aerial Videos [5.414308305392762]
In this paper, we compare ten trackers based on deep learning techniques on four aerial datasets.
We choose top performing trackers utilizing different approaches, specifically tracking by detection, discriminative correlation filters, Siamese networks and reinforcement learning.
Our findings indicate that the trackers perform significantly worse in aerial datasets compared to standard ground level videos.
arXiv Detail & Related papers (2021-03-24T01:45:19Z) - Unsupervised Deep Representation Learning for Real-Time Tracking [137.69689503237893]
We propose an unsupervised learning method for visual tracking.
The motivation of our unsupervised learning is that a robust tracker should be effective in bidirectional tracking.
We build our framework on a Siamese correlation filter network, and propose a multi-frame validation scheme and a cost-sensitive loss to facilitate unsupervised learning.
arXiv Detail & Related papers (2020-07-22T08:23:12Z) - Tracking-by-Counting: Using Network Flows on Crowd Density Maps for
Tracking Multiple Targets [96.98888948518815]
State-of-the-art multi-object tracking(MOT) methods follow the tracking-by-detection paradigm.
We propose a new MOT paradigm, tracking-by-counting, tailored for crowded scenes.
arXiv Detail & Related papers (2020-07-18T19:51:53Z) - TAO: A Large-Scale Benchmark for Tracking Any Object [95.87310116010185]
Tracking Any Object dataset consists of 2,907 high resolution videos, captured in diverse environments, which are half a minute long on average.
We ask annotators to label objects that move at any point in the video, and give names to them post factum.
Our vocabulary is both significantly larger and qualitatively different from existing tracking datasets.
arXiv Detail & Related papers (2020-05-20T21:07:28Z) - Supervised and Unsupervised Detections for Multiple Object Tracking in
Traffic Scenes: A Comparative Study [11.024591739346294]
We propose a multiple object tracker, called MF-Tracker, that integrates multiple classical features (spatial and colours) and modern features (detection labels and re-identification features) in its tracking framework.
Since our tracker can work with detections coming either from unsupervised and supervised object detectors, we also investigated the impact of supervised and unsupervised detection inputs in our method.
Results show that our proposed method is performing very well in both datasets with different inputs.
arXiv Detail & Related papers (2020-03-30T17:27:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.