Spatio-Temporal Point Process for Multiple Object Tracking
- URL: http://arxiv.org/abs/2302.02444v1
- Date: Sun, 5 Feb 2023 18:14:08 GMT
- Title: Spatio-Temporal Point Process for Multiple Object Tracking
- Authors: Tao Wang, Kean Chen, Weiyao Lin, John See, Zenghui Zhang, Qian Xu, and
Xia Jia
- Abstract summary: Multiple Object Tracking (MOT) focuses on modeling the relationship of detected objects among consecutive frames and merge them into different trajectories.
We propose a novel framework that can effectively predict and mask-out noisy and confusing detection results before associating objects into trajectories.
- Score: 30.041104276095624
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multiple Object Tracking (MOT) focuses on modeling the relationship of
detected objects among consecutive frames and merge them into different
trajectories. MOT remains a challenging task as noisy and confusing detection
results often hinder the final performance. Furthermore, most existing research
are focusing on improving detection algorithms and association strategies. As
such, we propose a novel framework that can effectively predict and mask-out
the noisy and confusing detection results before associating the objects into
trajectories. In particular, we formulate such "bad" detection results as a
sequence of events and adopt the spatio-temporal point process}to model such
events. Traditionally, the occurrence rate in a point process is characterized
by an explicitly defined intensity function, which depends on the prior
knowledge of some specific tasks. Thus, designing a proper model is expensive
and time-consuming, with also limited ability to generalize well. To tackle
this problem, we adopt the convolutional recurrent neural network (conv-RNN) to
instantiate the point process, where its intensity function is automatically
modeled by the training data. Furthermore, we show that our method captures
both temporal and spatial evolution, which is essential in modeling events for
MOT. Experimental results demonstrate notable improvements in addressing noisy
and confusing detection results in MOT datasets. An improved state-of-the-art
performance is achieved by incorporating our baseline MOT algorithm with the
spatio-temporal point process model.
Related papers
- DeTra: A Unified Model for Object Detection and Trajectory Forecasting [68.85128937305697]
Our approach formulates the union of the two tasks as a trajectory refinement problem.
To tackle this unified task, we design a refinement transformer that infers the presence, pose, and multi-modal future behaviors of objects.
In our experiments, we observe that ourmodel outperforms the state-of-the-art on Argoverse 2 Sensor and Open dataset.
arXiv Detail & Related papers (2024-06-06T18:12:04Z) - EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks [14.046487518350792]
Spiking Neural Networks (SNNs) operate on an event-driven through sparse spike communication.
We introduce Residual Potential Dropout (RPD) and Spike-Aware Training (SAT) to regulate potential distribution.
Our method yields a 4.4% mAP improvement on the Gen1 dataset, while requiring 38% fewer parameters and only three time steps.
arXiv Detail & Related papers (2024-03-19T09:34:11Z) - Graph Spatiotemporal Process for Multivariate Time Series Anomaly
Detection with Missing Values [67.76168547245237]
We introduce a novel framework called GST-Pro, which utilizes a graphtemporal process and anomaly scorer to detect anomalies.
Our experimental results show that the GST-Pro method can effectively detect anomalies in time series data and outperforms state-of-the-art methods.
arXiv Detail & Related papers (2024-01-11T10:10:16Z) - A Target Detection Algorithm in Traffic Scenes Based on Deep
Reinforcement Learning [2.8554857235549753]
This research presents a novel active detection model utilizing deep reinforcement learning to accurately detect traffic objects in real-world scenarios.
The model employs a deep Q-network based on LSTM-CNN that identifies and aligns target zones with specific categories of traffic objects.
Tests conducted demonstrate the model's proficiency, exhibiting exceptional precision and performance in locating traffic signal lights and speed limit signs.
arXiv Detail & Related papers (2023-12-25T04:23:30Z) - Custom DNN using Reward Modulated Inverted STDP Learning for Temporal
Pattern Recognition [0.0]
Temporal spike recognition plays a crucial role in various domains, including anomaly detection, keyword spotting and neuroscience.
This paper presents a novel algorithm for efficient temporal spike pattern recognition on sparse event series data.
arXiv Detail & Related papers (2023-07-15T18:57:27Z) - Uncovering the Missing Pattern: Unified Framework Towards Trajectory
Imputation and Prediction [60.60223171143206]
Trajectory prediction is a crucial undertaking in understanding entity movement or human behavior from observed sequences.
Current methods often assume that the observed sequences are complete while ignoring the potential for missing values.
This paper presents a unified framework, the Graph-based Conditional Variational Recurrent Neural Network (GC-VRNN), which can perform trajectory imputation and prediction simultaneously.
arXiv Detail & Related papers (2023-03-28T14:27:27Z) - Object-centric and memory-guided normality reconstruction for video
anomaly detection [56.64792194894702]
This paper addresses anomaly detection problem for videosurveillance.
Due to the inherent rarity and heterogeneity of abnormal events, the problem is viewed as a normality modeling strategy.
Our model learns object-centric normal patterns without seeing anomalous samples during training.
arXiv Detail & Related papers (2022-03-07T19:28:39Z) - STURE: Spatial-Temporal Mutual Representation Learning for Robust Data
Association in Online Multi-Object Tracking [7.562844934117318]
The proposed approach is capable of extracting more distinguishing detection and sequence representations.
It is applied to the public MOT challenge benchmarks and performs well compared with various state-of-the-art online MOT trackers.
arXiv Detail & Related papers (2022-01-18T08:52:40Z) - Learning to associate detections for real-time multiple object tracking [0.0]
This study investigates the use of artificial neural networks to learn a similarity function that can be used among detections.
The proposed tracker matches the results obtained by state-of-the-art methods, it has run 58% faster than a recent and similar method, used as baseline.
arXiv Detail & Related papers (2020-07-12T17:08:41Z) - One-Shot Object Detection without Fine-Tuning [62.39210447209698]
We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module.
We also propose novel training strategies that effectively improve detection performance.
Our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.
arXiv Detail & Related papers (2020-05-08T01:59:23Z) - ArTIST: Autoregressive Trajectory Inpainting and Scoring for Tracking [80.02322563402758]
One of the core components in online multiple object tracking (MOT) frameworks is associating new detections with existing tracklets.
We introduce a probabilistic autoregressive generative model to score tracklet proposals by directly measuring the likelihood that a tracklet represents natural motion.
arXiv Detail & Related papers (2020-04-16T06:43:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.