Multiple Object Tracking by Flowing and Fusing
- URL: http://arxiv.org/abs/2001.11180v1
- Date: Thu, 30 Jan 2020 05:17:22 GMT
- Title: Multiple Object Tracking by Flowing and Fusing
- Authors: Jimuyang Zhang, Sanping Zhou, Xin Chang, Fangbin Wan, Jinjun Wang,
Yang Wu, Dong Huang
- Abstract summary: Flow-Fuse-Tracker (FFT) is a tracking approach that learns the indefinite number of target-wise motions jointly from pixel-level optical flows.
In target fusing, a FuseTracker module refines and fuses targets proposed by FlowTracker and frame-wise object detection.
As an online MOT approach, FFT produced the top MOTA of 46.3 on the 2DMOT15, 56.5 on the MOT16, and 56.5 on the MOT17 tracking benchmarks.
- Score: 31.58422046611455
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Most of Multiple Object Tracking (MOT) approaches compute individual target
features for two subtasks: estimating target-wise motions and conducting
pair-wise Re-Identification (Re-ID). Because of the indefinite number of
targets among video frames, both subtasks are very difficult to scale up
efficiently in end-to-end Deep Neural Networks (DNNs). In this paper, we design
an end-to-end DNN tracking approach, Flow-Fuse-Tracker (FFT), that addresses
the above issues with two efficient techniques: target flowing and target
fusing. Specifically, in target flowing, a FlowTracker DNN module learns the
indefinite number of target-wise motions jointly from pixel-level optical
flows. In target fusing, a FuseTracker DNN module refines and fuses targets
proposed by FlowTracker and frame-wise object detection, instead of trusting
either of the two inaccurate sources of target proposal. Because FlowTracker
can explore complex target-wise motion patterns and FuseTracker can refine and
fuse targets from FlowTracker and detectors, our approach can achieve the
state-of-the-art results on several MOT benchmarks. As an online MOT approach,
FFT produced the top MOTA of 46.3 on the 2DMOT15, 56.5 on the MOT16, and 56.5
on the MOT17 tracking benchmarks, surpassing all the online and offline methods
in existing publications.
Related papers
- FlowTrack: Point-level Flow Network for 3D Single Object Tracking [24.93453336062018]
3D single object tracking (SOT) is a crucial task in fields of mobile robotics and autonomous driving.
Traditional motion-based approaches achieve target tracking by estimating the relative movement of target between two consecutive frames.
We propose a point-level flow method with multi-frame information for 3D SOT task, called FlowTrack.
arXiv Detail & Related papers (2024-07-02T05:31:34Z) - SparseTrack: Multi-Object Tracking by Performing Scene Decomposition
based on Pseudo-Depth [84.64121608109087]
We propose a pseudo-depth estimation method for obtaining the relative depth of targets from 2D images.
Secondly, we design a depth cascading matching (DCM) algorithm, which can use the obtained depth information to convert a dense target set into multiple sparse target subsets.
By integrating the pseudo-depth method and the DCM strategy into the data association process, we propose a new tracker, called SparseTrack.
arXiv Detail & Related papers (2023-06-08T14:36:10Z) - Multi-Object Tracking by Iteratively Associating Detections with Uniform
Appearance for Trawl-Based Fishing Bycatch Monitoring [22.228127377617028]
The aim of in-trawl catch monitoring for use in fishing operations is to detect, track and classify fish targets in real-time from video footage.
We propose a novel MOT method, built upon an existing observation-centric tracking algorithm, by adopting a new iterative association step.
Our method offers improved performance in tracking targets with uniform appearance and outperforms state-of-the-art techniques on our underwater fish datasets as well as the MOT17 dataset.
arXiv Detail & Related papers (2023-04-10T18:55:10Z) - Minkowski Tracker: A Sparse Spatio-Temporal R-CNN for Joint Object
Detection and Tracking [53.64390261936975]
We present Minkowski Tracker, a sparse-temporal R-CNN that jointly solves object detection and tracking problems.
Inspired by region-based CNN (R-CNN), we propose to track motion as a second stage of the object detector R-CNN.
We show in large-scale experiments that the overall performance gain of our method is due to four factors.
arXiv Detail & Related papers (2022-08-22T04:47:40Z) - Unified Transformer Tracker for Object Tracking [58.65901124158068]
We present the Unified Transformer Tracker (UTT) to address tracking problems in different scenarios with one paradigm.
A track transformer is developed in our UTT to track the target in both Single Object Tracking (SOT) and Multiple Object Tracking (MOT)
arXiv Detail & Related papers (2022-03-29T01:38:49Z) - Track to Detect and Segment: An Online Multi-Object Tracker [81.15608245513208]
TraDeS is an online joint detection and tracking model, exploiting tracking clues to assist detection end-to-end.
TraDeS infers object tracking offset by a cost volume, which is used to propagate previous object features.
arXiv Detail & Related papers (2021-03-16T02:34:06Z) - SMOT: Single-Shot Multi Object Tracking [39.34493475666044]
Single-shot multi-object tracker (SMOT) is a new tracking framework that converts any single-shot detector (SSD) model into an online multiple object tracker.
On three benchmarks of object tracking: Hannah, Music Videos, and MOT17, the proposed SMOT achieves state-of-the-art performance.
arXiv Detail & Related papers (2020-10-30T02:46:54Z) - Simultaneous Detection and Tracking with Motion Modelling for Multiple
Object Tracking [94.24393546459424]
We introduce Deep Motion Modeling Network (DMM-Net) that can estimate multiple objects' motion parameters to perform joint detection and association.
DMM-Net achieves PR-MOTA score of 12.80 @ 120+ fps for the popular UA-DETRAC challenge, which is better performance and orders of magnitude faster.
We also contribute a synthetic large-scale public dataset Omni-MOT for vehicle tracking that provides precise ground-truth annotations.
arXiv Detail & Related papers (2020-08-20T08:05:33Z) - Tracking-by-Counting: Using Network Flows on Crowd Density Maps for
Tracking Multiple Targets [96.98888948518815]
State-of-the-art multi-object tracking(MOT) methods follow the tracking-by-detection paradigm.
We propose a new MOT paradigm, tracking-by-counting, tailored for crowded scenes.
arXiv Detail & Related papers (2020-07-18T19:51:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.