HM-Net: A Regression Network for Object Center Detection and Tracking on
Wide Area Motion Imagery
- URL: http://arxiv.org/abs/2110.09881v1
- Date: Tue, 19 Oct 2021 11:56:30 GMT
- Title: HM-Net: A Regression Network for Object Center Detection and Tracking on
Wide Area Motion Imagery
- Authors: Hakki Motorcu, Hasan F. Ates, H. Fatih Ugurdag, and Bahadir Gunturk
- Abstract summary: We present our deep neural network-based combined object detection and tracking model, namely, Heat Map Network (HM-Net)
HM-Net is significantly faster than state-of-the-art frame differencing and background subtraction-based methods.
It outperforms state-of-the-art WAMI moving object detection and tracking methods on WPAFB dataset.
- Score: 1.2249546377051437
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Wide Area Motion Imagery (WAMI) yields high resolution images with a large
number of extremely small objects. Target objects have large spatial
displacements throughout consecutive frames. This nature of WAMI images makes
object tracking and detection challenging. In this paper, we present our deep
neural network-based combined object detection and tracking model, namely, Heat
Map Network (HM-Net). HM-Net is significantly faster than state-of-the-art
frame differencing and background subtraction-based methods, without
compromising detection and tracking performances. HM-Net follows object
center-based joint detection and tracking paradigm. Simple heat map-based
predictions support unlimited number of simultaneous detections. The proposed
method uses two consecutive frames and the object detection heat map obtained
from the previous frame as input, which helps HM-Net monitor spatio-temporal
changes between frames and keeps track of previously predicted objects.
Although reuse of prior object detection heat map acts as a vital
feedback-based memory element, it can lead to unintended surge of false
positive detections. To increase robustness of the method against false
positives and to eliminate low confidence detections, HM-Net employs novel
feedback filters and advanced data augmentations. HM-Net outperforms
state-of-the-art WAMI moving object detection and tracking methods on WPAFB
dataset with its 96.2% F1 and 94.4% mAP detection scores, while achieving a
61.8% mAP tracking score on the same dataset.
Related papers
- UnsMOT: Unified Framework for Unsupervised Multi-Object Tracking with
Geometric Topology Guidance [6.577227592760559]
UnsMOT is a novel framework that combines appearance and motion features of objects with geometric information to provide more accurate tracking.
Experimental results show remarkable performance in terms of HOTA, IDF1, and MOTA metrics in comparison with state-of-the-art methods.
arXiv Detail & Related papers (2023-09-03T04:58:12Z) - ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every
Detection Box [81.45219802386444]
Multi-object tracking (MOT) aims at estimating bounding boxes and identities of objects across video frames.
We propose a hierarchical data association strategy to mine the true objects in low-score detection boxes.
In 3D scenarios, it is much easier for the tracker to predict object velocities in the world coordinate.
arXiv Detail & Related papers (2023-03-27T15:35:21Z) - Real-time Multi-Object Tracking Based on Bi-directional Matching [0.0]
This study offers a bi-directional matching algorithm for multi-object tracking.
A stranded area is used in the matching algorithm to temporarily store the objects that fail to be tracked.
In the MOT17 challenge, the proposed algorithm achieves 63.4% MOTA, 55.3% IDF1, and 20.1 FPS tracking speed.
arXiv Detail & Related papers (2023-03-15T08:38:08Z) - DSRRTracker: Dynamic Search Region Refinement for Attention-based
Siamese Multi-Object Tracking [13.104037155691644]
We propose an end-to-end MOT method, with a Gaussian filter-inspired dynamic search region refinement module.
Our method can achieve the state-of-the-art performance with reasonable speed.
arXiv Detail & Related papers (2022-03-21T04:14:06Z) - FMODetect: Robust Detection and Trajectory Estimation of Fast Moving
Objects [110.29738581961955]
We propose the first learning-based approach for detection and trajectory estimation of fast moving objects.
The proposed method first detects all fast moving objects as a truncated distance function to the trajectory.
For the sharp appearance estimation, we propose an energy minimization based deblurring.
arXiv Detail & Related papers (2020-12-15T11:05:34Z) - DS-Net: Dynamic Spatiotemporal Network for Video Salient Object
Detection [78.04869214450963]
We propose a novel dynamic temporal-temporal network (DSNet) for more effective fusion of temporal and spatial information.
We show that the proposed method achieves superior performance than state-of-the-art algorithms.
arXiv Detail & Related papers (2020-12-09T06:42:30Z) - Learning Spatio-Appearance Memory Network for High-Performance Visual
Tracking [79.80401607146987]
Existing object tracking usually learns a bounding-box based template to match visual targets across frames, which cannot accurately learn a pixel-wise representation.
This paper presents a novel segmentation-based tracking architecture, which is equipped with a local-temporal memory network to learn accurate-temporal correspondence.
arXiv Detail & Related papers (2020-09-21T08:12:02Z) - 3D Object Detection and Tracking Based on Streaming Data [9.085584050311178]
We set up a dual-way network for 3D object detection based on ons, and then propagate predictions to non-key frames through a motion based algorithm guided by temporal information.
Our framework is not only shown to have significant improvements compared with frame-by-frame paradigm, but also proven to produce competitive results on KITTI Object Tracking Benchmark.
arXiv Detail & Related papers (2020-09-14T03:15:41Z) - End-to-End Multi-Object Tracking with Global Response Map [23.755882375664875]
We present a completely end-to-end approach that takes image-sequence/video as input and outputs directly the located and tracked objects of learned types.
Specifically, with our introduced multi-object representation strategy, a global response map can be accurately generated over frames.
Experimental results based on the MOT16 and MOT17 benchmarks show that our proposed on-line tracker achieved state-of-the-art performance on several tracking metrics.
arXiv Detail & Related papers (2020-07-13T12:30:49Z) - Depthwise Non-local Module for Fast Salient Object Detection Using a
Single Thread [136.2224792151324]
We propose a new deep learning algorithm for fast salient object detection.
The proposed algorithm achieves competitive accuracy and high inference efficiency simultaneously with a single CPU thread.
arXiv Detail & Related papers (2020-01-22T15:23:48Z) - NETNet: Neighbor Erasing and Transferring Network for Better Single Shot
Object Detection [170.30694322460045]
We propose a new Neighbor Erasing and Transferring (NET) mechanism to reconfigure the pyramid features and explore scale-aware features.
A single-shot network called NETNet is constructed for scale-aware object detection.
arXiv Detail & Related papers (2020-01-18T15:21:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.