Online Multi-Object Tracking and Segmentation with GMPHD Filter and
Mask-based Affinity Fusion
- URL: http://arxiv.org/abs/2009.00100v2
- Date: Fri, 11 Jun 2021 10:55:36 GMT
- Title: Online Multi-Object Tracking and Segmentation with GMPHD Filter and
Mask-based Affinity Fusion
- Authors: Young-min Song, Young-chul Yoon, Kwangjin Yoon, Moongu Jeon,
Seong-Whan Lee, Witold Pedrycz
- Abstract summary: We propose a fully online multi-object tracking and segmentation (MOTS) method that uses instance segmentation results as an input.
The proposed method is based on the Gaussian mixture probability hypothesis density (GMPHD) filter, a hierarchical data association (HDA), and a mask-based affinity fusion (MAF) model.
In the experiments on the two popular MOTS datasets, the key modules show some improvements.
- Score: 79.87371506464454
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we propose a highly practical fully online multi-object
tracking and segmentation (MOTS) method that uses instance segmentation results
as an input. The proposed method is based on the Gaussian mixture probability
hypothesis density (GMPHD) filter, a hierarchical data association (HDA), and a
mask-based affinity fusion (MAF) model to achieve high-performance online
tracking. The HDA consists of two associations: segment-to-track and
track-to-track associations. One affinity, for position and motion, is computed
by using the GMPHD filter, and the other affinity, for appearance is computed
by using the responses from a single object tracker such as a kernalized
correlation filter. These two affinities are simply fused by using a
score-level fusion method such as min-max normalization referred to as MAF. In
addition, to reduce the number of false positive segments, we adopt mask
IoU-based merging (mask merging). The proposed MOTS framework with the key
modules: HDA, MAF, and mask merging, is easily extensible to simultaneously
track multiple types of objects with CPU only execution in parallel processing.
In addition, the developed framework only requires simple parameter tuning
unlike many existing MOTS methods that need intensive hyperparameter
optimization. In the experiments on the two popular MOTS datasets, the key
modules show some improvements. For instance, ID-switch decreases by more than
half compared to a baseline method in the training sets. In conclusion, our
tracker achieves state-of-the-art MOTS performance in the test sets.
Related papers
- Bridge the Points: Graph-based Few-shot Segment Anything Semantically [79.1519244940518]
Recent advancements in pre-training techniques have enhanced the capabilities of vision foundation models.
Recent studies extend the SAM to Few-shot Semantic segmentation (FSS)
We propose a simple yet effective approach based on graph analysis.
arXiv Detail & Related papers (2024-10-09T15:02:28Z) - Building Brain Tumor Segmentation Networks with User-Assisted Filter Estimation and Selection [42.62139206176152]
We present Multi-Step (MS) FLIM, a user-assisted approach to estimating and selecting the most relevant filters from multiple FLIM executions.
MS-FLIM is used only for the first convolutional layer, and the results already indicate improvement over FLIM.
We build a simple U-shaped encoder-decoder network, named sU-Net, for glioblastoma segmentation using T1Gd and FLAIR MRI scans.
arXiv Detail & Related papers (2024-03-19T14:11:26Z) - Filter Pruning for Efficient CNNs via Knowledge-driven Differential
Filter Sampler [103.97487121678276]
Filter pruning simultaneously accelerates the computation and reduces the memory overhead of CNNs.
We propose a novel Knowledge-driven Differential Filter Sampler(KDFS) with Masked Filter Modeling(MFM) framework for filter pruning.
arXiv Detail & Related papers (2023-07-01T02:28:41Z) - Interaction-Aware Labeled Multi-Bernoulli Filter [5.255783459833821]
We present a novel approach to incorporate target interactions within the prediction step of an RFS-based multi-target filter.
The method has been developed for two practical applications of tracking a coordinated swarm and vehicles.
arXiv Detail & Related papers (2022-04-19T04:23:32Z) - SODAR: Segmenting Objects by DynamicallyAggregating Neighboring Mask
Representations [90.8752454643737]
Recent state-of-the-art one-stage instance segmentation model SOLO divides the input image into a grid and directly predicts per grid cell object masks with fully-convolutional networks.
We observe SOLO generates similar masks for an object at nearby grid cells, and these neighboring predictions can complement each other as some may better segment certain object part.
Motivated by the observed gap, we develop a novel learning-based aggregation method that improves upon SOLO by leveraging the rich neighboring information.
arXiv Detail & Related papers (2022-02-15T13:53:03Z) - Segment as Points for Efficient Online Multi-Object Tracking and
Segmentation [66.03023110058464]
We propose a highly effective method for learning instance embeddings based on segments by converting the compact image representation to un-ordered 2D point cloud representation.
Our method generates a new tracking-by-points paradigm where discriminative instance embeddings are learned from randomly selected points rather than images.
The resulting online MOTS framework, named PointTrack, surpasses all the state-of-the-art methods by large margins.
arXiv Detail & Related papers (2020-07-03T08:29:35Z) - BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation [103.74690082121079]
In this work, we achieve improved mask prediction by effectively combining instance-level information with semantic information with lower-level fine-granularity.
Our main contribution is a blender module which draws inspiration from both top-down and bottom-up instance segmentation approaches.
BlendMask can effectively predict dense per-pixel position-sensitive instance features with very few channels, and learn attention maps for each instance with merely one convolution layer.
arXiv Detail & Related papers (2020-01-02T03:30:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.