Simultaneous Clutter Detection and Semantic Segmentation of Moving
Objects for Automotive Radar Data
- URL: http://arxiv.org/abs/2311.07247v2
- Date: Tue, 14 Nov 2023 07:36:39 GMT
- Title: Simultaneous Clutter Detection and Semantic Segmentation of Moving
Objects for Automotive Radar Data
- Authors: Johannes Kopp, Dominik Kellner, Aldi Piroli, Vinzenz Dallabetta, Klaus
Dietmayer
- Abstract summary: Radar sensors are an important part of the environment perception system of autonomous vehicles.
One of the first steps during the processing of radar point clouds is often the detection of clutter.
Another common objective is the semantic segmentation of moving road users.
We show that our setup is highly effective and outperforms every existing network for semantic segmentation on the RadarScenes dataset.
- Score: 12.96486891333286
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The unique properties of radar sensors, such as their robustness to adverse
weather conditions, make them an important part of the environment perception
system of autonomous vehicles. One of the first steps during the processing of
radar point clouds is often the detection of clutter, i.e. erroneous points
that do not correspond to real objects. Another common objective is the
semantic segmentation of moving road users. These two problems are handled
strictly separate from each other in literature. The employed neural networks
are always focused entirely on only one of the tasks. In contrast to this, we
examine ways to solve both tasks at the same time with a single jointly used
model. In addition to a new augmented multi-head architecture, we also devise a
method to represent a network's predictions for the two tasks with only one
output value. This novel approach allows us to solve the tasks simultaneously
with the same inference time as a conventional task-specific model. In an
extensive evaluation, we show that our setup is highly effective and
outperforms every existing network for semantic segmentation on the RadarScenes
dataset.
Related papers
- DeTra: A Unified Model for Object Detection and Trajectory Forecasting [68.85128937305697]
Our approach formulates the union of the two tasks as a trajectory refinement problem.
To tackle this unified task, we design a refinement transformer that infers the presence, pose, and multi-modal future behaviors of objects.
In our experiments, we observe that ourmodel outperforms the state-of-the-art on Argoverse 2 Sensor and Open dataset.
arXiv Detail & Related papers (2024-06-06T18:12:04Z) - Traffic Light Recognition using Convolutional Neural Networks: A Survey [4.451479907610764]
We conduct a comprehensive survey and analysis of traffic light recognition methods that use convolutional neural networks (CNNs)
Based on an underlying architecture, we cluster methods into three major groups.
We describe the most important works in each cluster, discuss the usage of the datasets, and identify research gaps.
arXiv Detail & Related papers (2023-09-05T11:50:38Z) - A Dynamic Feature Interaction Framework for Multi-task Visual Perception [100.98434079696268]
We devise an efficient unified framework to solve multiple common perception tasks.
These tasks include instance segmentation, semantic segmentation, monocular 3D detection, and depth estimation.
Our proposed framework, termed D2BNet, demonstrates a unique approach to parameter-efficient predictions for multi-task perception.
arXiv Detail & Related papers (2023-06-08T09:24:46Z) - ALSO: Automotive Lidar Self-supervision by Occupancy estimation [70.70557577874155]
We propose a new self-supervised method for pre-training the backbone of deep perception models operating on point clouds.
The core idea is to train the model on a pretext task which is the reconstruction of the surface on which the 3D points are sampled.
The intuition is that if the network is able to reconstruct the scene surface, given only sparse input points, then it probably also captures some fragments of semantic information.
arXiv Detail & Related papers (2022-12-12T13:10:19Z) - Multitask Network for Joint Object Detection, Semantic Segmentation and
Human Pose Estimation in Vehicle Occupancy Monitoring [0.0]
Multitask Detection, neural Pose and Estimation Network (DSPM)
We propose our Multitask Detection, neural Pose and Estimation Network (DSPM)
Our architecture allows a flexible combination of the three mentioned tasks during a simple end-to-end training.
We perform comprehensive evaluations on the public datasets SVIRO and TiCaM in order to demonstrate the superior performance.
arXiv Detail & Related papers (2022-05-03T14:11:18Z) - SegmentMeIfYouCan: A Benchmark for Anomaly Segmentation [111.61261419566908]
Deep neural networks (DNNs) are usually trained on a closed set of semantic classes.
They are ill-equipped to handle previously-unseen objects.
detecting and localizing such objects is crucial for safety-critical applications such as perception for automated driving.
arXiv Detail & Related papers (2021-04-30T07:58:19Z) - Multi-View Radar Semantic Segmentation [3.2093811507874768]
Automotive radars are low-cost active sensors that measure properties of surrounding objects.
They are seldom used for scene understanding due to the size and complexity of radar raw data.
We propose several novel architectures, and their associated losses, which analyse multiple "views" of the range-angle-Doppler radar tensor to segment it semantically.
arXiv Detail & Related papers (2021-03-30T09:56:41Z) - Multi-object Tracking with a Hierarchical Single-branch Network [31.680667324595557]
We propose an online multi-object tracking framework based on a hierarchical single-branch network.
Our novel iHOIM loss function unifies the objectives of the two sub-tasks and encourages better detection performance.
Experimental results on MOT16 and MOT20 datasets show that we can achieve state-of-the-art tracking performance.
arXiv Detail & Related papers (2021-01-06T12:14:58Z) - LiDAR-based Panoptic Segmentation via Dynamic Shifting Network [56.71765153629892]
LiDAR-based panoptic segmentation aims to parse both objects and scenes in a unified manner.
We propose the Dynamic Shifting Network (DS-Net), which serves as an effective panoptic segmentation framework in the point cloud realm.
Our proposed DS-Net achieves superior accuracies over current state-of-the-art methods.
arXiv Detail & Related papers (2020-11-24T08:44:46Z) - FairMOT: On the Fairness of Detection and Re-Identification in Multiple
Object Tracking [92.48078680697311]
Multi-object tracking (MOT) is an important problem in computer vision.
We present a simple yet effective approach termed as FairMOT based on the anchor-free object detection architecture CenterNet.
The approach achieves high accuracy for both detection and tracking.
arXiv Detail & Related papers (2020-04-04T08:18:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.