Related papers: Novelty Detection and Analysis of Traffic Scenario Infrastructures in the Latent Space of a Vision Transformer-Based Triplet Autoencoder

Novelty Detection and Analysis of Traffic Scenario Infrastructures in the Latent Space of a Vision Transformer-Based Triplet Autoencoder

URL: http://arxiv.org/abs/2105.01924v1
Date: Wed, 5 May 2021 08:24:03 GMT
Title: Novelty Detection and Analysis of Traffic Scenario Infrastructures in the Latent Space of a Vision Transformer-Based Triplet Autoencoder
Authors: Jonas Wurst, Lakshman Balasubramanian, Michael Botsch and Wolfgang Utschick
Abstract summary: A method to detect novel traffic scenarios based on their infrastructure images is presented. An autoencoder triplet network provides latent representations for infrastructure images which are used for outlier detection. The presented method outperforms other state-of-the-art outlier detection approaches.
Score: 12.194597074511863
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Detecting unknown and untested scenarios is crucial for scenario-based testing. Scenario-based testing is considered to be a possible approach to validate autonomous vehicles. A traffic scenario consists of multiple components, with infrastructure being one of it. In this work, a method to detect novel traffic scenarios based on their infrastructure images is presented. An autoencoder triplet network provides latent representations for infrastructure images which are used for outlier detection. The triplet training of the network is based on the connectivity graphs of the infrastructure. By using the proposed architecture, expert-knowledge is used to shape the latent space such that it incorporates a pre-defined similarity in the neighborhood relationships of an autoencoder. An ablation study on the architecture is highlighting the importance of the triplet autoencoder combination. The best performing architecture is based on vision transformers, a convolution-free attention-based network. The presented method outperforms other state-of-the-art outlier detection approaches.

Related papers

Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything [7.452533291998081]
We propose a multi-agent trajectory prediction framework at signalized intersections dedicated to Infrastructure-to-Everything (I2XTraj) Our framework leverages dynamic graph attention to integrate knowledge from traffic signals and driving behaviors. Our approach outperforms existing methods by more than 30% in both multi-agent and single-agent scenarios.
arXiv Detail & Related papers (2025-01-23T08:23:45Z)
Cross-Domain Transfer Learning using Attention Latent Features for Multi-Agent Trajectory Prediction [4.292918274985369]
We propose a novel spatial-temporal trajectory prediction framework that performs cross-domain adaption on the attention representation of a Transformer-based model. A graph convolutional network is also integrated to construct dynamic graph feature embeddings that accurately model the complex spatial-temporal interactions between the multi-agent vehicles.
arXiv Detail & Related papers (2024-11-09T06:39:44Z)
Neural Semantic Map-Learning for Autonomous Vehicles [85.8425492858912]
We present a mapping system that fuses local submaps gathered from a fleet of vehicles at a central instance to produce a coherent map of the road environment. Our method jointly aligns and merges the noisy and incomplete local submaps using a scene-specific Neural Signed Distance Field. We leverage memory-efficient sparse feature-grids to scale to large areas and introduce a confidence score to model uncertainty in scene reconstruction.
arXiv Detail & Related papers (2024-10-10T10:10:03Z)
Traffic Light Recognition using Convolutional Neural Networks: A Survey [4.451479907610764]
We conduct a comprehensive survey and analysis of traffic light recognition methods that use convolutional neural networks (CNNs) Based on an underlying architecture, we cluster methods into three major groups. We describe the most important works in each cluster, discuss the usage of the datasets, and identify research gaps.
arXiv Detail & Related papers (2023-09-05T11:50:38Z)
Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection [54.041049052843604]
We present STEMD, a novel end-to-end framework that enhances the DETR-like paradigm for multi-frame 3D object detection. First, to model the inter-object spatial interaction and complex temporal dependencies, we introduce the spatial-temporal graph attention network. Finally, it poses a challenge for the network to distinguish between the positive query and other highly similar queries that are not the best match.
arXiv Detail & Related papers (2023-07-01T13:53:14Z)
OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Mapping [84.65114565766596]
We present OpenLane-V2, the first dataset on topology reasoning for traffic scene structure. OpenLane-V2 consists of 2,000 annotated road scenes that describe traffic elements and their correlation to the lanes. We evaluate various state-of-the-art methods, and present their quantitative and qualitative results on OpenLane-V2 to indicate future avenues for investigating topology reasoning in traffic scenes.
arXiv Detail & Related papers (2023-04-20T16:31:22Z)
Federated Deep Learning Meets Autonomous Vehicle Perception: Design and Verification [168.67190934250868]
Federated learning empowered connected autonomous vehicle (FLCAV) has been proposed. FLCAV preserves privacy while reducing communication and annotation costs. It is challenging to determine the network resources and road sensor poses for multi-stage training.
arXiv Detail & Related papers (2022-06-03T23:55:45Z)
Collaborative 3D Object Detection for Automatic Vehicle Systems via Learnable Communications [8.633120731620307]
We propose a novel collaborative 3D object detection framework that consists of three components. Experiment results and bandwidth usage analysis demonstrate that our approach can save communication and computation costs.
arXiv Detail & Related papers (2022-05-24T07:17:32Z)
A Hierarchical Terminal Recognition Approach based on Network Traffic Analysis [0.48298211429517085]
We propose a hierarchical terminal recognition approach that applies the details of grid data. We have formed a two-level model structure by segmenting the grid data. Through the selection and reconstruction of features, we combine three algorithms to achieve accurate identification of terminal types.
arXiv Detail & Related papers (2022-04-16T05:33:01Z)
Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection [85.11649974840758]
3D object detection networks tend to be biased towards the data they are trained on. We propose a single-frame approach for source-free, unsupervised domain adaptation of lidar-based 3D object detectors.
arXiv Detail & Related papers (2021-11-30T18:42:42Z)
MD-CSDNetwork: Multi-Domain Cross Stitched Network for Deepfake Detection [80.83725644958633]
Current deepfake generation methods leave discriminative artifacts in the frequency spectrum of fake images and videos. We present a novel approach, termed as MD-CSDNetwork, for combining the features in the spatial and frequency domains to mine a shared discriminative representation.
arXiv Detail & Related papers (2021-09-15T14:11:53Z)
Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method [7.557470133155959]
We develop a semi-supervised convolutional autoencoder based framework that only requires railway track images without prior knowledge on the foreign objects in the training process. The proposed framework is useful for data analytics via the train Internet-of-Things (IoT) systems.
arXiv Detail & Related papers (2021-08-05T07:32:23Z)

This list is automatically generated from the titles and abstracts of the papers in this site.