Related papers: A Dynamic Transformer Network for Vehicle Detection

URL: http://arxiv.org/abs/2506.02765v1
Date: Tue, 03 Jun 2025 11:29:35 GMT
Title: A Dynamic Transformer Network for Vehicle Detection
Authors: Chunwei Tian, Kai Liu, Bob Zhang, Zhixiang Huang, Chia-Wen Lin, David Zhang
Abstract summary: We present a dynamic Transformer network for vehicle detection (DTNet). DTNet utilizes a dynamic convolution to guide a deep network to dynamically generate weights to enhance adaptability of an obtained detector. To overcome the drawback of difference in an image account, a translation-variant convolution relies on spatial location information to refine obtained structural information for vehicle detection.
Score: 57.4144097001218
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Stable consumer electronic systems can assist traffic better. Good traffic consumer electronic systems require collaborative work between traffic algorithms and hardware. However, performance of popular traffic algorithms containing vehicle detection methods based on deep networks via learning data relation rather than learning differences in different lighting and occlusions is limited. In this paper, we present a dynamic Transformer network for vehicle detection (DTNet). DTNet utilizes a dynamic convolution to guide a deep network to dynamically generate weights to enhance adaptability of an obtained detector. Taking into relations of different information account, a mixed attention mechanism based channel attention and Transformer is exploited to strengthen relations of channels and pixels to extract more salient information for vehicle detection. To overcome the drawback of difference in an image account, a translation-variant convolution relies on spatial location information to refine obtained structural information for vehicle detection. Experimental results illustrate that our DTNet is competitive for vehicle detection. Code of the proposed DTNet can be obtained at https://github.com/hellloxiaotian/DTNet.

Related papers

CNN+Transformer Based Anomaly Traffic Detection in UAV Networks for Emergency Rescue [12.074051347588963]
We propose a novel anomaly traffic detection architecture for UAV networks based on the software-defined networking (SDN) framework and blockchain technology. An integrated algorithm combining convolutional neural networks (CNNs) and Transformer (CNN+Transformer) for anomaly traffic detection is developed, which is called CTranATD.
arXiv Detail & Related papers (2025-03-26T09:27:26Z)
TraffNet: Learning Causality of Traffic Generation for What-if Prediction [4.604622556490027]
Real-time what-if traffic prediction is crucial for decision making in intelligent traffic management and control. Here, we present a simple deep learning framework called TraffNet that learns the mechanisms of traffic generation for what-if prediction.
arXiv Detail & Related papers (2023-03-28T13:12:17Z)
Large-Scale Traffic Data Imputation with Spatiotemporal Semantic Understanding [26.86356769330179]
This study proposes Graph Transformer for Traffic Imputation (GT-TDI) model to impute large-scale traffic data with semantic understanding of a network. The proposed model takes incomplete data, social connectivity of sensors, and semantic descriptions as input to perform tasks with the help of Graph Neural Networks (GNN) and Transformer. The results show that proposed GT-TDI model outperforms existing methods in complex missing patterns and diverse missing rates.
arXiv Detail & Related papers (2023-01-27T13:02:19Z)
Dense-TNT: Efficient Vehicle Type Classification Neural Network Using Satellite Imagery [16.025849552108983]
This study proposes a novel Densely Connected Convolutional Network (DenseNet) framework for the vehicle type classification. Three-region vehicle data and four different weather conditions are deployed for recognition capability evaluation. Experimental findings validate the recognition ability of our proposed vehicle classification model with little decay, even under the heavy foggy weather condition.
arXiv Detail & Related papers (2022-09-27T16:17:53Z)
Pyramid Transformer for Traffic Sign Detection [1.933681537640272]
A novel Pyramid Transformer with locality mechanisms is proposed in this paper. Specifically, Pyramid Transformer has several spatial pyramid reduction layers to shrink and embed the input image into tokens with rich multi-scale context. The experiments are conducted on the German Traffic Sign Detection Benchmark (GTSDB).
arXiv Detail & Related papers (2022-07-13T09:21:19Z)
Cross-receptive Focused Inference Network for Lightweight Image Super-Resolution [64.25751738088015]
Transformer-based methods have shown impressive performance in single image super-resolution (SISR) tasks. Transformers that need to incorporate contextual information to extract features dynamically are neglected. We propose a lightweight Cross-receptive Focused Inference Network (CFIN) that consists of a cascade of CT Blocks mixed with CNN and Transformer.
arXiv Detail & Related papers (2022-07-06T16:32:29Z)
Federated Deep Learning Meets Autonomous Vehicle Perception: Design and Verification [168.67190934250868]
Federated learning empowered connected autonomous vehicle (FLCAV) has been proposed. FLCAV preserves privacy while reducing communication and annotation costs. It is challenging to determine the network resources and road sensor poses for multi-stage training.
arXiv Detail & Related papers (2022-06-03T23:55:45Z)
Efficient Federated Learning with Spike Neural Networks for Traffic Sign Recognition [70.306089187104]
We introduce powerful Spike Neural Networks (SNNs) into traffic sign recognition for energy-efficient and fast model training. Numerical results indicate that the proposed federated SNN outperforms traditional federated convolutional neural networks in terms of accuracy, noise immunity, and energy efficiency as well.
arXiv Detail & Related papers (2022-05-28T03:11:48Z)
Road Network Guided Fine-Grained Urban Traffic Flow Inference [108.64631590347352]
Accurate inference of fine-grained traffic flow from coarse-grained one is an emerging yet crucial problem. We propose a novel Road-Aware Traffic Flow Magnifier (RATFM) that exploits the prior knowledge of road networks. Our method can generate high-quality fine-grained traffic flow maps.
arXiv Detail & Related papers (2021-09-29T07:51:49Z)
Fine-Grained Vehicle Perception via 3D Part-Guided Visual Data Augmentation [77.60050239225086]
We propose an effective training data generation process by fitting a 3D car model with dynamic parts to vehicles in real images. Our approach is fully automatic without any human interaction. We present a multi-task network for VUS parsing and a multi-stream network for VHI parsing.
arXiv Detail & Related papers (2020-12-15T03:03:38Z)
Cooperative Perception with Deep Reinforcement Learning for Connected Vehicles [7.7003495898919265]
We present a cooperative perception scheme with deep reinforcement learning to enhance the detection accuracy for the surrounding objects. Our scheme mitigates the network load in vehicular communication networks and enhances the communication reliability.
arXiv Detail & Related papers (2020-04-23T01:44:12Z)

This list is automatically generated from the titles and abstracts of the papers on this site.