Enabling Efficient Deep Convolutional Neural Network-based Sensor Fusion
for Autonomous Driving
- URL: http://arxiv.org/abs/2202.11231v1
- Date: Tue, 22 Feb 2022 23:35:30 GMT
- Title: Enabling Efficient Deep Convolutional Neural Network-based Sensor Fusion
for Autonomous Driving
- Authors: Xiaoming Zeng, Zhendong Wang, Yang Hu
- Abstract summary: Fusion between DCNNs has been proved as a promising strategy to achieve satisfactory perception accuracy.
We propose a feature disparity metric to measure the degree of feature disparity between the feature maps being fused.
We also propose a Layer-sharing technique in the deep layer that can achieve better accuracy with less computational overhead.
- Score: 10.326217500172689
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Autonomous driving demands accurate perception and safe decision-making. To
achieve this, automated vehicles are now equipped with multiple sensors (e.g.,
camera, Lidar, etc.), enabling them to exploit complementary environmental
context by fusing data from different sensing modalities. With the success of
Deep Convolutional Neural Network(DCNN), the fusion between DCNNs has been
proved as a promising strategy to achieve satisfactory perception accuracy.
However, mainstream existing DCNN fusion schemes conduct fusion by directly
element-wisely adding feature maps extracted from different modalities together
at various stages, failing to consider whether the features being fused are
matched or not. Therefore, we first propose a feature disparity metric to
quantitatively measure the degree of feature disparity between the feature maps
being fused. We then propose Fusion-filter as a feature-matching techniques to
tackle the feature-mismatching issue. We also propose a Layer-sharing technique
in the deep layer that can achieve better accuracy with less computational
overhead. Together with the help of the feature disparity to be an additional
loss, our proposed technologies enable DCNN to learn corresponding feature maps
with similar characteristics and complementary visual context from different
modalities to achieve better accuracy. Experimental results demonstrate that
our proposed fusion technique can achieve better accuracy on KITTI dataset with
less computational resources demand.
Related papers
- Multi-scale Quaternion CNN and BiGRU with Cross Self-attention Feature Fusion for Fault Diagnosis of Bearing [5.3598912592106345]
Deep learning has led to significant advances in bearing fault diagnosis (FD)
We propose a novel FD model by integrating multiscale quaternion convolutional neural network (MQCNN), bidirectional gated recurrent unit (BiG), and cross self-attention feature fusion (CSAFF)
arXiv Detail & Related papers (2024-05-25T07:55:02Z) - AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection [23.91870504363899]
Double-stream networks in multispectral detection employ two separate feature extraction branches for multi-modal data.
This has hindered the widespread employment of multispectral pedestrian detection in embedded devices for autonomous systems.
We introduce the Adaptive Modal Fusion Distillation (AMFD) framework, which can fully utilize the original modal features of the teacher network.
arXiv Detail & Related papers (2024-05-21T17:17:17Z) - Mutual-Guided Dynamic Network for Image Fusion [51.615598671899335]
We propose a novel mutual-guided dynamic network (MGDN) for image fusion, which allows for effective information utilization across different locations and inputs.
Experimental results on five benchmark datasets demonstrate that our proposed method outperforms existing methods on four image fusion tasks.
arXiv Detail & Related papers (2023-08-24T03:50:37Z) - Point-aware Interaction and CNN-induced Refinement Network for RGB-D
Salient Object Detection [95.84616822805664]
We introduce CNNs-assisted Transformer architecture and propose a novel RGB-D SOD network with Point-aware Interaction and CNN-induced Refinement.
In order to alleviate the block effect and detail destruction problems brought by the Transformer naturally, we design a CNN-induced refinement (CNNR) unit for content refinement and supplementation.
arXiv Detail & Related papers (2023-08-17T11:57:49Z) - T-Fusion Net: A Novel Deep Neural Network Augmented with Multiple
Localizations based Spatial Attention Mechanisms for Covid-19 Detection [0.7614628596146599]
The present work proposes a new deep neural network (called as, T-Fusion Net) that augments multiple localizations based spatial attention.
A homogeneous ensemble of the said network is further used to enhance image classification accuracy.
The proposed T-Fusion Net and the homogeneous ensemble model exhibit better performance, as compared to other state-of-the-art methods.
arXiv Detail & Related papers (2023-07-31T18:18:01Z) - Low-Energy Convolutional Neural Networks (CNNs) using Hadamard Method [0.0]
Convolutional neural networks (CNNs) are a potential approach for object recognition and detection.
A new approach based on the Hadamard transformation as an alternative to the convolution operation is demonstrated.
The method is helpful for other computer vision tasks when the kernel size is smaller than the input image size.
arXiv Detail & Related papers (2022-09-06T21:36:57Z) - Task-Oriented Sensing, Computation, and Communication Integration for
Multi-Device Edge AI [108.08079323459822]
This paper studies a new multi-intelligent edge artificial-latency (AI) system, which jointly exploits the AI model split inference and integrated sensing and communication (ISAC)
We measure the inference accuracy by adopting an approximate but tractable metric, namely discriminant gain.
arXiv Detail & Related papers (2022-07-03T06:57:07Z) - Enhanced Exploration in Neural Feature Selection for Deep Click-Through
Rate Prediction Models via Ensemble of Gating Layers [7.381829794276824]
The goal of neural feature selection (NFS) is to choose a relatively small subset of features with the best explanatory power.
Gating approach inserts a set of differentiable binary gates to drop less informative features.
To improve the exploration capacity of gradient-based solutions, we propose a simple but effective ensemble learning approach.
arXiv Detail & Related papers (2021-12-07T04:37:05Z) - Adaptive Anomaly Detection for Internet of Things in Hierarchical Edge
Computing: A Contextual-Bandit Approach [81.5261621619557]
We propose an adaptive anomaly detection scheme with hierarchical edge computing (HEC)
We first construct multiple anomaly detection DNN models with increasing complexity, and associate each of them to a corresponding HEC layer.
Then, we design an adaptive model selection scheme that is formulated as a contextual-bandit problem and solved by using a reinforcement learning policy network.
arXiv Detail & Related papers (2021-08-09T08:45:47Z) - Learning Deep Interleaved Networks with Asymmetric Co-Attention for
Image Restoration [65.11022516031463]
We present a deep interleaved network (DIN) that learns how information at different states should be combined for high-quality (HQ) images reconstruction.
In this paper, we propose asymmetric co-attention (AsyCA) which is attached at each interleaved node to model the feature dependencies.
Our presented DIN can be trained end-to-end and applied to various image restoration tasks.
arXiv Detail & Related papers (2020-10-29T15:32:00Z) - Hierarchical Dynamic Filtering Network for RGB-D Salient Object
Detection [91.43066633305662]
The main purpose of RGB-D salient object detection (SOD) is how to better integrate and utilize cross-modal fusion information.
In this paper, we explore these issues from a new perspective.
We implement a kind of more flexible and efficient multi-scale cross-modal feature processing.
arXiv Detail & Related papers (2020-07-13T07:59:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.