DeepWORD: A GCN-based Approach for Owner-Member Relationship Detection
in Autonomous Driving
- URL: http://arxiv.org/abs/2103.16099v1
- Date: Tue, 30 Mar 2021 06:12:29 GMT
- Title: DeepWORD: A GCN-based Approach for Owner-Member Relationship Detection
in Autonomous Driving
- Authors: Zizhang Wu, Man Wang, Jason Wang, Wenkai Zhang, Muqing Fang, Tianhao
Xu
- Abstract summary: We propose an innovative relationship prediction method, namely DeepWORD, by designing a graph convolution network (GCN)
Specifically, we utilize the feature maps with local correlation as the input of nodes to improve the information richness.
We establish an annotated owner-member relationship dataset called WORD as a large-scale benchmark, which will be available soon.
- Score: 2.895229237964064
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: It's worth noting that the owner-member relationship between wheels and
vehicles has an significant contribution to the 3D perception of vehicles,
especially in the embedded environment. However, there are currently two main
challenges about the above relationship prediction: i) The traditional
heuristic methods based on IoU can hardly deal with the traffic jam scenarios
for the occlusion. ii) It is difficult to establish an efficient applicable
solution for the vehicle-mounted system. To address these issues, we propose an
innovative relationship prediction method, namely DeepWORD, by designing a
graph convolution network (GCN). Specifically, we utilize the feature maps with
local correlation as the input of nodes to improve the information richness.
Besides, we introduce the graph attention network (GAT) to dynamically amend
the prior estimation deviation. Furthermore, we establish an annotated
owner-member relationship dataset called WORD as a large-scale benchmark, which
will be available soon. The experiments demonstrate that our solution achieves
state-of-the-art accuracy and real-time in practice.
Related papers
- Enhancing Lane Segment Perception and Topology Reasoning with Crowdsourcing Trajectory Priors [12.333249510969289]
In this paper, we investigate prior augmentation from a novel perspective of trajectory priors.
We design a confidence-based fusion module that takes alignment into account during the fusion process.
The results indicate that our method's performance significantly outperforms the current state-of-the-art methods.
arXiv Detail & Related papers (2024-11-26T07:05:05Z) - Semantic Communication for Cooperative Perception using HARQ [51.148203799109304]
We leverage an importance map to distill critical semantic information, introducing a cooperative perception semantic communication framework.
To counter the challenges posed by time-varying multipath fading, our approach incorporates the use of frequency-division multiplexing (OFDM) along with channel estimation and equalization strategies.
We introduce a novel semantic error detection method that is integrated with our semantic communication framework in the spirit of hybrid automatic repeated request (HARQ)
arXiv Detail & Related papers (2024-08-29T08:53:26Z) - GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving [16.245949174447574]
We propose the Interaction Scene Graph (ISG) as a unified method to model the interactions among the ego-vehicle, road agents, and map elements.
We evaluate the proposed method for end-to-end autonomous driving on the nuScenes dataset.
arXiv Detail & Related papers (2024-03-28T02:22:28Z) - Interactive Autonomous Navigation with Internal State Inference and
Interactivity Estimation [58.21683603243387]
We propose three auxiliary tasks with relational-temporal reasoning and integrate them into the standard Deep Learning framework.
These auxiliary tasks provide additional supervision signals to infer the behavior patterns other interactive agents.
Our approach achieves robust and state-of-the-art performance in terms of standard evaluation metrics.
arXiv Detail & Related papers (2023-11-27T18:57:42Z) - RSG-Net: Towards Rich Sematic Relationship Prediction for Intelligent
Vehicle in Complex Environments [72.04891523115535]
We propose RSG-Net (Road Scene Graph Net): a graph convolutional network designed to predict potential semantic relationships from object proposals.
The experimental results indicate that this network, trained on Road Scene Graph dataset, could efficiently predict potential semantic relationships among objects around the ego-vehicle.
arXiv Detail & Related papers (2022-07-16T12:40:17Z) - Federated Deep Learning Meets Autonomous Vehicle Perception: Design and
Verification [168.67190934250868]
Federated learning empowered connected autonomous vehicle (FLCAV) has been proposed.
FLCAV preserves privacy while reducing communication and annotation costs.
It is challenging to determine the network resources and road sensor poses for multi-stage training.
arXiv Detail & Related papers (2022-06-03T23:55:45Z) - Detecting Owner-member Relationship with Graph Convolution Network in
Fisheye Camera System [9.665475078766017]
We propose an innovative relationship prediction method, DeepWORD, by designing a graph convolutional network (GCN)
In the experiments we learned that the proposed method achieved state-of-the-art accuracy and real-time performance.
arXiv Detail & Related papers (2022-01-28T13:12:27Z) - Fine-Grained Vehicle Perception via 3D Part-Guided Visual Data
Augmentation [77.60050239225086]
We propose an effective training data generation process by fitting a 3D car model with dynamic parts to vehicles in real images.
Our approach is fully automatic without any human interaction.
We present a multi-task network for VUS parsing and a multi-stream network for VHI parsing.
arXiv Detail & Related papers (2020-12-15T03:03:38Z) - Constructing Geographic and Long-term Temporal Graph for Traffic
Forecasting [88.5550074808201]
We propose Geographic and Long term Temporal Graph Convolutional Recurrent Neural Network (GLT-GCRNN) for traffic forecasting.
In this work, we propose a novel framework for traffic forecasting that learns the rich interactions between roads sharing similar geographic or longterm temporal patterns.
arXiv Detail & Related papers (2020-04-23T03:50:46Z) - Cooperative LIDAR Object Detection via Feature Sharing in Deep Networks [11.737037965090535]
We introduce the concept of feature sharing for cooperative object detection (FS-COD)
In our proposed approach, a better understanding of the environment is achieved by sharing partially processed data between cooperative vehicles.
It is shown that the proposed approach has significant performance superiority over the conventional single-vehicle object detection approaches.
arXiv Detail & Related papers (2020-02-19T20:47:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.