End-to-End Trainable One-Stage Parking Slot Detection Integrating Global
and Local Information
- URL: http://arxiv.org/abs/2003.02445v1
- Date: Thu, 5 Mar 2020 05:57:20 GMT
- Title: End-to-End Trainable One-Stage Parking Slot Detection Integrating Global
and Local Information
- Authors: Jae Kyu Suhr and Ho Gi Jung
- Abstract summary: This paper proposes an end-to-end trainable one-stage parking slot detection method for around view monitor (AVM) images.
The proposed method simultaneously acquires global information (entrance, type, and occupancy of parking slot) and local information (location and orientation of junction) by using a convolutional neural network (CNN)
In experiments, this method was quantitatively evaluated using the public dataset and outperforms previous methods by showing both recall and precision of 99.77%, type classification accuracy of 100%, and occupancy classification accuracy of 99.31% while processing 60 frames per second.
- Score: 14.62008690460147
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper proposes an end-to-end trainable one-stage parking slot detection
method for around view monitor (AVM) images. The proposed method simultaneously
acquires global information (entrance, type, and occupancy of parking slot) and
local information (location and orientation of junction) by using a
convolutional neural network (CNN), and integrates them to detect parking slots
with their properties. This method divides an AVM image into a grid and
performs a CNN-based feature extraction. For each cell of the grid, the global
and local information of the parking slot is obtained by applying convolution
filters to the extracted feature map. Final detection results are produced by
integrating the global and local information of the parking slot through
non-maximum suppression (NMS). Since the proposed method obtains most of the
information of the parking slot using a fully convolutional network without a
region proposal stage, it is an end-to-end trainable one-stage detector. In
experiments, this method was quantitatively evaluated using the public dataset
and outperforms previous methods by showing both recall and precision of
99.77%, type classification accuracy of 100%, and occupancy classification
accuracy of 99.31% while processing 60 frames per second.
Related papers
- Neural Semantic Map-Learning for Autonomous Vehicles [85.8425492858912]
We present a mapping system that fuses local submaps gathered from a fleet of vehicles at a central instance to produce a coherent map of the road environment.
Our method jointly aligns and merges the noisy and incomplete local submaps using a scene-specific Neural Signed Distance Field.
We leverage memory-efficient sparse feature-grids to scale to large areas and introduce a confidence score to model uncertainty in scene reconstruction.
arXiv Detail & Related papers (2024-10-10T10:10:03Z) - Deep Homography Estimation for Visual Place Recognition [49.235432979736395]
We propose a transformer-based deep homography estimation (DHE) network.
It takes the dense feature map extracted by a backbone network as input and fits homography for fast and learnable geometric verification.
Experiments on benchmark datasets show that our method can outperform several state-of-the-art methods.
arXiv Detail & Related papers (2024-02-25T13:22:17Z) - Improving Online Lane Graph Extraction by Object-Lane Clustering [106.71926896061686]
We propose an architecture and loss formulation to improve the accuracy of local lane graph estimates.
The proposed method learns to assign the objects to centerlines by considering the centerlines as cluster centers.
We show that our method can achieve significant performance improvements by using the outputs of existing 3D object detection methods.
arXiv Detail & Related papers (2023-07-20T15:21:28Z) - Spatial Likelihood Voting with Self-Knowledge Distillation for Weakly
Supervised Object Detection [54.24966006457756]
We propose a WSOD framework called the Spatial Likelihood Voting with Self-knowledge Distillation Network (SLV-SD Net)
SLV-SD Net converges region proposal localization without bounding box annotations.
Experiments on the PASCAL VOC 2007/2012 and MS-COCO datasets demonstrate the excellent performance of SLV-SD Net.
arXiv Detail & Related papers (2022-04-14T11:56:19Z) - CNN-based Two-Stage Parking Slot Detection Using Region-Specific
Multi-Scale Feature Extraction [7.652701739127332]
Parking slot detection performance has been dramatically improved by deep learning techniques.
Deep learning-based object detection methods can be categorized into one-stage and two-stage approaches.
This paper proposes a highly specialized two-stage parking slot detector that uses region-specific multi-scale feature extraction.
arXiv Detail & Related papers (2021-08-13T12:02:02Z) - Automated Parking Space Detection Using Convolutional Neural Networks [0.30458514384586394]
We have used computer vision techniques to infer the state of the parking lot given the data collected from the University of The Witwatersrand.
This paper presents an approach for a real-time parking space classification based on Convolutional Neural Networks (CNN) using Caffe and Nvidia DiGITS framework.
arXiv Detail & Related papers (2021-06-14T08:30:38Z) - Attentional Graph Neural Network for Parking-slot Detection [10.095984750382478]
We propose an attentional graph neural network based parking-slot detection method, which refers the marking-points in an around-view image as graph-structured data.
Without any manually designed post-processing, the proposed method is end-to-end trainable.
Experiments have been conducted on public benchmark dataset, where the proposed method achieves state-of-the-art accuracy.
arXiv Detail & Related papers (2021-04-06T15:14:39Z) - End-to-End Object Detection with Fully Convolutional Network [71.56728221604158]
We introduce a Prediction-aware One-To-One (POTO) label assignment for classification to enable end-to-end detection.
A simple 3D Max Filtering (3DMF) is proposed to utilize the multi-scale features and improve the discriminability of convolutions in the local region.
Our end-to-end framework achieves competitive performance against many state-of-the-art detectors with NMS on COCO and CrowdHuman datasets.
arXiv Detail & Related papers (2020-12-07T09:14:55Z) - LABNet: Local Graph Aggregation Network with Class Balanced Loss for
Vehicle Re-Identification [0.0]
Vehicle re-identification is an important computer vision task where the objective is to identify a specific vehicle.
Recent methods based on deep learning utilize a global average pooling layer after the backbone feature extractor.
We propose local graph aggregation on the backbone feature map, to learn associations of local information.
arXiv Detail & Related papers (2020-11-29T18:43:30Z) - Auto-Classifier: A Robust Defect Detector Based on an AutoML Head [2.685668802278155]
We show that the use of Convolutional Neural Networks achieves better results than traditional methods.
We also show that Auto-Classifier out-performs all other methods, by achieving 100% accuracy and 100% AUC results throughout all the datasets.
arXiv Detail & Related papers (2020-09-03T10:39:02Z) - R-FCN: Object Detection via Region-based Fully Convolutional Networks [87.62557357527861]
We present region-based, fully convolutional networks for accurate and efficient object detection.
Our result is achieved at a test-time speed of 170ms per image, 2.5-20x faster than the Faster R-CNN counterpart.
arXiv Detail & Related papers (2016-05-20T15:50:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.