CNN-based Two-Stage Parking Slot Detection Using Region-Specific
Multi-Scale Feature Extraction
- URL: http://arxiv.org/abs/2108.06185v1
- Date: Fri, 13 Aug 2021 12:02:02 GMT
- Title: CNN-based Two-Stage Parking Slot Detection Using Region-Specific
Multi-Scale Feature Extraction
- Authors: Quang Huy Bui and Jae Kyu Suhr
- Abstract summary: Parking slot detection performance has been dramatically improved by deep learning techniques.
Deep learning-based object detection methods can be categorized into one-stage and two-stage approaches.
This paper proposes a highly specialized two-stage parking slot detector that uses region-specific multi-scale feature extraction.
- Score: 7.652701739127332
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Autonomous parking systems start with the detection of available parking
slots. Parking slot detection performance has been dramatically improved by
deep learning techniques. Deep learning-based object detection methods can be
categorized into one-stage and two-stage approaches. Although it is well-known
that the two-stage approach outperforms the one-stage approach in general
object detection, they have performed similarly in parking slot detection so
far. We consider this is because the two-stage approach has not yet been
adequately specialized for parking slot detection. Thus, this paper proposes a
highly specialized two-stage parking slot detector that uses region-specific
multi-scale feature extraction. In the first stage, the proposed method finds
the entrance of the parking slot as a region proposal by estimating its center,
length, and orientation. The second stage of this method designates specific
regions that most contain the desired information and extracts features from
them. That is, features for the location and orientation are separately
extracted from only the specific regions that most contain the locational and
orientational information. In addition, multi-resolution feature maps are
utilized to increase both positioning and classification accuracies. A
high-resolution feature map is used to extract detailed information (location
and orientation), while another low-resolution feature map is used to extract
semantic information (type and occupancy). In experiments, the proposed method
was quantitatively evaluated with two large-scale public parking slot detection
datasets and outperformed previous methods, including both one-stage and
two-stage approaches.
Related papers
- Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection [57.646582245834324]
We propose a simple yet effective deepfake detector called LSDA.
It is based on a idea: representations with a wider variety of forgeries should be able to learn a more generalizable decision boundary.
We show that our proposed method is surprisingly effective and transcends state-of-the-art detectors across several widely used benchmarks.
arXiv Detail & Related papers (2023-11-19T09:41:10Z) - Towards Generalizable Deepfake Detection by Primary Region
Regularization [52.41801719896089]
This paper enhances the generalization capability from a novel regularization perspective.
Our method consists of two stages, namely the static localization for primary region maps, and the dynamic exploitation of primary region masks.
We conduct extensive experiments over three widely used deepfake datasets - DFDC, DF-1.0, and Celeb-DF with five backbones.
arXiv Detail & Related papers (2023-07-24T05:43:34Z) - SFD2: Semantic-guided Feature Detection and Description [34.36397639248686]
We propose to extract globally reliable features by implicitly embedding high-level semantics into both the detection and description processes.
Specifically, our semantic-aware detector is able to detect keypoints from reliable regions and suppress unreliable areas.
This boosts the accuracy of keypoint matching by reducing the number of features sensitive to appearance changes.
arXiv Detail & Related papers (2023-04-28T13:34:26Z) - Attention-based Domain Adaptation for Single Stage Detectors [75.88557558238841]
We introduce an attention mechanism that lets us identify the important regions on which adaptation should focus.
Our approach is generic and can be integrated into any single-stage detector.
For an equivalent single-stage architecture, our method outperforms the state-of-the-art domain adaptation technique.
arXiv Detail & Related papers (2021-06-14T10:30:44Z) - Multi-attentional Deepfake Detection [79.80308897734491]
Face forgery by deepfake is widely spread over the internet and has raised severe societal concerns.
We propose a new multi-attentional deepfake detection network. Specifically, it consists of three key components: 1) multiple spatial attention heads to make the network attend to different local parts; 2) textural feature enhancement block to zoom in the subtle artifacts in shallow features; 3) aggregate the low-level textural feature and high-level semantic features guided by the attention maps.
arXiv Detail & Related papers (2021-03-03T13:56:14Z) - Line-Circle-Square (LCS): A Multilayered Geometric Filter for Edge-Based
Detection [2.4054377316708964]
The proposed filter applies detection, tracking and learning to each defined expert to extract higher level information for judging scenes without over-calculation.
The experiment validates the effectiveness of the proposed filter in terms of detection precision and resource usage in both experimental and real-world scenarios.
arXiv Detail & Related papers (2020-08-21T05:28:12Z) - D2D: Keypoint Extraction with Describe to Detect Approach [48.0325745125635]
We present a novel approach that exploits the information within the descriptor space to propose keypoint locations.
We propose an approach that inverts this process by first describing and then detecting the keypoint locations.
arXiv Detail & Related papers (2020-05-27T19:27:46Z) - PSDet: Efficient and Universal Parking Slot Detection [14.085693334348827]
Real-time parking slot detection plays a critical role in valet parking systems.
Existing methods have limited success in real-world applications.
We argue two reasons accounting for the unsatisfactory performance:.
romannumeral1, The available datasets have limited diversity, which causes the low generalization ability.
arXiv Detail & Related papers (2020-05-12T03:06:25Z) - End-to-End Trainable One-Stage Parking Slot Detection Integrating Global
and Local Information [14.62008690460147]
This paper proposes an end-to-end trainable one-stage parking slot detection method for around view monitor (AVM) images.
The proposed method simultaneously acquires global information (entrance, type, and occupancy of parking slot) and local information (location and orientation of junction) by using a convolutional neural network (CNN)
In experiments, this method was quantitatively evaluated using the public dataset and outperforms previous methods by showing both recall and precision of 99.77%, type classification accuracy of 100%, and occupancy classification accuracy of 99.31% while processing 60 frames per second.
arXiv Detail & Related papers (2020-03-05T05:57:20Z) - Road Curb Detection and Localization with Monocular Forward-view Vehicle
Camera [74.45649274085447]
We propose a robust method for estimating road curb 3D parameters using a calibrated monocular camera equipped with a fisheye lens.
Our approach is able to estimate the vehicle to curb distance in real time with mean accuracy of more than 90%.
arXiv Detail & Related papers (2020-02-28T00:24:18Z) - Deep Fusion of Local and Non-Local Features for Precision Landslide
Recognition [17.896249114628336]
This paper proposes an effective approach to fuse both local and non-local features to surmount the contextual problem.
Built upon the U-Net architecture that is widely adopted in the remote sensing community, we utilize two additional modules.
Experimental evaluations revealed that the proposed method outperformed state-of-the-art general-purpose semantic segmentation approaches.
arXiv Detail & Related papers (2020-02-20T03:18:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.