DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place
Recognition under Adverse Conditions
- URL: http://arxiv.org/abs/2306.17536v1
- Date: Fri, 30 Jun 2023 10:46:51 GMT
- Title: DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place
Recognition under Adverse Conditions
- Authors: Stephen Hausler, Sourav Garg, Punarjay Chakravarty, Shubham
Shrivastava, Ankit Vora, Michael Milford
- Abstract summary: We investigate whether a prior map can be leveraged to aid in the detection of dynamic objects in a scene without the need for a 3D map.
We contribute an algorithm which refines an initial set of candidate object detections and produces a refined subset of highly accurate detections using a prior map.
- Score: 29.828201168816243
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Can knowing where you are assist in perceiving objects in your surroundings,
especially under adverse weather and lighting conditions? In this work we
investigate whether a prior map can be leveraged to aid in the detection of
dynamic objects in a scene without the need for a 3D map or pixel-level
map-query correspondences. We contribute an algorithm which refines an initial
set of candidate object detections and produces a refined subset of highly
accurate detections using a prior map. We begin by using visual place
recognition (VPR) to retrieve a reference map image for a given query image,
then use a binary classification neural network that compares the query and
mapping image regions to validate the query detection. Once our classification
network is trained, on approximately 1000 query-map image pairs, it is able to
improve the performance of vehicle detection when combined with an existing
off-the-shelf vehicle detector. We demonstrate our approach using standard
datasets across two cities (Oxford and Zurich) under different settings of
train-test separation of map-query traverse pairs. We further emphasize the
performance gains of our approach against alternative design choices and show
that VPR suffices for the task, eliminating the need for precise ground truth
localization.
Related papers
- Map-aided annotation for pole base detection [0.0]
In this paper, a 2D HD map is used to automatically annotate pole-like features in images.
In the absence of height information, the map features are represented as pole bases at the ground level.
We show how an object detector can be trained to detect a pole base.
arXiv Detail & Related papers (2024-03-04T09:23:11Z) - Deep Homography Estimation for Visual Place Recognition [49.235432979736395]
We propose a transformer-based deep homography estimation (DHE) network.
It takes the dense feature map extracted by a backbone network as input and fits homography for fast and learnable geometric verification.
Experiments on benchmark datasets show that our method can outperform several state-of-the-art methods.
arXiv Detail & Related papers (2024-02-25T13:22:17Z) - Improving the matching of deformable objects by learning to detect
keypoints [6.4587163310833855]
We propose a novel learned keypoint detection method to increase the number of correct matches for the task of non-rigid image correspondence.
We train an end-to-end convolutional neural network (CNN) to find keypoint locations that are more appropriate to the considered descriptor.
Experiments demonstrate that our method enhances the Mean Matching Accuracy of numerous descriptors when used in conjunction with our detection method.
We also apply our method on the complex real-world task object retrieval where our detector performs on par with the finest keypoint detectors currently available for this task.
arXiv Detail & Related papers (2023-09-01T13:02:19Z) - Unleash the Potential of Image Branch for Cross-modal 3D Object
Detection [67.94357336206136]
We present a new cross-modal 3D object detector, namely UPIDet, which aims to unleash the potential of the image branch from two aspects.
First, UPIDet introduces a new 2D auxiliary task called normalized local coordinate map estimation.
Second, we discover that the representational capability of the point cloud backbone can be enhanced through the gradients backpropagated from the training objectives of the image branch.
arXiv Detail & Related papers (2023-01-22T08:26:58Z) - HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D
Images [58.720142291102135]
We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment.
The dataset is based on the popular Habitat simulator, in which it is possible to generate indoor scenes using both own sensor data and open datasets.
arXiv Detail & Related papers (2022-12-30T12:20:56Z) - Satellite Image Based Cross-view Localization for Autonomous Vehicle [59.72040418584396]
This paper shows that by using an off-the-shelf high-definition satellite image as a ready-to-use map, we are able to achieve cross-view vehicle localization up to a satisfactory accuracy.
Our method is validated on KITTI and Ford Multi-AV Seasonal datasets as ground view and Google Maps as the satellite view.
arXiv Detail & Related papers (2022-07-27T13:16:39Z) - Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization
Using Satellite Image [91.29546868637911]
This paper addresses the problem of vehicle-mounted camera localization by matching a ground-level image with an overhead-view satellite map.
The key idea is to formulate the task as pose estimation and solve it by neural-net based optimization.
Experiments on standard autonomous vehicle localization datasets have confirmed the superiority of the proposed method.
arXiv Detail & Related papers (2022-04-10T19:16:58Z) - Semantic Image Alignment for Vehicle Localization [111.59616433224662]
We present a novel approach to vehicle localization in dense semantic maps using semantic segmentation from a monocular camera.
In contrast to existing visual localization approaches, the system does not require additional keypoint features, handcrafted localization landmark extractors or expensive LiDAR sensors.
arXiv Detail & Related papers (2021-10-08T14:40:15Z) - Aerial Map-Based Navigation Using Semantic Segmentation and Pattern
Matching [1.6244541005112747]
The proposed system attempts label-to-label matching, not image-to-image matching between aerial images and a map database.
The use of the deep learning technique as a tool for extracting high-level features reduces the image-based localization problem to a pattern matching problem.
arXiv Detail & Related papers (2021-07-01T18:31:42Z) - SpotNet: Self-Attention Multi-Task Network for Object Detection [11.444576186559487]
We produce foreground/background segmentation labels in a semi-supervised way, using background subtraction or optical flow.
We use those segmentation maps inside the network as a self-attention mechanism to weight the feature map used to produce the bounding boxes.
We show that by using this method, we obtain a significant mAP improvement on two traffic surveillance datasets.
arXiv Detail & Related papers (2020-02-13T14:43:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.