Is Geometry Enough for Matching in Visual Localization?
        - URL: http://arxiv.org/abs/2203.12979v1
- Date: Thu, 24 Mar 2022 10:55:17 GMT
- Title: Is Geometry Enough for Matching in Visual Localization?
- Authors: Qunjie Zhou, Sergio Agostinho, Aljosa Osep, Laura Leal-Taixe
- Abstract summary: GoMatch is an alternative to visual-based matching that relies on geometric information for matching image keypoints to maps, represented as sets of bearing vectors.
GoMatch improves over prior geometric-based matching work with a reduction of ($10.67m, 95.7circ$) and ($1.43m$, $34.7circ$) in average median pose errors on Cambridge Landmarks and 7-Scenes.
- Score: 12.984256838490795
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   In this paper, we propose to go beyond the well-established approach to
vision-based localization that relies on visual descriptor matching between a
query image and a 3D point cloud. While matching keypoints via visual
descriptors makes localization highly accurate, it has significant storage
demands, raises privacy concerns and increases map maintenance complexity. To
elegantly address those practical challenges for large-scale localization, we
present GoMatch, an alternative to visual-based matching that solely relies on
geometric information for matching image keypoints to maps, represented as sets
of bearing vectors. Our novel bearing vectors representation of 3D points,
significantly relieves the cross-domain challenge in geometric-based matching
that prevented prior work to tackle localization in a realistic environment.
With additional careful architecture design, GoMatch improves over prior
geometric-based matching work with a reduction of ($10.67m, 95.7^{\circ}$) and
($1.43m$, $34.7^{\circ}$) in average median pose errors on Cambridge Landmarks
and 7-Scenes, while requiring as little as $1.5/1.7\%$ of storage capacity in
comparison to the best visual-based matching methods. This confirms its
potential and feasibility for real-world localization and opens the door to
future efforts in advancing city-scale visual localization methods that do not
require storing visual descriptors.
 
      
        Related papers
        - $π^3$: Scalable Permutation-Equivariant Visual Geometry Learning [50.80418813055225]
 $pi3$ is a feed-forward neural network that offers a novel approach to visual geometry reconstruction.<n>pi3$ employs a fully permutation-equivariant architecture to predict affine-invariant camera poses and scale-invariant local point maps.
 arXiv  Detail & Related papers  (2025-07-17T17:59:53Z)
- R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale   Visual Localization [66.87005863868181]
 We introduce a covisibility graph-based global encoding learning and data augmentation strategy.
We revisit the network architecture and local feature extraction module.
Our method achieves state-of-the-art on challenging large-scale datasets without relying on network ensembles or 3D supervision.
 arXiv  Detail & Related papers  (2025-01-02T18:59:08Z)
- SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented   Reality [50.179377002092416]
 We propose an efficient visual localization method capable of high-quality rendering with fewer parameters.
Our method achieves superior or comparable rendering and localization performance to state-of-the-art implicit-based visual localization approaches.
 arXiv  Detail & Related papers  (2024-09-21T08:46:16Z)
- Coupled Laplacian Eigenmaps for Locally-Aware 3D Rigid Point Cloud   Matching [0.0]
 We propose a new technique, based on graph Laplacian eigenmaps, to match point clouds by taking into account fine local structures.
To deal with the order and sign ambiguity of Laplacian eigenmaps, we introduce a new operator, called Coupled Laplacian.
We show that the similarity between those aligned high-dimensional spaces provides a locally meaningful score to match shapes.
 arXiv  Detail & Related papers  (2024-02-27T10:10:12Z)
- Learning to Produce Semi-dense Correspondences for Visual Localization [11.415451542216559]
 This study addresses the challenge of performing visual localization in demanding conditions such as night-time scenarios, adverse weather, and seasonal changes.
We propose a novel method that extracts reliable semi-dense 2D-3D matching points based on dense keypoint matches.
The network utilizes both geometric and visual cues to effectively infer 3D coordinates for unobserved keypoints from the observed ones.
 arXiv  Detail & Related papers  (2024-02-13T10:40:10Z)
- Yes, we CANN: Constrained Approximate Nearest Neighbors for local
  feature-based visual localization [2.915868985330569]
 Constrained Approximate Nearest Neighbors (CANN) is a joint solution of k-nearest-neighbors across both the geometry and appearance space using only local features.
Our method significantly outperforms both state-of-the-art global feature-based retrieval and approaches using local feature aggregation schemes.
 arXiv  Detail & Related papers  (2023-06-15T10:12:10Z)
- Quadric Representations for LiDAR Odometry, Mapping and Localization [93.24140840537912]
 Current LiDAR odometry, mapping and localization methods leverage point-wise representations of 3D scenes.
We propose a novel method of describing scenes using quadric surfaces, which are far more compact representations of 3D objects.
Our method maintains low latency and memory utility while achieving competitive, and even superior, accuracy.
 arXiv  Detail & Related papers  (2023-04-27T13:52:01Z)
- MeshLoc: Mesh-Based Visual Localization [54.731309449883284]
 We explore a more flexible alternative based on dense 3D meshes that does not require features matching between database images to build the scene representation.
Surprisingly competitive results can be obtained when extracting features on renderings of these meshes, without any neural rendering stage.
Our results show that dense 3D model-based representations are a promising alternative to existing representations and point to interesting and challenging directions for future research.
 arXiv  Detail & Related papers  (2022-07-21T21:21:10Z)
- Progressive Coordinate Transforms for Monocular 3D Object Detection [52.00071336733109]
 We propose a novel and lightweight approach, dubbed em Progressive Coordinate Transforms (PCT) to facilitate learning coordinate representations.
In this paper, we propose a novel and lightweight approach, dubbed em Progressive Coordinate Transforms (PCT) to facilitate learning coordinate representations.
 arXiv  Detail & Related papers  (2021-08-12T15:22:33Z)
- i3dLoc: Image-to-range Cross-domain Localization Robust to Inconsistent
  Environmental Conditions [9.982307144353713]
 We present a method for localizing a single camera with respect to a point cloud map in indoor and outdoor scenes.
Our method can match equirectangular images to the 3D range projections by extracting cross-domain symmetric place descriptors.
With a single trained model, i3dLoc can demonstrate reliable visual localization in random conditions.
 arXiv  Detail & Related papers  (2021-05-27T00:13:11Z)
- VS-Net: Voting with Segmentation for Visual Localization [72.8165619061249]
 We propose a novel visual localization framework that establishes 2D-to-3D correspondences between the query image and the 3D map with a series of learnable scene-specific landmarks.
Our proposed VS-Net is extensively tested on multiple public benchmarks and can outperform state-of-the-art visual localization methods.
 arXiv  Detail & Related papers  (2021-05-23T08:44:11Z)
- Geometrically Mappable Image Features [85.81073893916414]
 Vision-based localization of an agent in a map is an important problem in robotics and computer vision.
We propose a method that learns image features targeted for image-retrieval-based localization.
 arXiv  Detail & Related papers  (2020-03-21T15:36:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.