Local Feature Matching Using Deep Learning: A Survey
- URL: http://arxiv.org/abs/2401.17592v2
- Date: Mon, 11 Mar 2024 01:32:03 GMT
- Title: Local Feature Matching Using Deep Learning: A Survey
- Authors: Shibiao Xu, Shunpeng Chen, Rongtao Xu, Changwei Wang, Peng Lu, Li Guo
- Abstract summary: Local feature matching enjoys wide-ranging applications in the realm of computer vision, encompassing domains such as image retrieval, 3D reconstruction, and object recognition.
In recent years, the introduction of deep learning models has sparked widespread exploration into local feature matching techniques.
The paper also explores the practical application of local feature matching in diverse domains such as Structure from Motion, Remote Sensing Image Registration, and Medical Image Registration.
- Score: 19.322545965903608
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Local feature matching enjoys wide-ranging applications in the realm of
computer vision, encompassing domains such as image retrieval, 3D
reconstruction, and object recognition. However, challenges persist in
improving the accuracy and robustness of matching due to factors like viewpoint
and lighting variations. In recent years, the introduction of deep learning
models has sparked widespread exploration into local feature matching
techniques. The objective of this endeavor is to furnish a comprehensive
overview of local feature matching methods. These methods are categorized into
two key segments based on the presence of detectors. The Detector-based
category encompasses models inclusive of Detect-then-Describe, Joint Detection
and Description, Describe-then-Detect, as well as Graph Based techniques. In
contrast, the Detector-free category comprises CNN Based, Transformer Based,
and Patch Based methods. Our study extends beyond methodological analysis,
incorporating evaluations of prevalent datasets and metrics to facilitate a
quantitative comparison of state-of-the-art techniques. The paper also explores
the practical application of local feature matching in diverse domains such as
Structure from Motion, Remote Sensing Image Registration, and Medical Image
Registration, underscoring its versatility and significance across various
fields. Ultimately, we endeavor to outline the current challenges faced in this
domain and furnish future research directions, thereby serving as a reference
for researchers involved in local feature matching and its interconnected
domains. A comprehensive list of studies in this survey is available at
https://github.com/vignywang/Awesome-Local-Feature-Matching .
Related papers
- Cross-view geo-localization: a survey [1.3686993145787065]
Cross-view geo-localization has garnered notable attention in the realm of computer vision, spurred by the widespread availability of copious geotagged datasets.
This paper provides a thorough survey of cutting-edge methodologies, techniques, and associated challenges that are integral to this domain.
arXiv Detail & Related papers (2024-06-14T05:14:54Z) - Spatial Reasoning for Few-Shot Object Detection [21.3564383157159]
We propose a spatial reasoning framework that detects novel objects with only a few training examples in a context.
We employ a graph convolutional network as the RoIs and their relatedness are defined as nodes and edges, respectively.
We demonstrate that the proposed method significantly outperforms the state-of-the-art methods and verify its efficacy through extensive ablation studies.
arXiv Detail & Related papers (2022-11-02T12:38:08Z) - Point-Level Region Contrast for Object Detection Pre-Training [147.47349344401806]
We present point-level region contrast, a self-supervised pre-training approach for the task of object detection.
Our approach performs contrastive learning by directly sampling individual point pairs from different regions.
Compared to an aggregated representation per region, our approach is more robust to the change in input region quality.
arXiv Detail & Related papers (2022-02-09T18:56:41Z) - Contrastive Object Detection Using Knowledge Graph Embeddings [72.17159795485915]
We compare the error statistics of the class embeddings learned from a one-hot approach with semantically structured embeddings from natural language processing or knowledge graphs.
We propose a knowledge-embedded design for keypoint-based and transformer-based object detection architectures.
arXiv Detail & Related papers (2021-12-21T17:10:21Z) - Unsupervised Domain Adaption of Object Detectors: A Survey [87.08473838767235]
Recent advances in deep learning have led to the development of accurate and efficient models for various computer vision applications.
Learning highly accurate models relies on the availability of datasets with a large number of annotated images.
Due to this, model performance drops drastically when evaluated on label-scarce datasets having visually distinct images.
arXiv Detail & Related papers (2021-05-27T23:34:06Z) - RoRD: Rotation-Robust Descriptors and Orthographic Views for Local
Feature Matching [32.10261486751993]
We present a novel framework that combines learning of invariant descriptors through data augmentation and viewpoint projection.
We evaluate the effectiveness of the proposed approach on key tasks including pose estimation and visual place recognition.
arXiv Detail & Related papers (2021-03-15T17:40:25Z) - Region Comparison Network for Interpretable Few-shot Image
Classification [97.97902360117368]
Few-shot image classification has been proposed to effectively use only a limited number of labeled examples to train models for new classes.
We propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works.
We also present a new way to generalize the interpretability from the level of tasks to categories.
arXiv Detail & Related papers (2020-09-08T07:29:05Z) - Cross-Domain Facial Expression Recognition: A Unified Evaluation
Benchmark and Adversarial Graph Learning [85.6386289476598]
We develop a novel adversarial graph representation adaptation (AGRA) framework for cross-domain holistic-local feature co-adaptation.
We conduct extensive and fair evaluations on several popular benchmarks and show that the proposed AGRA framework outperforms previous state-of-the-art methods.
arXiv Detail & Related papers (2020-08-03T15:00:31Z) - Character Region Attention For Text Spotting [18.713194210876136]
A scene text spotter is composed of text detection and recognition modules.
A typical architecture places detection and recognition modules into separate branches, and a RoI pooling is commonly used to let the branches share a visual feature.
This is possible since the two modules share a common sub-task which is to find the location of the character regions.
This architecture is formed by utilizing detection outputs in the recognizer and propagating the recognition loss through the detection stage.
arXiv Detail & Related papers (2020-07-19T09:12:23Z) - Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation [62.29076080124199]
This paper proposes a novel coarse-to-fine feature adaptation approach to cross-domain object detection.
At the coarse-grained stage, foreground regions are extracted by adopting the attention mechanism, and aligned according to their marginal distributions.
At the fine-grained stage, we conduct conditional distribution alignment of foregrounds by minimizing the distance of global prototypes with the same category but from different domains.
arXiv Detail & Related papers (2020-03-23T13:40:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.