DFM: A Performance Baseline for Deep Feature Matching
- URL: http://arxiv.org/abs/2106.07791v1
- Date: Mon, 14 Jun 2021 22:55:06 GMT
- Title: DFM: A Performance Baseline for Deep Feature Matching
- Authors: Ufuk Efe, Kutalmis Gokalp Ince, A. Aydin Alatan
- Abstract summary: The proposed method uses pre-trained VGG architecture as a feature extractor and does not require any additional training specific to improve matching.
Our algorithm achieves 0.57 and 0.80 overall scores in terms of Mean Matching Accuracy (MMA) for 1 pixel and 2 pixels thresholds respectively on Hpatches dataset.
- Score: 10.014010310188821
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A novel image matching method is proposed that utilizes learned features
extracted by an off-the-shelf deep neural network to obtain a promising
performance. The proposed method uses pre-trained VGG architecture as a feature
extractor and does not require any additional training specific to improve
matching. Inspired by well-established concepts in the psychology area, such as
the Mental Rotation paradigm, an initial warping is performed as a result of a
preliminary geometric transformation estimate. These estimates are simply based
on dense matching of nearest neighbors at the terminal layer of VGG network
outputs of the images to be matched. After this initial alignment, the same
approach is repeated again between reference and aligned images in a
hierarchical manner to reach a good localization and matching performance. Our
algorithm achieves 0.57 and 0.80 overall scores in terms of Mean Matching
Accuracy (MMA) for 1 pixel and 2 pixels thresholds respectively on Hpatches
dataset, which indicates a better performance than the state-of-the-art.
Related papers
- Ensemble Quadratic Assignment Network for Graph Matching [52.20001802006391]
Graph matching is a commonly used technique in computer vision and pattern recognition.
Recent data-driven approaches have improved the graph matching accuracy remarkably.
We propose a graph neural network (GNN) based approach to combine the advantages of data-driven and traditional methods.
arXiv Detail & Related papers (2024-03-11T06:34:05Z) - PairingNet: A Learning-based Pair-searching and -matching Network for
Image Fragments [6.694162736590122]
We propose a learning-based image fragment pair-searching and -matching approach to solve the challenging restoration problem.
Our proposed network achieves excellent pair-searching accuracy, reduces matching errors, and significantly reduces computational time.
arXiv Detail & Related papers (2023-12-14T07:43:53Z) - ATASI-Net: An Efficient Sparse Reconstruction Network for Tomographic
SAR Imaging with Adaptive Threshold [13.379416816598873]
This paper proposes a novel efficient sparse unfolding network based on the analytic learned iterative shrinkage thresholding algorithm (ALISTA)
The weight matrix in each layer of ATASI-Net is pre-computed as the solution of an off-line optimization problem.
In addition, adaptive threshold is introduced for each azimuth-range pixel, enabling the threshold shrinkage to be not only layer-varied but also element-wise.
arXiv Detail & Related papers (2022-11-30T09:55:45Z) - Pushing the Efficiency Limit Using Structured Sparse Convolutions [82.31130122200578]
We propose Structured Sparse Convolution (SSC), which leverages the inherent structure in images to reduce the parameters in the convolutional filter.
We show that SSC is a generalization of commonly used layers (depthwise, groupwise and pointwise convolution) in efficient architectures''
Architectures based on SSC achieve state-of-the-art performance compared to baselines on CIFAR-10, CIFAR-100, Tiny-ImageNet, and ImageNet classification benchmarks.
arXiv Detail & Related papers (2022-10-23T18:37:22Z) - Dense Contrastive Learning for Self-Supervised Visual Pre-Training [102.15325936477362]
We present dense contrastive learning, which implements self-supervised learning by optimizing a pairwise contrastive (dis)similarity loss at the pixel level between two views of input images.
Compared to the baseline method MoCo-v2, our method introduces negligible computation overhead (only 1% slower)
arXiv Detail & Related papers (2020-11-18T08:42:32Z) - JGR-P2O: Joint Graph Reasoning based Pixel-to-Offset Prediction Network
for 3D Hand Pose Estimation from a Single Depth Image [28.753759115780515]
State-of-the-art single depth image-based 3D hand pose estimation methods are based on dense predictions.
A novel pixel-wise prediction-based method is proposed to address the above issues.
The proposed model is implemented with an efficient 2D fully convolutional network backbone and has only about 1.4M parameters.
arXiv Detail & Related papers (2020-07-09T08:57:19Z) - A deep primal-dual proximal network for image restoration [8.797434238081372]
We design a deep network, named DeepPDNet, built from primal-dual iterations associated with the minimization of a standard penalized likelihood with an analysis prior.
Two different learning strategies: "Full learning" and "Partial learning" are proposed, the first one is the most efficient numerically.
Extensive results show that the proposed DeepPDNet demonstrates excellent performance on the MNIST and the more complex BSD68, BSD100, and SET14 datasets for image restoration and single image super-resolution task.
arXiv Detail & Related papers (2020-07-02T08:29:52Z) - The Power of Triply Complementary Priors for Image Compressive Sensing [89.14144796591685]
We propose a joint low-rank deep (LRD) image model, which contains a pair of complementaryly trip priors.
We then propose a novel hybrid plug-and-play framework based on the LRD model for image CS.
To make the optimization tractable, a simple yet effective algorithm is proposed to solve the proposed H-based image CS problem.
arXiv Detail & Related papers (2020-05-16T08:17:44Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z) - Adaptive binarization based on fuzzy integrals [7.4836284046629995]
This document presents a new adaptive binarization technique based on fuzzy integral images through an efficient design of a modified SAT for fuzzy integrals.
The experimental results show that the proposed methodology have produced an image quality thresholding often better than traditional algorithms and saliency neural networks.
arXiv Detail & Related papers (2020-03-04T18:30:57Z) - Image Matching across Wide Baselines: From Paper to Practice [80.9424750998559]
We introduce a comprehensive benchmark for local features and robust estimation algorithms.
Our pipeline's modular structure allows easy integration, configuration, and combination of different methods.
We show that with proper settings, classical solutions may still outperform the perceived state of the art.
arXiv Detail & Related papers (2020-03-03T15:20:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.