Revisiting Binary Local Image Description for Resource Limited Devices
- URL: http://arxiv.org/abs/2108.08380v1
- Date: Wed, 18 Aug 2021 20:42:43 GMT
- Title: Revisiting Binary Local Image Description for Resource Limited Devices
- Authors: Iago Su\'arez, Jos\'e M. Buenaposada, Luis Baumela
- Abstract summary: We present new binary image descriptors that emerge from the application of triplet ranking loss, hard negative mining and anchor swapping.
Bad and HashSIFT establish new operating points in the state-of-the-art's accuracy vs. resources trade-off curve.
- Score: 2.470815298095903
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The advent of a panoply of resource limited devices opens up new challenges
in the design of computer vision algorithms with a clear compromise between
accuracy and computational requirements. In this paper we present new binary
image descriptors that emerge from the application of triplet ranking loss,
hard negative mining and anchor swapping to traditional features based on pixel
differences and image gradients. These descriptors, BAD (Box Average
Difference) and HashSIFT, establish new operating points in the
state-of-the-art's accuracy vs.\ resources trade-off curve. In our experiments
we evaluate the accuracy, execution time and energy consumption of the proposed
descriptors. We show that BAD bears the fastest descriptor implementation in
the literature while HashSIFT approaches in accuracy that of the top deep
learning-based descriptors, being computationally more efficient. We have made
the source code public.
Related papers
- BEBLID: Boosted efficient binary local image descriptor [2.8538628855541397]
We introduce BEBLID, an efficient learned binary image descriptor.
It improves our previous real-valued descriptor, BELID, making it both more efficient for matching and more accurate.
In experiments BEBLID achieves an accuracy close to SIFT and better computational efficiency than ORB, the fastest algorithm in the literature.
arXiv Detail & Related papers (2024-02-07T00:14:32Z) - Residual Learning for Image Point Descriptors [56.917951170421894]
We propose a very simple and effective approach to learning local image descriptors by using a hand-crafted detector and descriptor.
We optimize the final descriptor by leveraging the knowledge already present in the handcrafted descriptor.
Our approach has potential applications in ensemble learning and learning with non-differentiable functions.
arXiv Detail & Related papers (2023-12-24T12:51:30Z) - Illumination-insensitive Binary Descriptor for Visual Measurement Based
on Local Inter-patch Invariance [32.758878779207365]
Existing binary descriptors may not perform well for long-term visual measurement tasks due to their sensitivity to illumination variations.
This study presents an illumination-insensitive binary (IIB) descriptor by leveraging the local inter-patch invariance exhibited in multiple spatial granularities.
Numerical experiments on both natural and synthetic datasets reveal that the proposed IIB descriptor outperforms state-of-the-art binary descriptors.
arXiv Detail & Related papers (2023-05-13T15:15:18Z) - Learning-Based Dimensionality Reduction for Computing Compact and
Effective Local Feature Descriptors [101.62384271200169]
A distinctive representation of image patches in form of features is a key component of many computer vision and robotics tasks.
We investigate multi-layer perceptrons (MLPs) to extract low-dimensional but high-quality descriptors.
We consider different applications, including visual localization, patch verification, image matching and retrieval.
arXiv Detail & Related papers (2022-09-27T17:59:04Z) - Learning Weighting Map for Bit-Depth Expansion within a Rational Range [64.15915577164894]
Bit-depth expansion (BDE) is one of the emerging technologies to display high bit-depth (HBD) image from low bit-depth (LBD) source.
Existing BDE methods have no unified solution for various BDE situations.
We design a bit restoration network (BRNet) to learn a weight for each pixel, which indicates the ratio of the replenished value within a rational range.
arXiv Detail & Related papers (2022-04-26T02:27:39Z) - ZippyPoint: Fast Interest Point Detection, Description, and Matching
through Mixed Precision Discretization [71.91942002659795]
We investigate and adapt network quantization techniques to accelerate inference and enable its use on compute limited platforms.
ZippyPoint, our efficient quantized network with binary descriptors, improves the network runtime speed, the descriptor matching speed, and the 3D model size.
These improvements come at a minor performance degradation as evaluated on the tasks of homography estimation, visual localization, and map-free visual relocalization.
arXiv Detail & Related papers (2022-03-07T18:59:03Z) - Local Quadruple Pattern: A Novel Descriptor for Facial Image Recognition
and Retrieval [20.77994516381]
A novel hand crafted local quadruple pattern (LQPAT) is proposed for facial image recognition and retrieval.
The proposed descriptor encodes relations amongst the neighbours in quadruple space.
The retrieval and recognition accuracies of the proposed descriptor has been compared with state of the art hand crafted descriptors on bench mark databases.
arXiv Detail & Related papers (2022-01-03T08:04:38Z) - Pixel-wise Dense Detector for Image Inpainting [34.721991959357425]
Recent GAN-based image inpainting approaches adopt an average strategy to discriminate the generated image and output a scalar.
We propose a novel detection-based generative framework for image inpainting, which adopts the min-max strategy in an adversarial process.
Experiments on multiple public datasets show the superior performance of the proposed framework.
arXiv Detail & Related papers (2020-11-04T13:45:27Z) - Blind Face Restoration via Deep Multi-scale Component Dictionaries [75.02640809505277]
We propose a deep face dictionary network (termed as DFDNet) to guide the restoration process of degraded observations.
DFDNet generates deep dictionaries for perceptually significant face components from high-quality images.
component AdaIN is leveraged to eliminate the style diversity between the input and dictionary features.
arXiv Detail & Related papers (2020-08-02T07:02:07Z) - Learning Feature Descriptors using Camera Pose Supervision [101.56783569070221]
We propose a novel weakly-supervised framework that can learn feature descriptors solely from relative camera poses between images.
Because we no longer need pixel-level ground-truth correspondences, our framework opens up the possibility of training on much larger and more diverse datasets for better and unbiased descriptors.
arXiv Detail & Related papers (2020-04-28T06:35:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.