Illumination-insensitive Binary Descriptor for Visual Measurement Based
on Local Inter-patch Invariance
- URL: http://arxiv.org/abs/2305.07943v1
- Date: Sat, 13 May 2023 15:15:18 GMT
- Title: Illumination-insensitive Binary Descriptor for Visual Measurement Based
on Local Inter-patch Invariance
- Authors: Xinyu Lin, Yingjie Zhou, Xun Zhang, Yipeng Liu, and Ce Zhu
- Abstract summary: Existing binary descriptors may not perform well for long-term visual measurement tasks due to their sensitivity to illumination variations.
This study presents an illumination-insensitive binary (IIB) descriptor by leveraging the local inter-patch invariance exhibited in multiple spatial granularities.
Numerical experiments on both natural and synthetic datasets reveal that the proposed IIB descriptor outperforms state-of-the-art binary descriptors.
- Score: 32.758878779207365
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Binary feature descriptors have been widely used in various visual
measurement tasks, particularly those with limited computing resources and
storage capacities. Existing binary descriptors may not perform well for
long-term visual measurement tasks due to their sensitivity to illumination
variations. It can be observed that when image illumination changes
dramatically, the relative relationship among local patches mostly remains
intact. Based on the observation, consequently, this study presents an
illumination-insensitive binary (IIB) descriptor by leveraging the local
inter-patch invariance exhibited in multiple spatial granularities to deal with
unfavorable illumination variations. By taking advantage of integral images for
local patch feature computation, a highly efficient IIB descriptor is achieved.
It can encode scalable features in multiple spatial granularities, thus
facilitating a computationally efficient hierarchical matching from coarse to
fine. Moreover, the IIB descriptor can also apply to other types of image data,
such as depth maps and semantic segmentation results, when available in some
applications. Numerical experiments on both natural and synthetic datasets
reveal that the proposed IIB descriptor outperforms state-of-the-art binary
descriptors and some testing float descriptors. The proposed IIB descriptor has
also been successfully employed in a demo system for long-term visual
localization. The code of the IIB descriptor will be publicly available.
Related papers
- Binary Code Similarity Detection via Graph Contrastive Learning on Intermediate Representations [52.34030226129628]
Binary Code Similarity Detection (BCSD) plays a crucial role in numerous fields, including vulnerability detection, malware analysis, and code reuse identification.
In this paper, we propose IRBinDiff, which mitigates compilation differences by leveraging LLVM-IR with higher-level semantic abstraction.
Our extensive experiments, conducted under varied compilation settings, demonstrate that IRBinDiff outperforms other leading BCSD methods in both One-to-one comparison and One-to-many search scenarios.
arXiv Detail & Related papers (2024-10-24T09:09:20Z) - FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization [57.59857784298536]
Direct 2D-3D matching algorithms require significantly less memory but suffer from lower accuracy due to the larger and more ambiguous search space.
We address this ambiguity by fusing local and global descriptors using a weighted average operator within a 2D-3D search framework.
We consistently improve the accuracy over local-only systems and achieve performance close to hierarchical methods while halving memory requirements.
arXiv Detail & Related papers (2024-08-21T23:42:16Z) - Neuromorphic Synergy for Video Binarization [54.195375576583864]
Bimodal objects serve as a visual form to embed information that can be easily recognized by vision systems.
Neuromorphic cameras offer new capabilities for alleviating motion blur, but it is non-trivial to first de-blur and then binarize the images in a real-time manner.
We propose an event-based binary reconstruction method that leverages the prior knowledge of the bimodal target's properties to perform inference independently in both event space and image space.
We also develop an efficient integration method to propagate this binary image to high frame rate binary video.
arXiv Detail & Related papers (2024-02-20T01:43:51Z) - A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot
Semantic Correspondence [83.90531416914884]
We exploit Stable Diffusion features for semantic and dense correspondence.
With simple post-processing, SD features can perform quantitatively similar to SOTA representations.
We show that these correspondences can enable interesting applications such as instance swapping in two images.
arXiv Detail & Related papers (2023-05-24T16:59:26Z) - ALIKED: A Lighter Keypoint and Descriptor Extraction Network via
Deformable Transformation [27.04762347838776]
We propose the Sparse Deformable Descriptor Head (SDDH), which learns the deformable positions of supporting features for each keypoint and constructs deformable descriptors.
We show that the proposed network is both efficient and powerful in various visual measurement tasks, including image matching, 3D reconstruction, and visual relocalization.
arXiv Detail & Related papers (2023-04-07T12:05:39Z) - Learning-Based Dimensionality Reduction for Computing Compact and
Effective Local Feature Descriptors [101.62384271200169]
A distinctive representation of image patches in form of features is a key component of many computer vision and robotics tasks.
We investigate multi-layer perceptrons (MLPs) to extract low-dimensional but high-quality descriptors.
We consider different applications, including visual localization, patch verification, image matching and retrieval.
arXiv Detail & Related papers (2022-09-27T17:59:04Z) - Learning Geodesic-Aware Local Features from RGB-D Images [8.115075181267109]
We propose a new approach to compute descriptors from RGB-D images that are invariant to non-rigid deformations.
Our proposed description strategies are grounded on the key idea of learning feature representations on undistorted local image patches.
In different experiments using real and publicly available RGB-D data benchmarks, they consistently outperforms state-of-the-art handcrafted and learning-based image and RGB-D descriptors.
arXiv Detail & Related papers (2022-03-22T19:52:49Z) - ZippyPoint: Fast Interest Point Detection, Description, and Matching
through Mixed Precision Discretization [71.91942002659795]
We investigate and adapt network quantization techniques to accelerate inference and enable its use on compute limited platforms.
ZippyPoint, our efficient quantized network with binary descriptors, improves the network runtime speed, the descriptor matching speed, and the 3D model size.
These improvements come at a minor performance degradation as evaluated on the tasks of homography estimation, visual localization, and map-free visual relocalization.
arXiv Detail & Related papers (2022-03-07T18:59:03Z) - Revisiting Binary Local Image Description for Resource Limited Devices [2.470815298095903]
We present new binary image descriptors that emerge from the application of triplet ranking loss, hard negative mining and anchor swapping.
Bad and HashSIFT establish new operating points in the state-of-the-art's accuracy vs. resources trade-off curve.
arXiv Detail & Related papers (2021-08-18T20:42:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.