Region Feature Descriptor Adapted to High Affine Transformations
- URL: http://arxiv.org/abs/2402.09724v3
- Date: Sun, 25 Feb 2024 08:09:49 GMT
- Title: Region Feature Descriptor Adapted to High Affine Transformations
- Authors: Shaojie Zhang, Yinghui Wang, Bin Nan, Wei Li, Jinlong Yang, Tao Yan,
Yukai Wang, Liangyi Huang, Mingfeng Wang, and Ibragim R. Atadjanov
- Abstract summary: This paper proposes a region feature descriptor based on simulating affine transformations using classification.
It exhibits higher precision and robustness compared to existing classical descriptors.
- Score: 11.300618381337777
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: To address the issue of feature descriptors being ineffective in representing
grayscale feature information when images undergo high affine transformations,
leading to a rapid decline in feature matching accuracy, this paper proposes a
region feature descriptor based on simulating affine transformations using
classification. The proposed method initially categorizes images with different
affine degrees to simulate affine transformations and generate a new set of
images. Subsequently, it calculates neighborhood information for feature points
on this new image set. Finally, the descriptor is generated by combining the
grayscale histogram of the maximum stable extremal region to which the feature
point belongs and the normalized position relative to the grayscale centroid of
the feature point's region. Experimental results, comparing feature matching
metrics under affine transformation scenarios, demonstrate that the proposed
descriptor exhibits higher precision and robustness compared to existing
classical descriptors. Additionally, it shows robustness when integrated with
other descriptors.
Related papers
- Global-Local Similarity for Efficient Fine-Grained Image Recognition with Vision Transformers [5.825612611197359]
Fine-grained recognition involves the classification of images from subordinate macro-categories.
We propose a novel and computationally inexpensive metric to identify discriminative regions in an image.
Our method achieves these results at a much lower computational cost compared to the alternatives.
arXiv Detail & Related papers (2024-07-17T10:04:54Z) - Motion Estimation for Large Displacements and Deformations [7.99536002595393]
Variational optical flow techniques based on a coarse-to-fine scheme interpolate sparse matches and locally optimize an energy model conditioned on colour, gradient and smoothness.
This paper addresses this problem and presents HybridFlow, a variational motion estimation framework for large displacements and deformations.
arXiv Detail & Related papers (2022-06-24T18:53:22Z) - Multiscale Analysis for Improving Texture Classification [62.226224120400026]
This paper employs the Gaussian-Laplacian pyramid to treat different spatial frequency bands of a texture separately.
We aggregate features extracted from gray and color texture images using bio-inspired texture descriptors, information-theoretic measures, gray-level co-occurrence matrix features, and Haralick statistical features into a single feature vector.
arXiv Detail & Related papers (2022-04-21T01:32:22Z) - Local Gradient Hexa Pattern: A Descriptor for Face Recognition and
Retrieval [20.77994516381]
A local gradient hexa pattern (LGHP) is proposed that identifies the relationship amongst the reference pixel and its neighboring pixels.
Discriminative information exists in the local neighborhood as well as in different derivative directions.
The proposed descriptor has better recognition as well as retrieval rates compared to state-of-the-art descriptors.
arXiv Detail & Related papers (2022-01-03T07:45:36Z) - Contextual Similarity Aggregation with Self-attention for Visual
Re-ranking [96.55393026011811]
We propose a visual re-ranking method by contextual similarity aggregation with self-attention.
We conduct comprehensive experiments on four benchmark datasets to demonstrate the generality and effectiveness of our proposed visual re-ranking method.
arXiv Detail & Related papers (2021-10-26T06:20:31Z) - Reservoir Computing Approach for Gray Images Segmentation [0.0]
The paper proposes a novel approach for gray scale images segmentation.
It is based on multiple features extraction from single feature per image pixel, namely its intensity value, using Echo state network.
The newly extracted features -- reservoir equilibrium states -- reveal hidden image characteristics that improve its segmentation via a clustering algorithm.
arXiv Detail & Related papers (2021-07-23T08:37:24Z) - A generalised feature for low level vision [0.0]
The Sinclair-Town transform subsumes the rolls of both edge-detector, MSER style region detector and corner detector.
The difference from the local mean is quantised to 3 values (dark-neutral-light)
arXiv Detail & Related papers (2021-02-03T11:02:03Z) - Learning to Compose Hypercolumns for Visual Correspondence [57.93635236871264]
We introduce a novel approach to visual correspondence that dynamically composes effective features by leveraging relevant layers conditioned on the images to match.
The proposed method, dubbed Dynamic Hyperpixel Flow, learns to compose hypercolumn features on the fly by selecting a small number of relevant layers from a deep convolutional neural network.
arXiv Detail & Related papers (2020-07-21T04:03:22Z) - Self-Supervised Tuning for Few-Shot Segmentation [82.32143982269892]
Few-shot segmentation aims at assigning a category label to each image pixel with few annotated samples.
Existing meta-learning method tends to fail in generating category-specifically discriminative descriptor when the visual features extracted from support images are marginalized in embedding space.
This paper presents an adaptive framework tuning, in which the distribution of latent features across different episodes is dynamically adjusted based on a self-segmentation scheme.
arXiv Detail & Related papers (2020-04-12T03:53:53Z) - Steering Self-Supervised Feature Learning Beyond Local Pixel Statistics [60.92229707497999]
We introduce a novel principle for self-supervised feature learning based on the discrimination of specific transformations of an image.
We demonstrate experimentally that learning to discriminate transformations such as LCI, image warping and rotations, yields features with state of the art generalization capabilities.
arXiv Detail & Related papers (2020-04-05T22:09:08Z) - Cross-Domain Few-Shot Classification via Learned Feature-Wise
Transformation [109.89213619785676]
Few-shot classification aims to recognize novel categories with only few labeled images in each class.
Existing metric-based few-shot classification algorithms predict categories by comparing the feature embeddings of query images with those from a few labeled images.
While promising performance has been demonstrated, these methods often fail to generalize to unseen domains.
arXiv Detail & Related papers (2020-01-23T18:55:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.