Recognition of Geometrical Shapes by Dictionary Learning
- URL: http://arxiv.org/abs/2504.10958v1
- Date: Tue, 15 Apr 2025 08:05:16 GMT
- Title: Recognition of Geometrical Shapes by Dictionary Learning
- Authors: Alexander Köhler, Michael Breuß,
- Abstract summary: We present a first approach to make dictionary learning work for shape recognition.<n>The choice of the underlying optimization method has a significant impact on recognition quality.<n> Experimental results confirm that dictionary learning may be an interesting method for shape recognition tasks.
- Score: 49.30082271910632
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Dictionary learning is a versatile method to produce an overcomplete set of vectors, called atoms, to represent a given input with only a few atoms. In the literature, it has been used primarily for tasks that explore its powerful representation capabilities, such as for image reconstruction. In this work, we present a first approach to make dictionary learning work for shape recognition, considering specifically geometrical shapes. As we demonstrate, the choice of the underlying optimization method has a significant impact on recognition quality. Experimental results confirm that dictionary learning may be an interesting method for shape recognition tasks.
Related papers
- Online multidimensional dictionary learning [0.0]
We propose a generalization of the dictionary learning technique using the t-product framework.<n>We address the dictionary learning problem through online methods suitable for tensor structures.
arXiv Detail & Related papers (2025-03-12T12:31:29Z) - Adaptive Language-Guided Abstraction from Contrastive Explanations [53.48583372522492]
It is necessary to determine which features of the environment are relevant before determining how these features should be used to compute reward.
End-to-end methods for joint feature and reward learning often yield brittle reward functions that are sensitive to spurious state features.
This paper describes a method named ALGAE which alternates between using language models to iteratively identify human-meaningful features.
arXiv Detail & Related papers (2024-09-12T16:51:58Z) - Object Topological Character Acquisition by Inductive Learning [0.0]
In this paper, a formal representation of topological structure based on object's skeleton (RTS) was proposed and the induction process of "seeking common ground" is realized.
It is clear that implementing object recognition is not based on simple physical features such as colors, edges, textures, etc., but on their common geometry, such as topologies.
arXiv Detail & Related papers (2023-06-19T01:19:37Z) - Aligning Visual Prototypes with BERT Embeddings for Few-Shot Learning [48.583388368897126]
Few-shot learning is the task of learning to recognize previously unseen categories of images.
We propose a method that takes into account the names of the image classes.
arXiv Detail & Related papers (2021-05-21T08:08:28Z) - Brain Inspired Object Recognition System [0.0]
Histogram of Oriented Gradients, Local Binary Patterns, and Principal components extracted from target images are used.
A computational theory is first developed by using concepts from the information processing mechanism of the brain.
Experiments are carried out using fifteen publicly available datasets to validate the performance of our proposed model.
arXiv Detail & Related papers (2021-05-15T14:42:17Z) - Accurate Word Representations with Universal Visual Guidance [55.71425503859685]
This paper proposes a visual representation method to explicitly enhance conventional word embedding with multiple-aspect senses from visual guidance.
We build a small-scale word-image dictionary from a multimodal seed dataset where each word corresponds to diverse related images.
Experiments on 12 natural language understanding and machine translation tasks further verify the effectiveness and the generalization capability of the proposed approach.
arXiv Detail & Related papers (2020-12-30T09:11:50Z) - DIABLO: Dictionary-based Attention Block for Deep Metric Learning [23.083900077464442]
DIABLO is a dictionary-based attention method for image embedding.
It produces richer representations by aggregating only visually-related features together.
It is experimentally confirmed on four deep metric learning datasets.
arXiv Detail & Related papers (2020-04-30T09:05:56Z) - Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks [207.52609682812147]
We propose a new learning method Oscar (Object-Semantics Aligned Pre-training)
It uses object tags detected in images as anchor points to significantly ease the learning of alignments.
We pre-train an Oscar model on the public corpus of 6.5 million text-image pairs, and fine-tune it on downstream tasks.
arXiv Detail & Related papers (2020-04-13T19:18:10Z) - Learning Representations by Predicting Bags of Visual Words [55.332200948110895]
Self-supervised representation learning targets to learn convnet-based image representations from unlabeled data.
Inspired by the success of NLP methods in this area, in this work we propose a self-supervised approach based on spatially dense image descriptions.
arXiv Detail & Related papers (2020-02-27T16:45:25Z) - Learning Deep Analysis Dictionaries -- Part II: Convolutional
Dictionaries [38.7315182732103]
We introduce a Deep Convolutional Analysis Dictionary Model (DeepCAM) by learning convolutional dictionaries instead of unstructured dictionaries.
A L-layer DeepCAM consists of L layers of convolutional analysis dictionary and element-wise soft-thresholding pairs.
We demonstrate that DeepCAM is an effective multilayer convolutional model and, on single image super-resolution, achieves performance comparable with other methods.
arXiv Detail & Related papers (2020-01-31T19:02:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.