Data augmentation and image understanding
- URL: http://arxiv.org/abs/2012.14185v1
- Date: Mon, 28 Dec 2020 11:00:52 GMT
- Title: Data augmentation and image understanding
- Authors: Alex Hernandez-Garcia
- Abstract summary: dissertation explores some advantageous synergies between machine learning, cognitive science and neuroscience.
dissertation focuses on learning representations that are more aligned with visual perception and the biological vision.
- Score: 2.123756175601459
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Interdisciplinary research is often at the core of scientific progress. This
dissertation explores some advantageous synergies between machine learning,
cognitive science and neuroscience. In particular, this thesis focuses on
vision and images. The human visual system has been widely studied from both
behavioural and neuroscientific points of view, as vision is the dominant sense
of most people. In turn, machine vision has also been an active area of
research, currently dominated by the use of artificial neural networks. This
work focuses on learning representations that are more aligned with visual
perception and the biological vision. For that purpose, I have studied tools
and aspects from cognitive science and computational neuroscience, and
attempted to incorporate them into machine learning models of vision.
A central subject of this dissertation is data augmentation, a commonly used
technique for training artificial neural networks to augment the size of data
sets through transformations of the images. Although often overlooked, data
augmentation implements transformations that are perceptually plausible, since
they correspond to the transformations we see in our visual world -- changes in
viewpoint or illumination, for instance. Furthermore, neuroscientists have
found that the brain invariantly represents objects under these
transformations. Throughout this dissertation, I use these insights to analyse
data augmentation as a particularly useful inductive bias, a more effective
regularisation method for artificial neural networks, and as the framework to
analyse and improve the invariance of vision models to perceptually plausible
transformations. Overall, this work aims to shed more light on the properties
of data augmentation and demonstrate the potential of interdisciplinary
research.
Related papers
- Connecting metrics for shape-texture knowledge in computer vision [1.7785095623975342]
Deep neural networks remain brittle and susceptible to many changes in the image that do not cause humans to misclassify images.
Part of this different behavior may be explained by the type of features humans and deep neural networks use in vision tasks.
arXiv Detail & Related papers (2023-01-25T14:37:42Z) - BI AVAN: Brain inspired Adversarial Visual Attention Network [67.05560966998559]
We propose a brain-inspired adversarial visual attention network (BI-AVAN) to characterize human visual attention directly from functional brain activity.
Our model imitates the biased competition process between attention-related/neglected objects to identify and locate the visual objects in a movie frame the human brain focuses on in an unsupervised manner.
arXiv Detail & Related papers (2022-10-27T22:20:36Z) - Adapting Brain-Like Neural Networks for Modeling Cortical Visual
Prostheses [68.96380145211093]
Cortical prostheses are devices implanted in the visual cortex that attempt to restore lost vision by electrically stimulating neurons.
Currently, the vision provided by these devices is limited, and accurately predicting the visual percepts resulting from stimulation is an open challenge.
We propose to address this challenge by utilizing 'brain-like' convolutional neural networks (CNNs), which have emerged as promising models of the visual system.
arXiv Detail & Related papers (2022-09-27T17:33:19Z) - A neuromorphic approach to image processing and machine vision [0.9137554315375922]
We explore the implementation of visual tasks such as image segmentation, visual attention and object recognition.
We have emphasized on the employment of non-volatile memory devices such as memristors to realize artificial visual systems.
arXiv Detail & Related papers (2022-08-07T05:01:57Z) - Guiding Visual Attention in Deep Convolutional Neural Networks Based on
Human Eye Movements [0.0]
Deep Convolutional Neural Networks (DCNNs) were originally inspired by principles of biological vision.
Recent advances in deep learning seem to decrease this similarity.
We investigate a purely data-driven approach to obtain useful models.
arXiv Detail & Related papers (2022-06-21T17:59:23Z) - Peripheral Vision Transformer [52.55309200601883]
We take a biologically inspired approach and explore to model peripheral vision in deep neural networks for visual recognition.
We propose to incorporate peripheral position encoding to the multi-head self-attention layers to let the network learn to partition the visual field into diverse peripheral regions given training data.
We evaluate the proposed network, dubbed PerViT, on the large-scale ImageNet dataset and systematically investigate the inner workings of the model for machine perception.
arXiv Detail & Related papers (2022-06-14T12:47:47Z) - Prune and distill: similar reformatting of image information along rat
visual cortex and deep neural networks [61.60177890353585]
Deep convolutional neural networks (CNNs) have been shown to provide excellent models for its functional analogue in the brain, the ventral stream in visual cortex.
Here we consider some prominent statistical patterns that are known to exist in the internal representations of either CNNs or the visual cortex.
We show that CNNs and visual cortex share a similarly tight relationship between dimensionality expansion/reduction of object representations and reformatting of image information.
arXiv Detail & Related papers (2022-05-27T08:06:40Z) - Interpretability of Neural Network With Physiological Mechanisms [5.1971653175509145]
Deep learning continues to play as a powerful state-of-art technique that has achieved extraordinary accuracy levels in various domains of regression and classification tasks.
The original goal of proposing the neural network model is to improve the understanding of complex human brains using a mathematical expression approach.
Recent deep learning techniques continue to lose the interpretations of its functional process by being treated mostly as a black-box approximator.
arXiv Detail & Related papers (2022-03-24T21:40:04Z) - The world seems different in a social context: a neural network analysis
of human experimental data [57.729312306803955]
We show that it is possible to replicate human behavioral data in both individual and social task settings by modifying the precision of prior and sensory signals.
An analysis of the neural activation traces of the trained networks provides evidence that information is coded in fundamentally different ways in the network in the individual and in the social conditions.
arXiv Detail & Related papers (2022-03-03T17:19:12Z) - Visual Sensation and Perception Computational Models for Deep Learning:
State of the art, Challenges and Prospects [7.949330621850412]
visual sensation and perception refers to the process of sensing, organizing, identifying, and interpreting visual information in environmental awareness and understanding.
Computational models inspired by visual perception have the characteristics of complexity and diversity, as they come from many subjects such as cognition science, information science, and artificial intelligence.
arXiv Detail & Related papers (2021-09-08T01:51:24Z) - Affect Analysis in-the-wild: Valence-Arousal, Expressions, Action Units
and a Unified Framework [83.21732533130846]
The paper focuses on large in-the-wild databases, i.e., Aff-Wild and Aff-Wild2.
It presents the design of two classes of deep neural networks trained with these databases.
A novel multi-task and holistic framework is presented which is able to jointly learn and effectively generalize and perform affect recognition.
arXiv Detail & Related papers (2021-03-29T17:36:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.