Visual Sensation and Perception Computational Models for Deep Learning:
State of the art, Challenges and Prospects
- URL: http://arxiv.org/abs/2109.03391v1
- Date: Wed, 8 Sep 2021 01:51:24 GMT
- Title: Visual Sensation and Perception Computational Models for Deep Learning:
State of the art, Challenges and Prospects
- Authors: Bing Wei, Yudi Zhao, Kuangrong Hao, and Lei Gao
- Abstract summary: visual sensation and perception refers to the process of sensing, organizing, identifying, and interpreting visual information in environmental awareness and understanding.
Computational models inspired by visual perception have the characteristics of complexity and diversity, as they come from many subjects such as cognition science, information science, and artificial intelligence.
- Score: 7.949330621850412
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Visual sensation and perception refers to the process of sensing, organizing,
identifying, and interpreting visual information in environmental awareness and
understanding. Computational models inspired by visual perception have the
characteristics of complexity and diversity, as they come from many subjects
such as cognition science, information science, and artificial intelligence. In
this paper, visual perception computational models oriented deep learning are
investigated from the biological visual mechanism and computational vision
theory systematically. Then, some points of view about the prospects of the
visual perception computational models are presented. Finally, this paper also
summarizes the current challenges of visual perception and predicts its future
development trends. Through this survey, it will provide a comprehensive
reference for research in this direction.
Related papers
- When Does Perceptual Alignment Benefit Vision Representations? [76.32336818860965]
We investigate how aligning vision model representations to human perceptual judgments impacts their usability.
We find that aligning models to perceptual judgments yields representations that improve upon the original backbones across many downstream tasks.
Our results suggest that injecting an inductive bias about human perceptual knowledge into vision models can contribute to better representations.
arXiv Detail & Related papers (2024-10-14T17:59:58Z) - Graphical Perception of Saliency-based Model Explanations [6.936466872687605]
We study the perception of model explanations, specifically, saliency-based explanations for visual recognition models.
Our findings show that factors related to visualization design decisions, the type of alignment, and qualities of the saliency map all play important roles in how humans perceive saliency-based visual explanations.
arXiv Detail & Related papers (2024-06-11T20:29:25Z) - Visual Knowledge in the Big Model Era: Retrospect and Prospect [63.282425615863]
Visual knowledge is a new form of knowledge representation that can encapsulate visual concepts and their relations in a succinct, comprehensive, and interpretable manner.
As the knowledge about the visual world has been identified as an indispensable component of human cognition and intelligence, visual knowledge is poised to have a pivotal role in establishing machine intelligence.
arXiv Detail & Related papers (2024-04-05T07:31:24Z) - Deep Learning for Visual Neuroprosthesis [22.59701507351177]
The visual pathway involves complex networks of cells and regions which contribute to the encoding and processing of visual information.
This chapter discusses the importance of visual perception and the challenges associated with understanding how visual information is encoded and represented in the brain.
arXiv Detail & Related papers (2024-01-08T02:53:22Z) - Intrinsic Physical Concepts Discovery with Object-Centric Predictive
Models [86.25460882547581]
We introduce the PHYsical Concepts Inference NEtwork (PHYCINE), a system that infers physical concepts in different abstract levels without supervision.
We show that object representations containing the discovered physical concepts variables could help achieve better performance in causal reasoning tasks.
arXiv Detail & Related papers (2023-03-03T11:52:21Z) - Deep Learning to See: Towards New Foundations of Computer Vision [88.69805848302266]
This book criticizes the supposed scientific progress in the field of computer vision.
It proposes the investigation of vision within the framework of information-based laws of nature.
arXiv Detail & Related papers (2022-06-30T15:20:36Z) - Peripheral Vision Transformer [52.55309200601883]
We take a biologically inspired approach and explore to model peripheral vision in deep neural networks for visual recognition.
We propose to incorporate peripheral position encoding to the multi-head self-attention layers to let the network learn to partition the visual field into diverse peripheral regions given training data.
We evaluate the proposed network, dubbed PerViT, on the large-scale ImageNet dataset and systematically investigate the inner workings of the model for machine perception.
arXiv Detail & Related papers (2022-06-14T12:47:47Z) - Causal Reasoning Meets Visual Representation Learning: A Prospective
Study [117.08431221482638]
Lack of interpretability, robustness, and out-of-distribution generalization are becoming the challenges of the existing visual models.
Inspired by the strong inference ability of human-level agents, recent years have witnessed great effort in developing causal reasoning paradigms.
This paper aims to provide a comprehensive overview of this emerging field, attract attention, encourage discussions, bring to the forefront the urgency of developing novel causal reasoning methods.
arXiv Detail & Related papers (2022-04-26T02:22:28Z) - Relating Blindsight and AI: A Review [2.071592865573579]
We review research on the phenomenon of blindsight in an attempt to generate ideas for artificial intelligence models.
Blindsight can be considered as a diminished form of visual experience.
arXiv Detail & Related papers (2021-12-09T02:33:11Z) - Data augmentation and image understanding [2.123756175601459]
dissertation explores some advantageous synergies between machine learning, cognitive science and neuroscience.
dissertation focuses on learning representations that are more aligned with visual perception and the biological vision.
arXiv Detail & Related papers (2020-12-28T11:00:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.