Diptychs of human and machine perceptions
- URL: http://arxiv.org/abs/2010.13864v1
- Date: Mon, 12 Oct 2020 10:22:28 GMT
- Title: Diptychs of human and machine perceptions
- Authors: Vivien Cabannes and Thomas Kerdreux and Louis Thiry
- Abstract summary: We propose visual creations that put differences in algorithms and humans emphperceptions into perspective.
We exploit saliency maps of neural networks and visual focus of humans to create diptychs that are reinterpretations of an original image according to both machine and human attentions.
- Score: 7.41960767776045
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We propose visual creations that put differences in algorithms and humans
\emph{perceptions} into perspective. We exploit saliency maps of neural
networks and visual focus of humans to create diptychs that are
reinterpretations of an original image according to both machine and human
attentions. Using those diptychs as a qualitative evaluation of perception, we
discuss some crucial issues of current \textit{task-oriented} artificial
intelligence.
Related papers
- Toward a Machine Bertin: Why Visualization Needs Design Principles for Machine Cognition [0.27074235008521247]
Vision-language models (VLMs) increasingly consume chart images in automated analysis pipelines.<n>Current approaches address this gap primarily by bypassing vision entirely.<n>This paper makes the case that the visualization field needs to investigate machine-oriented visual design as a distinct research problem.
arXiv Detail & Related papers (2026-02-02T01:39:33Z) - When Does Perceptual Alignment Benefit Vision Representations? [76.32336818860965]
We investigate how aligning vision model representations to human perceptual judgments impacts their usability.
We find that aligning models to perceptual judgments yields representations that improve upon the original backbones across many downstream tasks.
Our results suggest that injecting an inductive bias about human perceptual knowledge into vision models can contribute to better representations.
arXiv Detail & Related papers (2024-10-14T17:59:58Z) - Achieving More Human Brain-Like Vision via Human EEG Representational Alignment [1.811217832697894]
We present 'Re(presentational)Al(ignment)net', a vision model aligned with human brain activity based on non-invasive EEG.
Our innovative image-to-brain multi-layer encoding framework advances human neural alignment by optimizing multiple model layers.
Our findings suggest that ReAlnet represents a breakthrough in bridging the gap between artificial and human vision, and paving the way for more brain-like artificial intelligence systems.
arXiv Detail & Related papers (2024-01-30T18:18:41Z) - Explaining Deep Face Algorithms through Visualization: A Survey [57.60696799018538]
This work undertakes a first-of-its-kind meta-analysis of explainability algorithms in the face domain.
We review existing face explainability works and reveal valuable insights into the structure and hierarchy of face networks.
arXiv Detail & Related papers (2023-09-26T07:16:39Z) - BI AVAN: Brain inspired Adversarial Visual Attention Network [67.05560966998559]
We propose a brain-inspired adversarial visual attention network (BI-AVAN) to characterize human visual attention directly from functional brain activity.
Our model imitates the biased competition process between attention-related/neglected objects to identify and locate the visual objects in a movie frame the human brain focuses on in an unsupervised manner.
arXiv Detail & Related papers (2022-10-27T22:20:36Z) - Computational imaging with the human brain [1.614301262383079]
Brain-computer interfaces (BCIs) are enabling a range of new possibilities and routes for augmenting human capability.
We demonstrate ghost imaging of a hidden scene using the human visual system that is combined with an adaptive computational imaging scheme.
This brain-computer connectivity demonstrates a form of augmented human computation that could in the future extend the sensing range of human vision.
arXiv Detail & Related papers (2022-10-07T08:40:18Z) - A domain adaptive deep learning solution for scanpath prediction of
paintings [66.46953851227454]
This paper focuses on the eye-movement analysis of viewers during the visual experience of a certain number of paintings.
We introduce a new approach to predicting human visual attention, which impacts several cognitive functions for humans.
The proposed new architecture ingests images and returns scanpaths, a sequence of points featuring a high likelihood of catching viewers' attention.
arXiv Detail & Related papers (2022-09-22T22:27:08Z) - Robustness of Humans and Machines on Object Recognition with Extreme
Image Transformations [0.0]
We introduce a novel set of image transforms and evaluate humans and networks on an object recognition task.
We found performance for a few common networks quickly decreases while humans are able to recognize objects with a high accuracy.
arXiv Detail & Related papers (2022-05-09T17:15:54Z) - Towards Human-Understandable Visual Explanations:Imperceptible
High-frequency Cues Can Better Be Removed [46.36600006968488]
We argue that the capabilities of humans, constrained by the Human Visual System (HVS) and psychophysics, need to be taken into account.
We conduct a case study regarding the classification of real vs. fake face images, where many of the distinguishing features picked up by standard neural networks turn out not to be perceptible to humans.
arXiv Detail & Related papers (2021-04-16T08:11:30Z) - Matching Representations of Explainable Artificial Intelligence and Eye
Gaze for Human-Machine Interaction [0.7742297876120561]
Rapid non-verbal communication of task-based stimuli is a challenge in human-machine teaming.
In this work, we examine the correlations between visual heatmap explanations of a neural network trained to predict driving behavior and eye gaze heatmaps of human drivers.
arXiv Detail & Related papers (2021-01-30T07:42:56Z) - Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and
Reasoning [78.13740873213223]
Bongard problems (BPs) were introduced as an inspirational challenge for visual cognition in intelligent systems.
We propose a new benchmark Bongard-LOGO for human-level concept learning and reasoning.
arXiv Detail & Related papers (2020-10-02T03:19:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.