Measuring uncertainty in human visual segmentation
- URL: http://arxiv.org/abs/2301.07807v3
- Date: Wed, 11 Oct 2023 07:07:18 GMT
- Title: Measuring uncertainty in human visual segmentation
- Authors: Jonathan Vacher, Claire Launay, Pascal Mamassian, Ruben Coen-Cagli
- Abstract summary: We propose a new, integrated approach to measure perceptual segmentation maps.
We measure pixel-based same--different judgments and perform model-based reconstruction of the underlying segmentation map.
We show that image uncertainty affects measured human variability, and it influences how participants weigh different visual features.
- Score: 1.2289361708127877
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Segmenting visual stimuli into distinct groups of features and visual objects
is central to visual function. Classical psychophysical methods have helped
uncover many rules of human perceptual segmentation, and recent progress in
machine learning has produced successful algorithms. Yet, the computational
logic of human segmentation remains unclear, partially because we lack
well-controlled paradigms to measure perceptual segmentation maps and compare
models quantitatively. Here we propose a new, integrated approach: given an
image, we measure multiple pixel-based same--different judgments and perform
model--based reconstruction of the underlying segmentation map. The
reconstruction is robust to several experimental manipulations and captures the
variability of individual participants. We demonstrate the validity of the
approach on human segmentation of natural images and composite textures. We
show that image uncertainty affects measured human variability, and it
influences how participants weigh different visual features. Because any
putative segmentation algorithm can be inserted to perform the reconstruction,
our paradigm affords quantitative tests of theories of perception as well as
new benchmarks for segmentation algorithms.
Related papers
- Evaluating Multiview Object Consistency in Humans and Image Models [68.36073530804296]
We leverage an experimental design from the cognitive sciences which requires zero-shot visual inferences about object shape.
We collect 35K trials of behavioral data from over 500 participants.
We then evaluate the performance of common vision models.
arXiv Detail & Related papers (2024-09-09T17:59:13Z) - Explainable Metric Learning for Deflating Data Bias [2.977255700811213]
We present an explainable metric learning framework, which constructs hierarchical levels of semantic segments of an image for better interpretability.
Our approach enables a more human-understandable similarity measurement between two images based on the semantic segments within it.
arXiv Detail & Related papers (2024-07-05T21:07:27Z) - Bayesian Unsupervised Disentanglement of Anatomy and Geometry for Deep Groupwise Image Registration [50.62725807357586]
This article presents a general Bayesian learning framework for multi-modal groupwise image registration.
We propose a novel hierarchical variational auto-encoding architecture to realise the inference procedure of the latent variables.
Experiments were conducted to validate the proposed framework, including four different datasets from cardiac, brain, and abdominal medical images.
arXiv Detail & Related papers (2024-01-04T08:46:39Z) - Two Approaches to Supervised Image Segmentation [55.616364225463066]
The present work develops comparison experiments between deep learning and multiset neurons approaches.
The deep learning approach confirmed its potential for performing image segmentation.
The alternative multiset methodology allowed for enhanced accuracy while requiring little computational resources.
arXiv Detail & Related papers (2023-07-19T16:42:52Z) - Semantic Brain Decoding: from fMRI to conceptually similar image
reconstruction of visual stimuli [0.29005223064604074]
We propose a novel approach to brain decoding that also relies on semantic and contextual similarity.
We employ an fMRI dataset of natural image vision and create a deep learning decoding pipeline inspired by the existence of both bottom-up and top-down processes in human vision.
We produce reconstructions of visual stimuli that match the original content very well on a semantic level, surpassing the state of the art in previous literature.
arXiv Detail & Related papers (2022-12-13T16:54:08Z) - A kinetic approach to consensus-based segmentation of biomedical images [39.58317527488534]
We apply a kinetic version of a bounded confidence consensus model to biomedical segmentation problems.
The large time behavior of the system is then computed with the aid of a surrogate Fokker-Planck approach.
We minimize the introduced segmentation metric for a relevant set of 2D gray-scale images.
arXiv Detail & Related papers (2022-11-08T09:54:34Z) - Just Noticeable Difference for Machine Perception and Generation of
Regularized Adversarial Images with Minimal Perturbation [8.920717493647121]
We introduce a measure for machine perception inspired by the concept of Just Noticeable Difference (JND) of human perception.
We suggest an adversarial image generation algorithm, which iteratively distorts an image by an additive noise until the machine learning model detects the change in the image by outputting a false label.
We evaluate the adversarial images generated by our algorithm both qualitatively and quantitatively on CIFAR10, ImageNet, and MS COCO datasets.
arXiv Detail & Related papers (2021-02-16T11:01:55Z) - DRG: Dual Relation Graph for Human-Object Interaction Detection [65.50707710054141]
We tackle the challenging problem of human-object interaction (HOI) detection.
Existing methods either recognize the interaction of each human-object pair in isolation or perform joint inference based on complex appearance-based features.
In this paper, we leverage an abstract spatial-semantic representation to describe each human-object pair and aggregate the contextual information of the scene via a dual relation graph.
arXiv Detail & Related papers (2020-08-26T17:59:40Z) - Pathological Retinal Region Segmentation From OCT Images Using Geometric
Relation Based Augmentation [84.7571086566595]
We propose improvements over previous GAN-based medical image synthesis methods by jointly encoding the intrinsic relationship of geometry and shape.
The proposed method outperforms state-of-the-art segmentation methods on the public RETOUCH dataset having images captured from different acquisition procedures.
arXiv Detail & Related papers (2020-03-31T11:50:43Z) - Monocular Human Pose and Shape Reconstruction using Part Differentiable
Rendering [53.16864661460889]
Recent works succeed in regression-based methods which estimate parametric models directly through a deep neural network supervised by 3D ground truth.
In this paper, we introduce body segmentation as critical supervision.
To improve the reconstruction with part segmentation, we propose a part-level differentiable part that enables part-based models to be supervised by part segmentation.
arXiv Detail & Related papers (2020-03-24T14:25:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.