Understanding How Blind Users Handle Object Recognition Errors: Strategies and Challenges
- URL: http://arxiv.org/abs/2408.03303v1
- Date: Tue, 6 Aug 2024 17:09:56 GMT
- Title: Understanding How Blind Users Handle Object Recognition Errors: Strategies and Challenges
- Authors: Jonggi Hong, Hernisa Kacorri,
- Abstract summary: This paper presents a study aimed at understanding blind users' interaction with object recognition systems for identifying and avoiding errors.
We conducted a user study involving 12 blind and low-vision participants.
We gained insights into users' experiences, challenges, and strategies for identifying errors in camera-based assistive technologies and object recognition systems.
- Score: 10.565823004989817
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Object recognition technologies hold the potential to support blind and low-vision people in navigating the world around them. However, the gap between benchmark performances and practical usability remains a significant challenge. This paper presents a study aimed at understanding blind users' interaction with object recognition systems for identifying and avoiding errors. Leveraging a pre-existing object recognition system, URCam, fine-tuned for our experiment, we conducted a user study involving 12 blind and low-vision participants. Through in-depth interviews and hands-on error identification tasks, we gained insights into users' experiences, challenges, and strategies for identifying errors in camera-based assistive technologies and object recognition systems. During interviews, many participants preferred independent error review, while expressing apprehension toward misrecognitions. In the error identification task, participants varied viewpoints, backgrounds, and object sizes in their images to avoid and overcome errors. Even after repeating the task, participants identified only half of the errors, and the proportion of errors identified did not significantly differ from their first attempts. Based on these insights, we offer implications for designing accessible interfaces tailored to the needs of blind and low-vision users in identifying object recognition errors.
Related papers
- Collaborative Knowledge Infusion for Low-resource Stance Detection [83.88515573352795]
Target-related knowledge is often needed to assist stance detection models.
We propose a collaborative knowledge infusion approach for low-resource stance detection tasks.
arXiv Detail & Related papers (2024-03-28T08:32:14Z) - Assessing the Efficacy of Deep Learning Approaches for Facial Expression Recognition in Individuals with Intellectual Disabilities [0.7124736158080939]
We train a set of 12 convolutional neural networks in different approaches to recognize facial expressions in individuals with intellectual disabilities.
Our findings show the need of facial expression recognition within this population through tailored user-specific training methodologies.
arXiv Detail & Related papers (2024-01-22T12:02:40Z) - Visual Hindsight Self-Imitation Learning for Interactive Navigation [18.02337761240743]
We propose Visual Hindsight Self-Imitation Learning ( VHS) for enhancing sample efficiency through hindsight goal re-labeling and self-imitation.
VHS outperforms existing techniques in interactive visual navigation tasks.
arXiv Detail & Related papers (2023-12-05T05:34:12Z) - When Measures are Unreliable: Imperceptible Adversarial Perturbations
toward Top-$k$ Multi-Label Learning [83.8758881342346]
A novel loss function is devised to generate adversarial perturbations that could achieve both visual and measure imperceptibility.
Experiments on large-scale benchmark datasets demonstrate the superiority of our proposed method in attacking the top-$k$ multi-label systems.
arXiv Detail & Related papers (2023-07-27T13:18:47Z) - Learning Common Rationale to Improve Self-Supervised Representation for
Fine-Grained Visual Recognition Problems [61.11799513362704]
We propose learning an additional screening mechanism to identify discriminative clues commonly seen across instances and classes.
We show that a common rationale detector can be learned by simply exploiting the GradCAM induced from the SSL objective.
arXiv Detail & Related papers (2023-03-03T02:07:40Z) - Gait Recognition Based on Deep Learning: A Survey [1.731119436110423]
Biometry-based control systems may not rely on individual expected behavior or cooperation to operate appropriately.
Some works suggest addressing the problem through gait recognition approaches.
Deep learning-based approaches recently emerged as a robust set of tools to deal with virtually any image and computer-vision related problem.
arXiv Detail & Related papers (2022-01-10T12:44:42Z) - Unsupervised Person Re-Identification: A Systematic Survey of Challenges
and Solutions [64.68497473454816]
Unsupervised person Re-ID has drawn increasing attention for its potential to address the scalability issue in person Re-ID.
Unsupervised person Re-ID is challenging primarily due to lacking identity labels to supervise person feature learning.
This survey review recent works on unsupervised person Re-ID from the perspective of challenges and solutions.
arXiv Detail & Related papers (2021-09-01T00:01:35Z) - Harnessing Unrecognizable Faces for Face Recognition [87.80037162457427]
We propose a measure of recognizability of a face image, implemented by a deep neural network trained using mostly recognizable identities.
We show that accounting for recognizability reduces error rate of single-image face recognition by 58% at FAR=1e-5.
arXiv Detail & Related papers (2021-06-08T05:25:03Z) - Masked Contrastive Learning for Anomaly Detection [10.499890749386676]
We propose a task-specific variant of contrastive learning named masked contrastive learning.
We also propose a new inference method dubbed self-ensemble inference.
arXiv Detail & Related papers (2021-05-18T19:27:02Z) - More Than Accuracy: Towards Trustworthy Machine Learning Interfaces for
Object Recognition [0.0]
This paper investigates the user experience of visualizations of a machine learning (ML) system that recognizes objects in images.
We exposed users with a background in ML to three visualizations of three systems with different levels of accuracy.
In interviews, we explored how the visualization helped users assess the accuracy of systems in use and how the visualization and the accuracy of the system affected trust and reliance.
arXiv Detail & Related papers (2020-08-05T07:56:37Z) - Learning to Detect Important People in Unlabelled Images for
Semi-supervised Important People Detection [85.91577271918783]
We propose learning important people detection on partially annotated images.
Our approach iteratively learns to assign pseudo-labels to individuals in un-annotated images.
We have collected two large-scale datasets for evaluation.
arXiv Detail & Related papers (2020-04-16T10:09:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.