Notably Inaccessible -- Data Driven Understanding of Data Science
Notebook (In)Accessibility
- URL: http://arxiv.org/abs/2308.03241v1
- Date: Mon, 7 Aug 2023 01:33:32 GMT
- Title: Notably Inaccessible -- Data Driven Understanding of Data Science
Notebook (In)Accessibility
- Authors: Venkatesh Potluri, Sudheesh Singanamalla, Nussara Tieanklin, Jennifer
Mankoff
- Abstract summary: We perform a large scale systematic analysis of 100000 Jupyter notebooks to identify various accessibility challenges.
We make recommendations to improve accessibility of the artifacts of a notebook, suggest authoring practices, and propose changes to infrastructure to make notebooks accessible.
- Score: 13.428631054625797
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Computational notebooks, tools that facilitate storytelling through
exploration, data analysis, and information visualization, have become the
widely accepted standard in the data science community. These notebooks have
been widely adopted through notebook software such as Jupyter, Datalore and
Google Colab, both in academia and industry. While there is extensive research
to learn how data scientists use computational notebooks, identify their pain
points, and enable collaborative data science practices, very little is known
about the various accessibility barriers experienced by blind and visually
impaired (BVI) users using these notebooks. BVI users are unable to use
computational notebook interfaces due to (1) inaccessibility of the interface,
(2) common ways in which data is represented in these interfaces, and (3)
inability for popular libraries to provide accessible outputs. We perform a
large scale systematic analysis of 100000 Jupyter notebooks to identify various
accessibility challenges in published notebooks affecting the creation and
consumption of these notebooks. Through our findings, we make recommendations
to improve accessibility of the artifacts of a notebook, suggest authoring
practices, and propose changes to infrastructure to make notebooks accessible.
An accessible PDF can be obtained at
https://blvi.dev/noteably-inaccessible-paper
Related papers
- Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by
Self-Supervised Representation Mixing and Embedding Initialization [57.38123229553157]
This paper presents an effective transfer learning framework for language adaptation in text-to-speech systems.
We focus on achieving language adaptation using minimal labeled and unlabeled data.
Experimental results show that our framework is able to synthesize intelligible speech in unseen languages with only 4 utterances of labeled data and 15 minutes of unlabeled data.
arXiv Detail & Related papers (2024-01-23T21:55:34Z) - Make It Make Sense! Understanding and Facilitating Sensemaking in
Computational Notebooks [10.621214052177125]
Porpoise integrates computational notebook features with digital design, grouping cells into labeled sections that can be expanded, collapsed, or annotated for improved sensemaking.
Our study with 24 data scientists found Porpoise enhanced code comprehension, making the experience more akin to reading a book, with one participant describing it as It's really like reading a book.
arXiv Detail & Related papers (2023-12-18T18:33:58Z) - An Interactive Interface for Novel Class Discovery in Tabular Data [54.11148718494725]
Novel Class Discovery (NCD) is the problem of trying to discover novel classes in an unlabeled set, given a labeled set of different but related classes.
The majority of NCD methods proposed so far only deal with image data.
This interface allows a domain expert to easily run state-of-the-art algorithms for NCD in tabular data.
arXiv Detail & Related papers (2023-06-22T14:32:53Z) - SuperNOVA: Design Strategies and Opportunities for Interactive Visualization in Computational Notebooks [34.04783941358773]
We analyze 163 interactive visualization tools for notebooks.
We identify key design implications and trade-offs.
We develop SuperNOVA, an open-source interactive browser to help researchers explore existing notebook visualization tools.
arXiv Detail & Related papers (2023-05-04T17:57:54Z) - The Semantic Reader Project: Augmenting Scholarly Documents through
AI-Powered Interactive Reading Interfaces [54.2590226904332]
We describe the Semantic Reader Project, a effort across multiple institutions to explore automatic creation of dynamic reading interfaces for research papers.
Ten prototype interfaces have been developed and more than 300 participants and real-world users have shown improved reading experiences.
We structure this paper around challenges scholars and the public face when reading research papers.
arXiv Detail & Related papers (2023-03-25T02:47:09Z) - StickyLand: Breaking the Linear Presentation of Computational Notebooks [5.1175396458764855]
StickyLand is a notebook extension for empowering users to freely organize their code in non-linear ways.
With sticky cells that are always shown on the screen, users can quickly access their notes, instantly observe experiment results, and easily build interactive dashboards.
arXiv Detail & Related papers (2022-02-22T18:25:54Z) - Context-aware Execution Migration Tool for Data Science Jupyter
Notebooks on Hybrid Clouds [0.22908242575265025]
This paper presents a solution developed as a Jupyter extension that automatically selects which cells, as well as in which scenarios, such cells should be migrated to a more suitable platform for execution.
Using notebooks from Earth science (remote sensing), image recognition, and hand written digit identification (machine learning), our experiments show notebook state reductions of up to 55x and migration decisions leading to performance gains of up to 3.25x when the user interactivity with the notebook is taken into consideration.
arXiv Detail & Related papers (2021-07-01T02:33:18Z) - GIS and Computational Notebooks [0.0]
This chapter introduces computational notebooks in the geographical context.
It begins by explaining the computational paradigm and philosophy that underlies notebooks.
It then unpacks their architecture to illustrate a notebook user's typical workflow.
arXiv Detail & Related papers (2021-01-02T01:59:14Z) - Synthetic Data: Opening the data floodgates to enable faster, more
directed development of machine learning methods [96.92041573661407]
Many ground-breaking advancements in machine learning can be attributed to the availability of a large volume of rich data.
Many large-scale datasets are highly sensitive, such as healthcare data, and are not widely available to the machine learning community.
Generating synthetic data with privacy guarantees provides one such solution.
arXiv Detail & Related papers (2020-12-08T17:26:10Z) - Adversarial Knowledge Transfer from Unlabeled Data [62.97253639100014]
We present a novel Adversarial Knowledge Transfer framework for transferring knowledge from internet-scale unlabeled data to improve the performance of a classifier.
An important novel aspect of our method is that the unlabeled source data can be of different classes from those of the labeled target data, and there is no need to define a separate pretext task.
arXiv Detail & Related papers (2020-08-13T08:04:27Z) - Federated and continual learning for classification tasks in a society
of devices [59.45414406974091]
Light Federated and Continual Consensus (LFedCon2) is a new federated and continual architecture that uses light, traditional learners.
Our method allows powerless devices (such as smartphones or robots) to learn in real time, locally, continuously, autonomously and from users.
In order to test our proposal, we have applied it in a heterogeneous community of smartphone users to solve the problem of walking recognition.
arXiv Detail & Related papers (2020-06-12T12:37:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.