Reimagining Data Work: Participatory Annotation Workshops as Feminist Practice
- URL: http://arxiv.org/abs/2602.22196v1
- Date: Wed, 25 Feb 2026 18:45:42 GMT
- Title: Reimagining Data Work: Participatory Annotation Workshops as Feminist Practice
- Authors: Yujia Gao, Isadora Araujo Cruxên, Helena Suárez Val, Alessandra Jungs de Almeida, Catherine D'Ignazio, Harini Suresh,
- Abstract summary: We show that prioritizing context and pluralism in practice may require bounding'' context and working towards a tactical consensus''<n>We contribute to efforts to reimagine data and AI development as spaces for understanding difference and building solidarity across struggles.
- Score: 37.45130081398238
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: AI systems depend on the invisible and undervalued labor of data workers, who are often treated as interchangeable units rather than collaborators with meaningful expertise. Critical scholars and practitioners have proposed alternative principles for data work, but few empirical studies examine how to enact them in practice. This paper bridges this gap through a case study of multilingual, iterative, and participatory data annotation processes with journalists and activists focused on news narratives of gender-related violence. We offer two methodological contributions. First, we demonstrate how workshops rooted in feminist epistemology can foster dialogue, build community, and disrupt knowledge hierarchies in data annotation. Second, drawing insights from practice, we deepen the analysis of existing feminist and participatory principles. We show that prioritizing context and pluralism in practice may require ``bounding'' context and working towards what we describe as a ``tactical consensus.'' We also explore tensions around materially acknowledging labor while resisting transactional researcher-participant dynamics. Through this work, we contribute to growing efforts to reimagine data and AI development as relational and political spaces for understanding difference, enacting care, and building solidarity across shared struggles.
Related papers
- Talking Point based Ideological Discourse Analysis in News Events [62.18747509565779]
We propose a framework motivated by the theory of ideological discourse analysis to analyze news articles related to real-world events.<n>Our framework represents the news articles using a relational structure - talking points, which captures the interaction between entities, their roles, and media frames along with a topic of discussion.<n>We evaluate our framework's ability to generate these perspectives through automated tasks - ideology and partisan classification tasks, supplemented by human validation.
arXiv Detail & Related papers (2025-04-10T02:52:34Z) - Assessing employment and labour issues implicated by using AI [0.0]
The chapter critiques the dominant reductionist approach in AI and work studies.<n>It advocates for a systemic perspective that emphasizes the interdependence of tasks, roles, and workplace contexts.
arXiv Detail & Related papers (2025-04-08T10:14:19Z) - Delineating Feminist Studies through bibliometric analysis [1.1060425537315088]
This paper proposes a novel approach for identifying gender/sex related publications scattered across diverse scientific disciplines.<n>We employ bibliometric techniques, natural language processing (NLP) and manual curation to compile a dataset of scientific publications.<n>The resulting dataset comprises over 1.9 million scientific documents published between 1668 and 2023, spanning four languages.
arXiv Detail & Related papers (2024-11-27T12:52:51Z) - Towards an Analysis of Discourse and Interactional Pragmatic Reasoning Capabilities of Large Language Models [0.0]
We discuss the scope of the field of pragmatics and suggest a subdivision into discourse pragmatics and interactional pragmatics.
We consider the resulting heterogeneous set of phenomena and methods as a starting point for our survey of work on discourse pragmatics and interactional pragmatics in the context of LLMs.
arXiv Detail & Related papers (2024-08-06T10:02:05Z) - Context-faithful Prompting for Large Language Models [51.194410884263135]
Large language models (LLMs) encode parametric knowledge about world facts.
Their reliance on parametric knowledge may cause them to overlook contextual cues, leading to incorrect predictions in context-sensitive NLP tasks.
We assess and enhance LLMs' contextual faithfulness in two aspects: knowledge conflict and prediction with abstention.
arXiv Detail & Related papers (2023-03-20T17:54:58Z) - Factoring the Matrix of Domination: A Critical Review and Reimagination
of Intersectionality in AI Fairness [55.037030060643126]
Intersectionality is a critical framework that allows us to examine how social inequalities persist.
We argue that adopting intersectionality as an analytical framework is pivotal to effectively operationalizing fairness.
arXiv Detail & Related papers (2023-03-16T21:02:09Z) - PAR: Political Actor Representation Learning with Social Context and
Expert Knowledge [45.215862050840116]
We propose textbfPAR, a textbfPolitical textbfActor textbfRepresentation learning framework.
We retrieve and extract factual statements about legislators to leverage social context information.
We then construct a heterogeneous information network to incorporate social context and use relational graph neural networks to learn legislator representations.
arXiv Detail & Related papers (2022-10-15T19:28:06Z) - Rethinking the Role of Demonstrations: What Makes In-Context Learning
Work? [112.72413411257662]
Large language models (LMs) are able to in-context learn by conditioning on a few input-label pairs (demonstrations) and making predictions for new inputs.
We show that ground truth demonstrations are in fact not required -- randomly replacing labels in the demonstrations barely hurts performance.
We find that other aspects of the demonstrations are the key drivers of end task performance.
arXiv Detail & Related papers (2022-02-25T17:25:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.