Ethical conundrums: Hacked data in the study of far-right violent extremism
- URL: http://arxiv.org/abs/2511.10924v1
- Date: Fri, 14 Nov 2025 03:27:09 GMT
- Title: Ethical conundrums: Hacked data in the study of far-right violent extremism
- Authors: Lise Waldek, Brian Ballsun-Stanton, Muhammad Iqbal, David Kernot, Debra Smith,
- Abstract summary: This article outlines the ethical debates that arose when considering the use of hacked data to examine online far-right violent extremism.<n>It argues that under certain circumstances, researchers can do ethical research with hacked data.
- Score: 0.2754772107483804
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Ethical conduct in digital research is full of grey areas. Disciplinary, institutional and individual norms and conventions developed to support research are challenged, often leaving scholars with a sense of unease or lack of clarity. The growing availability of hacked data is one area. Discussions and debates around the use of these datasets in research are extremely limited. Reviews of the history, culture, or morality of the act of hacking are topics that have attracted some scholarly attention. However, how to undertake research with this data is less examined and provides an opportunity for the generation of reflexive ethical practice. This article presents a case-study outlining the ethical debates that arose when considering the use of hacked data to examine online far-right violent extremism. It argues that under certain circumstances, researchers can do ethical research with hacked data. However, to do so we must proactively and continually engage deeply with ethical quandaries and dilemmas.
Related papers
- [Extended] Ethics in Computer Security Research: A Data-Driven Assessment of the Past, the Present, and the Possible Future [21.60516344845455]
Researchers in computer security lack clear guidance on how to make, document, and assess ethical decisions.<n>We review all 1154 top-tier security papers published in 2024, finding inconsistent levels of ethics reporting.<n>We report on the results of a semi-structured interview study with 24 computer security and privacy researchers.
arXiv Detail & Related papers (2025-09-11T11:06:56Z) - The Only Way is Ethics: A Guide to Ethical Research with Large Language Models [53.316174782223115]
'LLM Ethics Whitepaper' is an open resource for NLP practitioners and those tasked with evaluating the ethical implications of others' work.<n>Our goal is to translate ethics literature into concrete recommendations and provocations for thinking with clear first steps.<n>'LLM Ethics Whitepaper' distils a thorough literature review into clear Do's and Don'ts, which we present also in this paper.
arXiv Detail & Related papers (2024-12-20T16:14:43Z) - Web Scraping for Research: Legal, Ethical, Institutional, and Scientific Considerations [11.851771490297693]
This paper proposes a comprehensive framework for web scraping in social science research for U.S.-based researchers.<n>We present an overview of the current regulatory environment impacting when and how researchers can access, collect, store, and share data via scraping.<n>We then provide researchers with recommendations to conduct scraping in a scientifically legitimate and ethical manner.
arXiv Detail & Related papers (2024-10-30T20:20:44Z) - Eagle: Ethical Dataset Given from Real Interactions [74.7319697510621]
We create datasets extracted from real interactions between ChatGPT and users that exhibit social biases, toxicity, and immoral problems.
Our experiments show that Eagle captures complementary aspects, not covered by existing datasets proposed for evaluation and mitigation of such ethical challenges.
arXiv Detail & Related papers (2024-02-22T03:46:02Z) - Metaethical Perspectives on 'Benchmarking' AI Ethics [81.65697003067841]
Benchmarks are seen as the cornerstone for measuring technical progress in Artificial Intelligence (AI) research.
An increasingly prominent research area in AI is ethics, which currently has no set of benchmarks nor commonly accepted way for measuring the 'ethicality' of an AI system.
We argue that it makes more sense to talk about 'values' rather than 'ethics' when considering the possible actions of present and future AI systems.
arXiv Detail & Related papers (2022-04-11T14:36:39Z) - Yes-Yes-Yes: Donation-based Peer Reviewing Data Collection for ACL
Rolling Review and Beyond [58.71736531356398]
We present an in-depth discussion of peer reviewing data, outline the ethical and legal desiderata for peer reviewing data collection, and propose the first continuous, donation-based data collection workflow.
We report on the ongoing implementation of this workflow at the ACL Rolling Review and deliver the first insights obtained with the newly collected data.
arXiv Detail & Related papers (2022-01-27T11:02:43Z) - A Non-Expert's Introduction to Data Ethics for Mathematicians [0.0]
I begin with some background information and societal context for data ethics.
I briefly highlight a few efforts -- at my home institution and elsewhere -- on data ethics, society, and social good.
I then discuss open data in research, research replicability and some other ethical issues in research.
I then discuss ethical principles, institutional review boards, and a few other considerations in the scientific use of human data.
arXiv Detail & Related papers (2022-01-18T23:31:06Z) - Use of Formal Ethical Reviews in NLP Literature: Historical Trends and
Current Practices [6.195761193461355]
Ethical aspects of research in language technologies have received much attention recently.
It is a standard practice to get a study involving human subjects reviewed and approved by a professional ethics committee/board of the institution.
With the rising concerns and discourse around the ethics of NLP, do we also observe a rise in formal ethical reviews of NLP studies?
arXiv Detail & Related papers (2021-06-02T12:12:59Z) - An Ethical Highlighter for People-Centric Dataset Creation [62.886916477131486]
We propose an analytical framework to guide ethical evaluation of existing datasets and to serve future dataset creators in avoiding missteps.
Our work is informed by a review and analysis of prior works and highlights where such ethical challenges arise.
arXiv Detail & Related papers (2020-11-27T07:18:44Z) - Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life
Anecdotes [72.64975113835018]
Motivated by descriptive ethics, we investigate a novel, data-driven approach to machine ethics.
We introduce Scruples, the first large-scale dataset with 625,000 ethical judgments over 32,000 real-life anecdotes.
Our dataset presents a major challenge to state-of-the-art neural language models, leaving significant room for improvement.
arXiv Detail & Related papers (2020-08-20T17:34:15Z) - Ethical issues with using Internet of Things devices in citizen science
research: A scoping review [1.933681537640272]
This chapter presents a scoping review of published scientific studies that utilise both citizen scientists and Internet of Things devices.
We selected studies where the authors had included at least a short discussion of the ethical issues encountered during the research process.
Following this analysis, our discussion provides recommendations for researchers who wish to integrate citizen scientists and Internet of Things devices into their research.
arXiv Detail & Related papers (2020-07-18T12:22:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.