Related papers: Fairness Testing: A Comprehensive Survey and Analysis of Trends

Fairness Testing: A Comprehensive Survey and Analysis of Trends

URL: http://arxiv.org/abs/2207.10223v4
Date: Wed, 6 Mar 2024 14:07:48 GMT
Title: Fairness Testing: A Comprehensive Survey and Analysis of Trends
Authors: Zhenpeng Chen, Jie M. Zhang, Max Hort, Mark Harman, Federica Sarro
Abstract summary: Unfair behaviors of Machine Learning (ML) software have garnered increasing attention and concern among software engineers. This paper offers a comprehensive survey of existing studies in this field.
Score: 30.637712832450525
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Unfair behaviors of Machine Learning (ML) software have garnered increasing attention and concern among software engineers. To tackle this issue, extensive research has been dedicated to conducting fairness testing of ML software, and this paper offers a comprehensive survey of existing studies in this field. We collect 100 papers and organize them based on the testing workflow (i.e., how to test) and testing components (i.e., what to test). Furthermore, we analyze the research focus, trends, and promising directions in the realm of fairness testing. We also identify widely-adopted datasets and open-source tools for fairness testing.

Related papers

A Survey on Web Testing: On the Rise of AI and Applications in Industry [1.5149438988761574]
This paper presents a systematic literature survey focusing on web testing methodologies, tools, and trends from 2014 to 2024. Our results show that web testing research has been highly active, with ICST as the leading venue. Selenium is the most widely used tool, while industrial adoption and human studies remain comparatively limited.
arXiv Detail & Related papers (2025-03-07T12:39:59Z)
Requirements-Driven Automated Software Testing: A Systematic Review [13.67495800498868]
This study synthesizes the current state of REDAST research, highlights trends, and proposes future directions. This systematic literature review ( SLR) explores the landscape of REDAST by analyzing requirements input, transformation techniques, test outcomes, evaluation methods, and existing limitations.
arXiv Detail & Related papers (2025-02-25T23:13:09Z)
A Comprehensive Survey on Imbalanced Data Learning [45.3186824501823]
imbalanced data is prevalent in various types of raw data and hinders the performance of machine learning. This survey systematically analyzes various real-world data formats. It concludes existing researches for different data formats into four categories: data re-balancing, feature representation, training strategy, and ensemble learning.
arXiv Detail & Related papers (2025-02-13T04:53:17Z)
Testing Research Software: An In-Depth Survey of Practices, Methods, and Tools [3.831549883667425]
Testing research software is challenging due to the software's complexity and to the unique culture of the research software community. This study focuses on test case design, challenges with expected outputs, use of quality metrics, execution methods, tools, and desired tool features.
arXiv Detail & Related papers (2025-01-29T16:27:13Z)
Which Combination of Test Metrics Can Predict Success of a Software Project? A Case Study in a Year-Long Project Course [1.553083901660282]
Testing plays an important role in securing the success of a software development project. We investigate whether we can quantify the effects various types of testing have on functional suitability.
arXiv Detail & Related papers (2024-08-22T04:23:51Z)
A Software Engineering Perspective on Testing Large Language Models: Research, Practice, Tools and Benchmarks [2.8061460833143346]
Large Language Models (LLMs) are rapidly becoming ubiquitous both as stand-alone tools and as components of current and future software systems. To enable usage of LLMs in the high-stake or safety-critical systems of 2030, they need to undergo rigorous testing.
arXiv Detail & Related papers (2024-06-12T13:45:45Z)
Survey of Computerized Adaptive Testing: A Machine Learning Perspective [66.26687542572974]
Computerized Adaptive Testing (CAT) provides an efficient and tailored method for assessing the proficiency of examinees. This paper aims to provide a machine learning-focused survey on CAT, presenting a fresh perspective on this adaptive testing method.
arXiv Detail & Related papers (2024-03-31T15:09:47Z)
Elevating Software Quality in Agile Environments: The Role of Testing Professionals in Unit Testing [0.0]
Testing is an essential quality activity in the software development process. This paper explores the participation of test engineers in unit testing within an industrial context.
arXiv Detail & Related papers (2024-03-20T00:41:49Z)
Large Language Models Based Fuzzing Techniques: A Survey [4.155653485098873]
fuzzing test, as an efficient software testing method, are widely used in various domains. The rapid development of Large Language Models (LLMs) has facilitated their application in the field of software testing. There is a growing trend towards employing fuzzing test generated based on large language models.
arXiv Detail & Related papers (2024-02-01T05:34:03Z)
Are We Testing or Being Tested? Exploring the Practical Applications of Large Language Models in Software Testing [0.0]
A Large Language Model (LLM) represents a cutting-edge artificial intelligence model that generates coherent content. LLM can play a pivotal role in software development, including software testing. This study explores the practical application of LLMs in software testing within an industrial setting.
arXiv Detail & Related papers (2023-12-08T06:30:37Z)
A Comprehensive Survey on Test-Time Adaptation under Distribution Shifts [143.14128737978342]
Test-time adaptation, an emerging paradigm, has the potential to adapt a pre-trained model to unlabeled data during testing, before making predictions. Recent progress in this paradigm highlights the significant benefits of utilizing unlabeled data for training self-adapted models prior to inference.
arXiv Detail & Related papers (2023-03-27T16:32:21Z)
Is margin all you need? An extensive empirical study of active learning on tabular data [66.18464006872345]
We analyze the performance of a variety of active learning algorithms on 69 real-world datasets from the OpenML-CC18 benchmark. Surprisingly, we find that the classical margin sampling technique matches or outperforms all others, including current state-of-art.
arXiv Detail & Related papers (2022-10-07T21:18:24Z)
Towards Informed Design and Validation Assistance in Computer Games Using Imitation Learning [65.12226891589592]
This paper proposes a new approach to automated game validation and testing. Our method leverages a data-driven imitation learning technique, which requires little effort and time and no knowledge of machine learning or programming.
arXiv Detail & Related papers (2022-08-15T11:08:44Z)
Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework [68.96770035057716]
A/B testing is a business strategy to compare a new product with an old one in pharmaceutical, technological, and traditional industries. This paper introduces a reinforcement learning framework for carrying A/B testing in online experiments.
arXiv Detail & Related papers (2020-02-05T10:25:02Z)

This list is automatically generated from the titles and abstracts of the papers in this site.