A Review on Oracle Issues in Machine Learning
- URL: http://arxiv.org/abs/2105.01407v1
- Date: Tue, 4 May 2021 10:41:34 GMT
- Title: A Review on Oracle Issues in Machine Learning
- Authors: Diogo Seca
- Abstract summary: oracle is the data, and the data is not always a correct representation of the problem that machine learning tries to model.
We present a survey of the oracle issues found in machine learning and state-of-the-art solutions for dealing with these issues.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Machine learning contrasts with traditional software development in that the
oracle is the data, and the data is not always a correct representation of the
problem that machine learning tries to model. We present a survey of the oracle
issues found in machine learning and state-of-the-art solutions for dealing
with these issues. These include lines of research for differential testing,
metamorphic testing, and test coverage. We also review some recent improvements
to robustness during modeling that reduce the impact of oracle issues, as well
as tools and frameworks for assisting in testing and discovering issues
specific to the dataset.
Related papers
- Underwater Object Detection in the Era of Artificial Intelligence: Current, Challenge, and Future [119.88454942558485]
Underwater object detection (UOD) aims to identify and localise objects in underwater images or videos.
In recent years, artificial intelligence (AI) based methods, especially deep learning methods, have shown promising performance in UOD.
arXiv Detail & Related papers (2024-10-08T00:25:33Z) - Unsupervised Attention Regularization Based Domain Adaptation for Oracle Character Recognition [59.05212866862219]
The study of oracle characters plays an important role in Chinese archaeology and philology.
The difficulty of collecting and annotating real-world scanned oracle characters hinders the development of oracle character recognition.
We develop a novel unsupervised domain adaptation (UDA) method to transfer recognition knowledge from labeled handprinted oracle characters to unlabeled scanned data.
arXiv Detail & Related papers (2024-09-24T09:07:05Z) - Data-driven Machinery Fault Detection: A Comprehensive Review [2.373572816573706]
Timely and accurately identifying faulty machine signals is vital in industrial applications.
Data-driven Machinery Fault Diagnosis (MFD) solutions based on machine/deep learning approaches have been used ubiquitously in manufacturing.
This survey provides a comprehensive review of the articles using different types of machine learning approaches for the detection and diagnosis of various types of machinery faults.
arXiv Detail & Related papers (2024-05-29T07:50:47Z) - Test Oracle Automation in the era of LLMs [52.69509240442899]
Large Language Models (LLMs) have demonstrated remarkable proficiency in tackling diverse software testing tasks.
This paper aims to enable discussions on the potential of using LLMs for test oracle automation, along with the challenges that may emerge during the generation of various types of oracles.
arXiv Detail & Related papers (2024-05-21T13:19:10Z) - The Frontier of Data Erasure: Machine Unlearning for Large Language Models [56.26002631481726]
Large Language Models (LLMs) are foundational to AI advancements.
LLMs pose risks by potentially memorizing and disseminating sensitive, biased, or copyrighted information.
Machine unlearning emerges as a cutting-edge solution to mitigate these concerns.
arXiv Detail & Related papers (2024-03-23T09:26:15Z) - Learning Objective-Specific Active Learning Strategies with Attentive
Neural Processes [72.75421975804132]
Learning Active Learning (LAL) suggests to learn the active learning strategy itself, allowing it to adapt to the given setting.
We propose a novel LAL method for classification that exploits symmetry and independence properties of the active learning problem.
Our approach is based on learning from a myopic oracle, which gives our model the ability to adapt to non-standard objectives.
arXiv Detail & Related papers (2023-09-11T14:16:37Z) - Modelling Concurrency Bugs Using Machine Learning [0.0]
This project aims to compare both common and recent machine learning approaches.
We define a synthetic dataset that we generate with the scope of simulating real-life (concurrent) programs.
We formulate hypotheses about fundamental limits of various machine learning model types.
arXiv Detail & Related papers (2023-05-08T17:30:24Z) - Software Testing for Machine Learning [13.021014899410684]
Machine learning has shown to be susceptible to deception, leading to errors and even fatal failures.
This circumstance calls into question the widespread use of machine learning, especially in safety-critical applications.
This summary talk discusses the current state-of-the-art of software testing for machine learning.
arXiv Detail & Related papers (2022-04-30T08:47:10Z) - Knowledge as Invariance -- History and Perspectives of
Knowledge-augmented Machine Learning [69.99522650448213]
Research in machine learning is at a turning point.
Research interests are shifting away from increasing the performance of highly parameterized models to exceedingly specific tasks.
This white paper provides an introduction and discussion of this emerging field in machine learning research.
arXiv Detail & Related papers (2020-12-21T15:07:19Z) - A Survey of Machine Learning Methods and Challenges for Windows Malware
Classification [43.4550536920809]
Survey aims to be useful both to cybersecurity practitioners who wish to learn more about how machine learning can be applied to the malware problem, and to give data scientists the necessary background into the challenges in this uniquely complicated space.
arXiv Detail & Related papers (2020-06-15T17:46:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.