Ethical behavior in humans and machines -- Evaluating training data
quality for beneficial machine learning
- URL: http://arxiv.org/abs/2008.11463v1
- Date: Wed, 26 Aug 2020 09:48:38 GMT
- Title: Ethical behavior in humans and machines -- Evaluating training data
quality for beneficial machine learning
- Authors: Thilo Hagendorff
- Abstract summary: This study describes new dimensions of data quality for supervised machine learning applications.
The specific objective of this study is to describe how training data can be selected according to ethical assessments of the behavior it originates from.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine behavior that is based on learning algorithms can be significantly
influenced by the exposure to data of different qualities. Up to now, those
qualities are solely measured in technical terms, but not in ethical ones,
despite the significant role of training and annotation data in supervised
machine learning. This is the first study to fill this gap by describing new
dimensions of data quality for supervised machine learning applications. Based
on the rationale that different social and psychological backgrounds of
individuals correlate in practice with different modes of
human-computer-interaction, the paper describes from an ethical perspective how
varying qualities of behavioral data that individuals leave behind while using
digital technologies have socially relevant ramification for the development of
machine learning applications. The specific objective of this study is to
describe how training data can be selected according to ethical assessments of
the behavior it originates from, establishing an innovative filter regime to
transition from the big data rationale n = all to a more selective way of
processing data for training sets in machine learning. The overarching aim of
this research is to promote methods for achieving beneficial machine learning
applications that could be widely useful for industry as well as academia.
Related papers
- Causal Inference Tools for a Better Evaluation of Machine Learning [0.0]
We introduce key statistical methods such as Ordinary Least Squares (OLS) regression, Analysis of Variance (ANOVA) and logistic regression.
The document serves as a guide for researchers and practitioners, detailing how these techniques can provide deeper insights into model behavior, performance, and fairness.
arXiv Detail & Related papers (2024-10-02T10:03:29Z) - Silver Linings in the Shadows: Harnessing Membership Inference for Machine Unlearning [7.557226714828334]
We present a novel unlearning mechanism designed to remove the impact of specific data samples from a neural network.
In achieving this goal, we crafted a novel loss function tailored to eliminate privacy-sensitive information from weights and activation values of the target model.
Our results showcase the superior performance of our approach in terms of unlearning efficacy and latency as well as the fidelity of the primary task.
arXiv Detail & Related papers (2024-07-01T00:20:26Z) - Machine Learning for QoS Prediction in Vehicular Communication:
Challenges and Solution Approaches [46.52224306624461]
We consider maximum throughput prediction enhancing, for example, streaming or high-definition mapping applications.
We highlight how confidence can be built on machine learning technologies by better understanding the underlying characteristics of the collected data.
We use explainable AI to show that machine learning can learn underlying principles of wireless networks without being explicitly programmed.
arXiv Detail & Related papers (2023-02-23T12:29:20Z) - Striving for data-model efficiency: Identifying data externalities on
group performance [75.17591306911015]
Building trustworthy, effective, and responsible machine learning systems hinges on understanding how differences in training data and modeling decisions interact to impact predictive performance.
We focus on a particular type of data-model inefficiency, in which adding training data from some sources can actually lower performance evaluated on key sub-groups of the population.
Our results indicate that data-efficiency is a key component of both accurate and trustworthy machine learning.
arXiv Detail & Related papers (2022-11-11T16:48:27Z) - Impact Learning: A Learning Method from Features Impact and Competition [1.3569491184708429]
This paper introduced a new machine learning algorithm called impact learning.
Impact learning is a supervised learning algorithm that can be consolidated in both classification and regression problems.
It is prepared by the impacts of the highlights from the intrinsic rate of natural increase.
arXiv Detail & Related papers (2022-11-04T04:56:35Z) - Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models.
The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z) - Can Information Behaviour Inform Machine Learning? [0.0]
The paper illustrates how human information behaviour research can bring to machine learning a more nuanced view of information and informing.
Despite their clear differences, the fields of information behaviour and machine learning share many common objectives, paradigms, and key research questions.
arXiv Detail & Related papers (2022-05-01T19:00:52Z) - Human-in-the-Loop Disinformation Detection: Stance, Sentiment, or
Something Else? [93.91375268580806]
Both politics and pandemics have recently provided ample motivation for the development of machine learning-enabled disinformation (a.k.a. fake news) detection algorithms.
Existing literature has focused primarily on the fully-automated case, but the resulting techniques cannot reliably detect disinformation on the varied topics, sources, and time scales required for military applications.
By leveraging an already-available analyst as a human-in-the-loop, canonical machine learning techniques of sentiment analysis, aspect-based sentiment analysis, and stance detection become plausible methods to use for a partially-automated disinformation detection system.
arXiv Detail & Related papers (2021-11-09T13:30:34Z) - Human-Robot Collaboration and Machine Learning: A Systematic Review of
Recent Research [69.48907856390834]
Human-robot collaboration (HRC) is the approach that explores the interaction between a human and a robot.
This paper proposes a thorough literature review of the use of machine learning techniques in the context of HRC.
arXiv Detail & Related papers (2021-10-14T15:14:33Z) - What Matters in Learning from Offline Human Demonstrations for Robot
Manipulation [64.43440450794495]
We conduct an extensive study of six offline learning algorithms for robot manipulation.
Our study analyzes the most critical challenges when learning from offline human data.
We highlight opportunities for learning from human datasets.
arXiv Detail & Related papers (2021-08-06T20:48:30Z) - Knowledge as Invariance -- History and Perspectives of
Knowledge-augmented Machine Learning [69.99522650448213]
Research in machine learning is at a turning point.
Research interests are shifting away from increasing the performance of highly parameterized models to exceedingly specific tasks.
This white paper provides an introduction and discussion of this emerging field in machine learning research.
arXiv Detail & Related papers (2020-12-21T15:07:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.