New methods for new data? An overview and illustration of quantitative
inductive methods for HRM research
- URL: http://arxiv.org/abs/2305.08889v1
- Date: Mon, 15 May 2023 09:51:30 GMT
- Title: New methods for new data? An overview and illustration of quantitative
inductive methods for HRM research
- Authors: Alain LACROUX (UP1 EMS)
- Abstract summary: "Data is the new oil", in short, data would be the essential source of the ongoing fourth industrial revolution.
Unlike oil, there are no major issues here concerning the production of data.
The methodological challenges of data valuation lie, both for practitioners and for academic researchers.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: "Data is the new oil", in short, data would be the essential source of the
ongoing fourth industrial revolution, which has led some commentators to
assimilate too quickly the quantity of data to a source of wealth in itself,
and consider the development of big data as an quasi direct cause of profit.
Human resources management is not escaping this trend, and the accumulation of
large amounts of data on employees is perceived by some entrepreneurs as a
necessary and sufficient condition for the construction of predictive models of
complex work behaviors such as absenteeism or job performance. In fact, the
analogy is somewhat misleading: unlike oil, there are no major issues here
concerning the production of data (whose flows are generated continuously and
at low cost by various information systems), but rather their ''refining'',
i.e. the operations necessary to transform this data into a useful product,
namely into knowledge. This transformation is where the methodological
challenges of data valuation lie, both for practitioners and for academic
researchers. Considerations on the methods applicable to take advantage of the
possibilities offered by these massive data are relatively recent, and often
highlight the disruptive aspect of the current ''data deluge'' to point out
that this evolution would be the source of a revival of empiricism in a
''fourth paradigm'' based on the intensive and ''agnostic'' exploitation of
massive amounts of data in order to bring out new knowledge, following a purely
inductive logic. Although we do not adopt this speculative point of view, it is
clear that data-driven approaches are scarce in quantitative HRM studies.
However, there are well-established methods, particularly in the field of data
mining, which are based on inductive approaches. This area of quantitative
analysis with an inductive aim is still relatively unexplored in HRM ( apart
from typological analyses). The objective of this paper is first to give an
overview of data driven methods that can be used for HRM research, before
proposing an empirical illustration which consists in an exploratory research
combining a latent profile analysis and an exploration by Gaussian graphical
models.
Related papers
- A Survey on Data Synthesis and Augmentation for Large Language Models [35.59526251210408]
This paper reviews and summarizes data generation techniques throughout the lifecycle of Large Language Models.
We discuss the current constraints faced by these methods and investigate potential pathways for future development and research.
arXiv Detail & Related papers (2024-10-16T16:12:39Z) - DISCOVER: A Data-driven Interactive System for Comprehensive Observation, Visualization, and ExploRation of Human Behaviour [6.716560115378451]
We introduce a modular, flexible, yet user-friendly software framework specifically developed to streamline computational-driven data exploration for human behavior analysis.
Our primary objective is to democratize access to advanced computational methodologies, thereby enabling researchers across disciplines to engage in detailed behavioral analysis without the need for extensive technical proficiency.
arXiv Detail & Related papers (2024-07-18T11:28:52Z) - A Survey on Data Selection for Language Models [148.300726396877]
Data selection methods aim to determine which data points to include in a training dataset.
Deep learning is mostly driven by empirical evidence and experimentation on large-scale data is expensive.
Few organizations have the resources for extensive data selection research.
arXiv Detail & Related papers (2024-02-26T18:54:35Z) - A spectrum of physics-informed Gaussian processes for regression in
engineering [0.0]
Despite the growing availability of sensing and data in general, we remain unable to fully characterise many in-service engineering systems and structures from a purely data-driven approach.
This paper pursues the combination of machine learning technology and physics-based reasoning to enhance our ability to make predictive models with limited data.
arXiv Detail & Related papers (2023-09-19T14:39:03Z) - Provably Efficient Offline Reinforcement Learning with Perturbed Data
Sources [23.000116974718]
Existing theoretical studies on offline reinforcement learning (RL) mostly consider a dataset sampled directly from the target task.
In practice, however, data often come from several heterogeneous but related sources.
This work aims at rigorously understanding offline RL with multiple datasets that are collected from randomly perturbed versions of the target task instead of from itself.
arXiv Detail & Related papers (2023-06-14T08:53:20Z) - Synthetic data, real errors: how (not) to publish and use synthetic data [86.65594304109567]
We show how the generative process affects the downstream ML task.
We introduce Deep Generative Ensemble (DGE) to approximate the posterior distribution over the generative process model parameters.
arXiv Detail & Related papers (2023-05-16T07:30:29Z) - Dataset Distillation: A Comprehensive Review [76.26276286545284]
dataset distillation (DD) aims to derive a much smaller dataset containing synthetic samples, based on which the trained models yield performance comparable with those trained on the original dataset.
This paper gives a comprehensive review and summary of recent advances in DD and its application.
arXiv Detail & Related papers (2023-01-17T17:03:28Z) - Synthetic Data in Human Analysis: A Survey [16.562921709882865]
Survey is intended for researchers and practitioners in the field of human analysis.
We conduct a survey that summarises current state-of-the-art methods and the main benefits of using synthetic data.
We also provide an overview of publicly available synthetic datasets and generation models.
arXiv Detail & Related papers (2022-08-19T07:32:34Z) - Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature.
We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z) - Principled Knowledge Extrapolation with GANs [92.62635018136476]
We study counterfactual synthesis from a new perspective of knowledge extrapolation.
We show that an adversarial game with a closed-form discriminator can be used to address the knowledge extrapolation problem.
Our method enjoys both elegant theoretical guarantees and superior performance in many scenarios.
arXiv Detail & Related papers (2022-05-21T08:39:42Z) - Combining Feature and Instance Attribution to Detect Artifacts [62.63504976810927]
We propose methods to facilitate identification of training data artifacts.
We show that this proposed training-feature attribution approach can be used to uncover artifacts in training data.
We execute a small user study to evaluate whether these methods are useful to NLP researchers in practice.
arXiv Detail & Related papers (2021-07-01T09:26:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.