Limitations of a proposed correction for slow drifts in decision
criterion
- URL: http://arxiv.org/abs/2205.10912v1
- Date: Sun, 22 May 2022 19:33:19 GMT
- Title: Limitations of a proposed correction for slow drifts in decision
criterion
- Authors: Diksha Gupta and Carlos D. Brody
- Abstract summary: We propose a model-based approach for disambiguating systematic updates from random drifts.
We show that this approach accurately recovers the latent trajectory of drifts in decision criterion.
Our results highlight the advantages of incorporating assumptions about the generative process directly into models of decision-making.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Trial history biases in decision-making tasks are thought to reflect
systematic updates of decision variables, therefore their precise nature
informs conclusions about underlying heuristic strategies and learning
processes. However, random drifts in decision variables can corrupt this
inference by mimicking the signatures of systematic updates. Hence, identifying
the trial-by-trial evolution of decision variables requires methods that can
robustly account for such drifts. Recent studies (Lak'20, Mendon\c{c}a'20) have
made important advances in this direction, by proposing a convenient method to
correct for the influence of slow drifts in decision criterion, a key decision
variable. Here we apply this correction to a variety of updating scenarios, and
evaluate its performance. We show that the correction fails for a wide range of
commonly assumed systematic updating strategies, distorting one's inference
away from the veridical strategies towards a narrow subset. To address these
limitations, we propose a model-based approach for disambiguating systematic
updates from random drifts, and demonstrate its success on real and synthetic
datasets. We show that this approach accurately recovers the latent trajectory
of drifts in decision criterion as well as the generative systematic updates
from simulated data. Our results offer recommendations for methods to account
for the interactions between history biases and slow drifts, and highlight the
advantages of incorporating assumptions about the generative process directly
into models of decision-making.
Related papers
- Making Large Language Models Better Planners with Reasoning-Decision Alignment [70.5381163219608]
We motivate an end-to-end decision-making model based on multimodality-augmented LLM.
We propose a reasoning-decision alignment constraint between the paired CoTs and planning results.
We dub our proposed large language planners with reasoning-decision alignment as RDA-Driver.
arXiv Detail & Related papers (2024-08-25T16:43:47Z) - Sequential sampling without comparison to boundary through model-free reinforcement learning [0.0]
We propose a model-free reinforcement learning algorithm for perceptual decisions under uncertainty.
Our model learns whether to commit to a decision given the available evidence or continue sampling information at a cost.
arXiv Detail & Related papers (2024-08-12T11:56:39Z) - Decision-Focused Forecasting: Decision Losses for Multistage Optimisation [0.0]
We propose decision-focused forecasting, a multiple-implicitlayer model which in its training accounts for the intertemporal decision effects of forecasts using differentiable optimisation.
We present an analysis of the gradients produced by this model showing the adjustments made to account for the state-path caused by forecasting.
We demonstrate an application of the model to an energy storage arbitrage task and report that our model outperforms existing approaches.
arXiv Detail & Related papers (2024-05-23T15:48:46Z) - Effective Restoration of Source Knowledge in Continual Test Time
Adaptation [44.17577480511772]
This paper introduces an unsupervised domain change detection method that is capable of identifying domain shifts in dynamic environments.
By restoring the knowledge from the source, it effectively corrects the negative consequences arising from the gradual deterioration of model parameters.
We perform extensive experiments on benchmark datasets to demonstrate the superior performance of our method compared to state-of-the-art adaptation methods.
arXiv Detail & Related papers (2023-11-08T19:21:48Z) - RISE: Robust Individualized Decision Learning with Sensitive Variables [1.5293427903448025]
A naive baseline is to ignore sensitive variables in learning decision rules, leading to significant uncertainty and bias.
We propose a decision learning framework to incorporate sensitive variables during offline training but not include them in the input of the learned decision rule during model deployment.
arXiv Detail & Related papers (2022-11-12T04:31:38Z) - Inverse Online Learning: Understanding Non-Stationary and Reactionary
Policies [79.60322329952453]
We show how to develop interpretable representations of how agents make decisions.
By understanding the decision-making processes underlying a set of observed trajectories, we cast the policy inference problem as the inverse to this online learning problem.
We introduce a practical algorithm for retrospectively estimating such perceived effects, alongside the process through which agents update them.
Through application to the analysis of UNOS organ donation acceptance decisions, we demonstrate that our approach can bring valuable insights into the factors that govern decision processes and how they change over time.
arXiv Detail & Related papers (2022-03-14T17:40:42Z) - Error-based Knockoffs Inference for Controlled Feature Selection [49.99321384855201]
We propose an error-based knockoff inference method by integrating the knockoff features, the error-based feature importance statistics, and the stepdown procedure together.
The proposed inference procedure does not require specifying a regression model and can handle feature selection with theoretical guarantees.
arXiv Detail & Related papers (2022-03-09T01:55:59Z) - Modularity in Reinforcement Learning via Algorithmic Independence in
Credit Assignment [79.5678820246642]
We show that certain action-value methods are more sample efficient than policy-gradient methods on transfer problems that require only sparse changes to a sequence of previously optimal decisions.
We generalize the recently proposed societal decision-making framework as a more granular formalism than the Markov decision process.
arXiv Detail & Related papers (2021-06-28T21:29:13Z) - Stein Variational Model Predictive Control [130.60527864489168]
Decision making under uncertainty is critical to real-world, autonomous systems.
Model Predictive Control (MPC) methods have demonstrated favorable performance in practice, but remain limited when dealing with complex distributions.
We show that this framework leads to successful planning in challenging, non optimal control problems.
arXiv Detail & Related papers (2020-11-15T22:36:59Z) - Inverse Active Sensing: Modeling and Understanding Timely
Decision-Making [111.07204912245841]
We develop a framework for the general setting of evidence-based decision-making under endogenous, context-dependent time pressure.
We demonstrate how it enables modeling intuitive notions of surprise, suspense, and optimality in decision strategies.
arXiv Detail & Related papers (2020-06-25T02:30:45Z) - Effective Reinforcement Learning through Evolutionary Surrogate-Assisted
Prescription [18.547387505708485]
This paper introduces a general such approach, called Evolutionary Surrogate-Assisted Prescription, or ESP.
ESP forms a promising foundation to decision optimization in real-world problems.
arXiv Detail & Related papers (2020-02-13T06:59:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.