Observation-Augmented Contextual Multi-Armed Bandits for Robotic
Exploration with Uncertain Semantic Data
- URL: http://arxiv.org/abs/2312.12583v1
- Date: Tue, 19 Dec 2023 20:28:42 GMT
- Title: Observation-Augmented Contextual Multi-Armed Bandits for Robotic
Exploration with Uncertain Semantic Data
- Authors: Shohei Wakayama and Nisar Ahmed
- Abstract summary: We introduce a new variant of contextual multi-armed bandits called observation-augmented CMABs (OA-CMABs)
OA-CMABs model the expected option outcomes as a function of context features and hidden parameters.
We propose a robust Bayesian inference process for OA-CMABs that is based on the concept of probabilistic data validation.
- Score: 7.795929277007235
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: For robotic decision-making under uncertainty, the balance between
exploitation and exploration of available options must be carefully taken into
account. In this study, we introduce a new variant of contextual multi-armed
bandits called observation-augmented CMABs (OA-CMABs) wherein a decision-making
agent can utilize extra outcome observations from an external information
source. CMABs model the expected option outcomes as a function of context
features and hidden parameters, which are inferred from previous option
outcomes. In OA-CMABs, external observations are also a function of context
features and thus provide additional evidence about the hidden parameters. Yet,
if an external information source is error-prone, the resulting posterior
updates can harm decision-making performance unless the presence of errors is
considered. To this end, we propose a robust Bayesian inference process for
OA-CMABs that is based on the concept of probabilistic data validation. Our
approach handles complex mixture model parameter priors and hybrid observation
likelihoods for semantic data sources, allowing us to develop validation
algorithms based on recently develop probabilistic semantic data association
techniques. Furthermore, to more effectively cope with the combined sources of
uncertainty in OA-CMABs, we derive a new active inference algorithm for option
selection based on expected free energy minimization. This generalizes previous
work on active inference for bandit-based robotic decision-making by accounting
for faulty observations and non-Gaussian inference. Our approaches are
demonstrated on a simulated asynchronous search site selection problem for
space exploration. The results show that even if incorrect observations are
provided by external information sources, efficient decision-making and robust
parameter inference are still achieved in a wide variety of experimental
conditions.
Related papers
- Assumption-Lean Post-Integrated Inference with Negative Control Outcomes [0.0]
We introduce a robust post-integrated inference (PII) method that adjusts for latent heterogeneity using negative control outcomes.
Our method extends to projected direct effect estimands, accounting for hidden mediators, confounders, and moderators.
The proposed doubly robust estimators are consistent and efficient under minimal assumptions and potential misspecification.
arXiv Detail & Related papers (2024-10-07T12:52:38Z) - Bounded and Unbiased Composite Differential Privacy [25.427802467876248]
The objective of differential privacy (DP) is to protect privacy by producing an output distribution that is indistinguishable between two neighboring databases.
Existing solutions attempt to address this issue by employing post-processing or truncation techniques.
We propose a novel differentially private mechanism which uses a composite probability density function to generate bounded and unbiased outputs.
arXiv Detail & Related papers (2023-11-04T04:43:47Z) - Approximating Counterfactual Bounds while Fusing Observational, Biased
and Randomised Data Sources [64.96984404868411]
We address the problem of integrating data from multiple, possibly biased, observational and interventional studies.
We show that the likelihood of the available data has no local maxima.
We then show how the same approach can address the general case of multiple datasets.
arXiv Detail & Related papers (2023-07-31T11:28:24Z) - ALMERIA: Boosting pairwise molecular contrasts with scalable methods [0.0]
ALMERIA is a tool for estimating compound similarities and activity prediction based on pairwise molecular contrasts.
It has been implemented using scalable software and methods to exploit large volumes of data.
Experiments show state-of-the-art performance for molecular activity prediction.
arXiv Detail & Related papers (2023-04-28T16:27:06Z) - Learning to Bound Counterfactual Inference in Structural Causal Models
from Observational and Randomised Data [64.96984404868411]
We derive a likelihood characterisation for the overall data that leads us to extend a previous EM-based algorithm.
The new algorithm learns to approximate the (unidentifiability) region of model parameters from such mixed data sources.
It delivers interval approximations to counterfactual results, which collapse to points in the identifiable case.
arXiv Detail & Related papers (2022-12-06T12:42:11Z) - MissDAG: Causal Discovery in the Presence of Missing Data with
Continuous Additive Noise Models [78.72682320019737]
We develop a general method, which we call MissDAG, to perform causal discovery from data with incomplete observations.
MissDAG maximizes the expected likelihood of the visible part of observations under the expectation-maximization framework.
We demonstrate the flexibility of MissDAG for incorporating various causal discovery algorithms and its efficacy through extensive simulations and real data experiments.
arXiv Detail & Related papers (2022-05-27T09:59:46Z) - The interventional Bayesian Gaussian equivalent score for Bayesian
causal inference with unknown soft interventions [0.0]
In certain settings, such as genomics, we may have data from heterogeneous study conditions, with soft (partial) interventions only pertaining to a subset of the study variables.
We define the interventional BGe score for a mixture of observational and interventional data, where the targets and effects of intervention may be unknown.
arXiv Detail & Related papers (2022-05-05T12:32:08Z) - Leveraging Global Parameters for Flow-based Neural Posterior Estimation [90.21090932619695]
Inferring the parameters of a model based on experimental observations is central to the scientific method.
A particularly challenging setting is when the model is strongly indeterminate, i.e., when distinct sets of parameters yield identical observations.
We present a method for cracking such indeterminacy by exploiting additional information conveyed by an auxiliary set of observations sharing global parameters.
arXiv Detail & Related papers (2021-02-12T12:23:13Z) - Decorrelated Clustering with Data Selection Bias [55.91842043124102]
We propose a novel Decorrelation regularized K-Means algorithm (DCKM) for clustering with data selection bias.
Our DCKM algorithm achieves significant performance gains, indicating the necessity of removing unexpected feature correlations induced by selection bias.
arXiv Detail & Related papers (2020-06-29T08:55:50Z) - A Robust Functional EM Algorithm for Incomplete Panel Count Data [66.07942227228014]
We propose a functional EM algorithm to estimate the counting process mean function under a missing completely at random assumption (MCAR)
The proposed algorithm wraps several popular panel count inference methods, seamlessly deals with incomplete counts and is robust to misspecification of the Poisson process assumption.
We illustrate the utility of the proposed algorithm through numerical experiments and an analysis of smoking cessation data.
arXiv Detail & Related papers (2020-03-02T20:04:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.