Data Association Aware POMDP Planning with Hypothesis Pruning
Performance Guarantees
- URL: http://arxiv.org/abs/2303.02139v3
- Date: Tue, 1 Aug 2023 18:41:35 GMT
- Title: Data Association Aware POMDP Planning with Hypothesis Pruning
Performance Guarantees
- Authors: Moran Barenboim, Idan Lev-Yehudi and Vadim Indelman
- Abstract summary: We introduce a pruning-based approach for planning with ambiguous data associations.
Our key contribution is to derive bounds between the value function based on the complete set of hypotheses and the value function based on a pruned-subset of the hypotheses.
We demonstrate how these bounds can both be used to certify any pruning in retrospect and propose a novel approach to determine which hypotheses to prune in order to ensure a predefined limit on the loss.
- Score: 7.928094304325113
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Autonomous agents that operate in the real world must often deal with partial
observability, which is commonly modeled as partially observable Markov
decision processes (POMDPs). However, traditional POMDP models rely on the
assumption of complete knowledge of the observation source, known as fully
observable data association. To address this limitation, we propose a planning
algorithm that maintains multiple data association hypotheses, represented as a
belief mixture, where each component corresponds to a different data
association hypothesis. However, this method can lead to an exponential growth
in the number of hypotheses, resulting in significant computational overhead.
To overcome this challenge, we introduce a pruning-based approach for planning
with ambiguous data associations. Our key contribution is to derive bounds
between the value function based on the complete set of hypotheses and the
value function based on a pruned-subset of the hypotheses, enabling us to
establish a trade-off between computational efficiency and performance. We
demonstrate how these bounds can both be used to certify any pruning heuristic
in retrospect and propose a novel approach to determine which hypotheses to
prune in order to ensure a predefined limit on the loss. We evaluate our
approach in simulated environments and demonstrate its efficacy in handling
multi-modal belief hypotheses with ambiguous data associations.
Related papers
- Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation [62.2436697657307]
Prediction-powered inference (PPI) is a method that improves statistical estimates based on limited human-labeled data.
We propose a method called Stratified Prediction-Powered Inference (StratPPI)
We show that the basic PPI estimates can be considerably improved by employing simple data stratification strategies.
arXiv Detail & Related papers (2024-06-06T17:37:39Z) - Deriving Causal Order from Single-Variable Interventions: Guarantees & Algorithm [14.980926991441345]
We show that datasets containing interventional data can be effectively extracted under realistic assumptions about the data distribution.
We introduce interventional faithfulness, which relies on comparisons between the marginal distributions of each variable across observational and interventional settings.
We also introduce Intersort, an algorithm designed to infer the causal order from datasets containing large numbers of single-variable interventions.
arXiv Detail & Related papers (2024-05-28T16:07:17Z) - Conformal Counterfactual Inference under Hidden Confounding [19.190396053530417]
Predicting potential outcomes along with its uncertainty in a counterfactual world poses the foundamental challenge in causal inference.
Existing methods that construct confidence intervals for counterfactuals either rely on the assumption of strong ignorability.
We propose a novel approach based on transductive weighted conformal prediction, which provides confidence intervals for counterfactual outcomes with marginal converage guarantees.
arXiv Detail & Related papers (2024-05-20T21:43:43Z) - Mitigating LLM Hallucinations via Conformal Abstention [70.83870602967625]
We develop a principled procedure for determining when a large language model should abstain from responding in a general domain.
We leverage conformal prediction techniques to develop an abstention procedure that benefits from rigorous theoretical guarantees on the hallucination rate (error rate)
Experimentally, our resulting conformal abstention method reliably bounds the hallucination rate on various closed-book, open-domain generative question answering datasets.
arXiv Detail & Related papers (2024-04-04T11:32:03Z) - Source-Free Unsupervised Domain Adaptation with Hypothesis Consolidation
of Prediction Rationale [53.152460508207184]
Source-Free Unsupervised Domain Adaptation (SFUDA) is a challenging task where a model needs to be adapted to a new domain without access to target domain labels or source domain data.
This paper proposes a novel approach that considers multiple prediction hypotheses for each sample and investigates the rationale behind each hypothesis.
To achieve the optimal performance, we propose a three-step adaptation process: model pre-adaptation, hypothesis consolidation, and semi-supervised learning.
arXiv Detail & Related papers (2024-02-02T05:53:22Z) - Efficient Conformal Prediction under Data Heterogeneity [79.35418041861327]
Conformal Prediction (CP) stands out as a robust framework for uncertainty quantification.
Existing approaches for tackling non-exchangeability lead to methods that are not computable beyond the simplest examples.
This work introduces a new efficient approach to CP that produces provably valid confidence sets for fairly general non-exchangeable data distributions.
arXiv Detail & Related papers (2023-12-25T20:02:51Z) - Advancing Counterfactual Inference through Nonlinear Quantile Regression [77.28323341329461]
We propose a framework for efficient and effective counterfactual inference implemented with neural networks.
The proposed approach enhances the capacity to generalize estimated counterfactual outcomes to unseen data.
Empirical results conducted on multiple datasets offer compelling support for our theoretical assertions.
arXiv Detail & Related papers (2023-06-09T08:30:51Z) - Counterfactual inference for sequential experiments [17.817769460838665]
We consider after-study statistical inference for sequentially designed experiments wherein multiple units are assigned treatments for multiple time points.
Our goal is to provide inference guarantees for the counterfactual mean at the smallest possible scale.
We illustrate our theory via several simulations and a case study involving data from a mobile health clinical trial HeartSteps.
arXiv Detail & Related papers (2022-02-14T17:24:27Z) - D2A-BSP: Distilled Data Association Belief Space Planning with
Performance Guarantees Under Budget Constraints [6.62472687864754]
Unresolved data association in ambiguous and perceptually aliased environments leads to multi-modal hypotheses on both the robot's and the environment state.
We present a novel approach that utilizes only a distilled subset of hypotheses to solve BSP problems while reasoning about data association.
We then demonstrate our approach in an extremely aliased environment, where we manage to significantly reduce computation time without compromising on the quality of the solution.
arXiv Detail & Related papers (2022-02-10T11:13:24Z) - GenDICE: Generalized Offline Estimation of Stationary Values [108.17309783125398]
We show that effective estimation can still be achieved in important applications.
Our approach is based on estimating a ratio that corrects for the discrepancy between the stationary and empirical distributions.
The resulting algorithm, GenDICE, is straightforward and effective.
arXiv Detail & Related papers (2020-02-21T00:27:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.