Data Association Aware POMDP Planning with Hypothesis Pruning
Performance Guarantees
- URL: http://arxiv.org/abs/2303.02139v3
- Date: Tue, 1 Aug 2023 18:41:35 GMT
- Title: Data Association Aware POMDP Planning with Hypothesis Pruning
Performance Guarantees
- Authors: Moran Barenboim, Idan Lev-Yehudi and Vadim Indelman
- Abstract summary: We introduce a pruning-based approach for planning with ambiguous data associations.
Our key contribution is to derive bounds between the value function based on the complete set of hypotheses and the value function based on a pruned-subset of the hypotheses.
We demonstrate how these bounds can both be used to certify any pruning in retrospect and propose a novel approach to determine which hypotheses to prune in order to ensure a predefined limit on the loss.
- Score: 7.928094304325113
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Autonomous agents that operate in the real world must often deal with partial
observability, which is commonly modeled as partially observable Markov
decision processes (POMDPs). However, traditional POMDP models rely on the
assumption of complete knowledge of the observation source, known as fully
observable data association. To address this limitation, we propose a planning
algorithm that maintains multiple data association hypotheses, represented as a
belief mixture, where each component corresponds to a different data
association hypothesis. However, this method can lead to an exponential growth
in the number of hypotheses, resulting in significant computational overhead.
To overcome this challenge, we introduce a pruning-based approach for planning
with ambiguous data associations. Our key contribution is to derive bounds
between the value function based on the complete set of hypotheses and the
value function based on a pruned-subset of the hypotheses, enabling us to
establish a trade-off between computational efficiency and performance. We
demonstrate how these bounds can both be used to certify any pruning heuristic
in retrospect and propose a novel approach to determine which hypotheses to
prune in order to ensure a predefined limit on the loss. We evaluate our
approach in simulated environments and demonstrate its efficacy in handling
multi-modal belief hypotheses with ambiguous data associations.
Related papers
- Model-free Methods for Event History Analysis and Efficient Adjustment (PhD Thesis) [55.2480439325792]
This thesis is a series of independent contributions to statistics unified by a model-free perspective.
The first chapter elaborates on how a model-free perspective can be used to formulate flexible methods that leverage prediction techniques from machine learning.
The second chapter studies the concept of local independence, which describes whether the evolution of one process is directly influenced by another.
arXiv Detail & Related papers (2025-02-11T19:24:09Z) - Prediction-Powered E-Values [0.66567375919026]
We apply ideas of prediction-powered inference to e-values.
We show that every inference procedure that can be framed in terms of e-values has a prediction-powered counterpart.
Our approach is modular and easily integrable into existing algorithms.
arXiv Detail & Related papers (2025-02-06T18:36:01Z) - Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation [62.2436697657307]
Prediction-powered inference (PPI) is a method that improves statistical estimates based on limited human-labeled data.
We propose a method called Stratified Prediction-Powered Inference (StratPPI)
We show that the basic PPI estimates can be considerably improved by employing simple data stratification strategies.
arXiv Detail & Related papers (2024-06-06T17:37:39Z) - Conformal Counterfactual Inference under Hidden Confounding [19.190396053530417]
Predicting potential outcomes along with its uncertainty in a counterfactual world poses the foundamental challenge in causal inference.
Existing methods that construct confidence intervals for counterfactuals either rely on the assumption of strong ignorability.
We propose a novel approach based on transductive weighted conformal prediction, which provides confidence intervals for counterfactual outcomes with marginal converage guarantees.
arXiv Detail & Related papers (2024-05-20T21:43:43Z) - Mitigating LLM Hallucinations via Conformal Abstention [70.83870602967625]
We develop a principled procedure for determining when a large language model should abstain from responding in a general domain.
We leverage conformal prediction techniques to develop an abstention procedure that benefits from rigorous theoretical guarantees on the hallucination rate (error rate)
Experimentally, our resulting conformal abstention method reliably bounds the hallucination rate on various closed-book, open-domain generative question answering datasets.
arXiv Detail & Related papers (2024-04-04T11:32:03Z) - Source-Free Unsupervised Domain Adaptation with Hypothesis Consolidation
of Prediction Rationale [53.152460508207184]
Source-Free Unsupervised Domain Adaptation (SFUDA) is a challenging task where a model needs to be adapted to a new domain without access to target domain labels or source domain data.
This paper proposes a novel approach that considers multiple prediction hypotheses for each sample and investigates the rationale behind each hypothesis.
To achieve the optimal performance, we propose a three-step adaptation process: model pre-adaptation, hypothesis consolidation, and semi-supervised learning.
arXiv Detail & Related papers (2024-02-02T05:53:22Z) - Efficient Conformal Prediction under Data Heterogeneity [79.35418041861327]
Conformal Prediction (CP) stands out as a robust framework for uncertainty quantification.
Existing approaches for tackling non-exchangeability lead to methods that are not computable beyond the simplest examples.
This work introduces a new efficient approach to CP that produces provably valid confidence sets for fairly general non-exchangeable data distributions.
arXiv Detail & Related papers (2023-12-25T20:02:51Z) - Advancing Counterfactual Inference through Nonlinear Quantile Regression [77.28323341329461]
We propose a framework for efficient and effective counterfactual inference implemented with neural networks.
The proposed approach enhances the capacity to generalize estimated counterfactual outcomes to unseen data.
Empirical results conducted on multiple datasets offer compelling support for our theoretical assertions.
arXiv Detail & Related papers (2023-06-09T08:30:51Z) - Counterfactual inference for sequential experiments [17.817769460838665]
We consider after-study statistical inference for sequentially designed experiments wherein multiple units are assigned treatments for multiple time points.
Our goal is to provide inference guarantees for the counterfactual mean at the smallest possible scale.
We illustrate our theory via several simulations and a case study involving data from a mobile health clinical trial HeartSteps.
arXiv Detail & Related papers (2022-02-14T17:24:27Z) - D2A-BSP: Distilled Data Association Belief Space Planning with
Performance Guarantees Under Budget Constraints [6.62472687864754]
Unresolved data association in ambiguous and perceptually aliased environments leads to multi-modal hypotheses on both the robot's and the environment state.
We present a novel approach that utilizes only a distilled subset of hypotheses to solve BSP problems while reasoning about data association.
We then demonstrate our approach in an extremely aliased environment, where we manage to significantly reduce computation time without compromising on the quality of the solution.
arXiv Detail & Related papers (2022-02-10T11:13:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.