involve-MI: Informative Planning with High-Dimensional Non-Parametric
Beliefs
- URL: http://arxiv.org/abs/2209.11591v1
- Date: Fri, 23 Sep 2022 13:51:36 GMT
- Title: involve-MI: Informative Planning with High-Dimensional Non-Parametric
Beliefs
- Authors: Gilad Rotman, Vadim Indelman
- Abstract summary: We calculate an information-theoretic expected reward, mutual information (MI), over a much lower-dimensional subset of the state, to improve efficiency and without sacrificing accuracy.
We then develop an estimator for the MI which works in a Sequential Monte Carlo manner, and avoids the reconstruction of future belief's surfaces.
This work is then evaluated in a simulation of an active SLAM problem, where the improvement in both accuracy and timing is demonstrated.
- Score: 6.62472687864754
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: One of the most complex tasks of decision making and planning is to gather
information. This task becomes even more complex when the state is
high-dimensional and its belief cannot be expressed with a parametric
distribution. Although the state is high-dimensional, in many problems only a
small fraction of it might be involved in transitioning the state and
generating observations. We exploit this fact to calculate an
information-theoretic expected reward, mutual information (MI), over a much
lower-dimensional subset of the state, to improve efficiency and without
sacrificing accuracy. A similar approach was used in previous works, yet
specifically for Gaussian distributions, and we here extend it for general
distributions. Moreover, we apply the dimensionality reduction for cases in
which the new states are augmented to the previous, yet again without
sacrificing accuracy. We then continue by developing an estimator for the MI
which works in a Sequential Monte Carlo (SMC) manner, and avoids the
reconstruction of future belief's surfaces. Finally, we show how this work is
applied to the informative planning optimization problem. This work is then
evaluated in a simulation of an active SLAM problem, where the improvement in
both accuracy and timing is demonstrated.
Related papers
- Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs [23.584313644411967]
We study the problem of discovering an informative, or agent-centric, state representation that encodes only the relevant information while discarding the irrelevant.
Our results include theory in the deterministic dynamics setting as well as counter-examples for alternative intuitive algorithms.
We show that these can be a double-edged sword: making the algorithms more successful when used correctly and causing dramatic failure when used incorrectly.
arXiv Detail & Related papers (2024-04-22T19:46:16Z) - Non-stationary Reinforcement Learning under General Function
Approximation [60.430936031067006]
We first propose a new complexity metric called dynamic Bellman Eluder (DBE) dimension for non-stationary MDPs.
Based on the proposed complexity metric, we propose a novel confidence-set based model-free algorithm called SW-OPEA.
We show that SW-OPEA is provably efficient as long as the variation budget is not significantly large.
arXiv Detail & Related papers (2023-06-01T16:19:37Z) - Extension of Dynamic Mode Decomposition for dynamic systems with
incomplete information based on t-model of optimal prediction [69.81996031777717]
The Dynamic Mode Decomposition has proved to be a very efficient technique to study dynamic data.
The application of this approach becomes problematic if the available data is incomplete because some dimensions of smaller scale either missing or unmeasured.
We consider a first-order approximation of the Mori-Zwanzig decomposition, state the corresponding optimization problem and solve it with the gradient-based optimization method.
arXiv Detail & Related papers (2022-02-23T11:23:59Z) - Efficient Belief Space Planning in High-Dimensional State Spaces using
PIVOT: Predictive Incremental Variable Ordering Tactic [11.878820609988693]
We examine the problem of online decision making under uncertainty, which we formulate as planning in the belief space.
We call this approach PIVOT: Predictive Incremental Variable Ordering Tactic.
Applying this tactic can also improve the state inference efficiency.
arXiv Detail & Related papers (2021-12-29T07:30:47Z) - Fast Approximation of the Sliced-Wasserstein Distance Using
Concentration of Random Projections [19.987683989865708]
The Sliced-Wasserstein distance (SW) is being increasingly used in machine learning applications.
We propose a new perspective to approximate SW by making use of the concentration of measure phenomenon.
Our method does not require sampling a number of random projections, and is therefore both accurate and easy to use compared to the usual Monte Carlo approximation.
arXiv Detail & Related papers (2021-06-29T13:56:19Z) - Parallel Stochastic Mirror Descent for MDPs [72.75921150912556]
We consider the problem of learning the optimal policy for infinite-horizon Markov decision processes (MDPs)
Some variant of Mirror Descent is proposed for convex programming problems with Lipschitz-continuous functionals.
We analyze this algorithm in a general case and obtain an estimate of the convergence rate that does not accumulate errors during the operation of the method.
arXiv Detail & Related papers (2021-02-27T19:28:39Z) - Neural Joint Entropy Estimation [12.77733789371855]
Estimating the entropy of a discrete random variable is a fundamental problem in information theory and related fields.
In this work, we introduce a practical solution to this problem, which extends the work of McAllester and Statos ( 2020)
The proposed scheme uses the generalization abilities of cross-entropy estimation in deep neural networks (DNNs) to introduce improved entropy estimation accuracy.
arXiv Detail & Related papers (2020-12-21T09:23:39Z) - Augmented Parallel-Pyramid Net for Attention Guided Pose-Estimation [90.28365183660438]
This paper proposes an augmented parallel-pyramid net with attention partial module and differentiable auto-data augmentation.
We define a new pose search space where the sequences of data augmentations are formulated as a trainable and operational CNN component.
Notably, our method achieves the top-1 accuracy on the challenging COCO keypoint benchmark and the state-of-the-art results on the MPII datasets.
arXiv Detail & Related papers (2020-03-17T03:52:17Z) - Approximate MMAP by Marginal Search [78.50747042819503]
We present a strategy for marginal MAP queries in graphical models.
The proposed confidence measure is properly detecting instances for which the algorithm is accurate.
For sufficiently high confidence levels, the algorithm gives the exact solution or an approximation whose Hamming distance from the exact one is small.
arXiv Detail & Related papers (2020-02-12T07:41:13Z) - A Near-Optimal Gradient Flow for Learning Neural Energy-Based Models [93.24030378630175]
We propose a novel numerical scheme to optimize the gradient flows for learning energy-based models (EBMs)
We derive a second-order Wasserstein gradient flow of the global relative entropy from Fokker-Planck equation.
Compared with existing schemes, Wasserstein gradient flow is a smoother and near-optimal numerical scheme to approximate real data densities.
arXiv Detail & Related papers (2019-10-31T02:26:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.