Related papers: Maximum Likelihood Constraint Inference from Stochastic Demonstrations

Maximum Likelihood Constraint Inference from Stochastic Demonstrations

URL: http://arxiv.org/abs/2102.12554v1
Date: Wed, 24 Feb 2021 20:46:55 GMT
Title: Maximum Likelihood Constraint Inference from Stochastic Demonstrations
Authors: David L. McPherson, Kaylene C. Stocking, S. Shankar Sastry
Abstract summary: This paper extends maximum likelihood constraint inference to applications by using maximum causal entropy likelihoods. We propose an efficient algorithm that computes constraint likelihood and risk tolerance in a unified Bellman backup.
Score: 5.254702845143088
License: http://creativecommons.org/licenses/by/4.0/
Abstract: When an expert operates a perilous dynamic system, ideal constraint information is tacitly contained in their demonstrated trajectories and controls. The likelihood of these demonstrations can be computed, given the system dynamics and task objective, and the maximum likelihood constraints can be identified. Prior constraint inference work has focused mainly on deterministic models. Stochastic models, however, can capture the uncertainty and risk tolerance that are often present in real systems of interest. This paper extends maximum likelihood constraint inference to stochastic applications by using maximum causal entropy likelihoods. Furthermore, we propose an efficient algorithm that computes constraint likelihood and risk tolerance in a unified Bellman backup, allowing us to generalize to stochastic systems without increasing computational complexity.

Related papers

Formal Control for Uncertain Systems via Contract-Based Probabilistic Surrogates (Extended Version) [1.474723404975345]
We provide an abstraction-based technique that scales effectively to higher dimensions while addressing complex nonlinear agent-environment interactions.<n>Our approach trades scalability for conservatism favorably, as demonstrated on a complex high-dimensional vehicle intersection.
arXiv Detail & Related papers (2025-06-20T13:00:50Z)
End-to-End Probabilistic Framework for Learning with Hard Constraints [47.10876360975842]
ProbHardE2E learns systems that can incorporate operational/physical constraints as hard requirements.<n>It enforces hard constraints by exploiting variance information in a novel way.<n>It can incorporate a range of non-linear constraints (increasing the power of modeling and flexibility)
arXiv Detail & Related papers (2025-06-08T05:29:50Z)
Constrained Online Decision-Making: A Unified Framework [14.465944215100746]
We investigate a general formulation of sequential decision-making with stage-wise feasibility constraints.<n>We propose a unified algorithmic framework that captures many existing constrained learning problems.<n>Our result offers a principled foundation for constrained sequential decision-making in both theory and practice.
arXiv Detail & Related papers (2025-05-11T19:22:04Z)
Probabilistic Pontryagin's Maximum Principle for Continuous-Time Model-Based Reinforcement Learning [3.6300632181659234]
We show that minimization of the mean Hamiltonian is a necessary optimality condition when optimizing the mean cost. Our approach offers a principled and practical framework for controlling uncertain systems with learned dynamics.
arXiv Detail & Related papers (2025-04-03T12:51:20Z)
Accelerated zero-order SGD under high-order smoothness and overparameterized regime [79.85163929026146]
We present a novel gradient-free algorithm to solve convex optimization problems. Such problems are encountered in medicine, physics, and machine learning. We provide convergence guarantees for the proposed algorithm under both types of noise.
arXiv Detail & Related papers (2024-11-21T10:26:17Z)
Learning Controlled Stochastic Differential Equations [61.82896036131116]
This work proposes a novel method for estimating both drift and diffusion coefficients of continuous, multidimensional, nonlinear controlled differential equations with non-uniform diffusion. We provide strong theoretical guarantees, including finite-sample bounds for (L2), (Linfty), and risk metrics, with learning rates adaptive to coefficients' regularity. Our method is available as an open-source Python library.
arXiv Detail & Related papers (2024-11-04T11:09:58Z)
Probabilistic Flux Limiters [0.873811641236639]
A popular method to virtually eliminate Gibbs oscillations in under-resolved simulations is to use a flux limiter. Here, we introduce a conceptually distinct type of flux limiter that is designed to handle the effects of randomness in the model. We show that a machine learned, probabilistic flux limiter may be used in a shock capturing code to more accurately capture shock profiles.
arXiv Detail & Related papers (2024-05-13T21:06:53Z)
Correct-by-Construction Control for Stochastic and Uncertain Dynamical Models via Formal Abstractions [44.99833362998488]
We develop an abstraction framework that can be used to solve this problem under various modeling assumptions. We use state-of-the-art verification techniques to compute an optimal policy on the iMDP with guarantees for satisfying the given specification. We then show that, by construction, we can refine this policy into a feedback controller for which these guarantees carry over to the dynamical model.
arXiv Detail & Related papers (2023-11-16T11:03:54Z)
Likelihood Ratio Confidence Sets for Sequential Decision Making [51.66638486226482]
We revisit the likelihood-based inference principle and propose to use likelihood ratios to construct valid confidence sequences. Our method is especially suitable for problems with well-specified likelihoods. We show how to provably choose the best sequence of estimators and shed light on connections to online convex optimization.
arXiv Detail & Related papers (2023-11-08T00:10:21Z)
Online Constraint Tightening in Stochastic Model Predictive Control: A Regression Approach [49.056933332667114]
No analytical solutions exist for chance-constrained optimal control problems. We propose a data-driven approach for learning the constraint-tightening parameters online during control. Our approach yields constraint-tightening parameters that tightly satisfy the chance constraints.
arXiv Detail & Related papers (2023-10-04T16:22:02Z)
Probabilistic Exponential Integrators [36.98314810594263]
Like standard solvers, they suffer performance penalties for certain stiff systems. This paper develops a class of probabilistic exponential solvers with favorable properties. We evaluate the proposed methods on multiple stiff differential equations.
arXiv Detail & Related papers (2023-05-24T10:13:13Z)
When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning [62.00672284480755]
This paper aims to recover the structure of rewards and environment dynamics that underlie observed actions in a fixed, finite set of demonstrations from an expert agent. Accurate models of expertise in executing a task has applications in safety-sensitive applications such as clinical decision making and autonomous driving.
arXiv Detail & Related papers (2023-02-15T04:14:20Z)
Recursive Constraints to Prevent Instability in Constrained Reinforcement Learning [16.019477271828745]
We consider the challenge of finding a deterministic policy for a Markov decision process. This class of problem is known to be hard, but the combined requirements of determinism and uniform optimality can create learning instability. We present a suitable constrained reinforcement learning algorithm that prevents learning instability.
arXiv Detail & Related papers (2022-01-20T02:33:24Z)
Probabilistic robust linear quadratic regulators with Gaussian processes [73.0364959221845]
Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design. We present a novel controller synthesis for linearized GP dynamics that yields robust controllers with respect to a probabilistic stability margin.
arXiv Detail & Related papers (2021-05-17T08:36:18Z)
Incorporating physical constraints in a deep probabilistic machine learning framework for coarse-graining dynamical systems [7.6146285961466]
This paper offers a data-based, probablistic perspective that enables the quantification of predictive uncertainties. We formulate the coarse-graining process by employing a probabilistic state-space model. It is capable of reconstructing the evolution of the full, fine-scale system.
arXiv Detail & Related papers (2019-12-30T16:07:46Z)

This list is automatically generated from the titles and abstracts of the papers in this site.