Leveraging Experience in Lazy Search
- URL: http://arxiv.org/abs/2110.04669v1
- Date: Sun, 10 Oct 2021 00:46:44 GMT
- Title: Leveraging Experience in Lazy Search
- Authors: Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots, Siddhartha Srinivasa
- Abstract summary: Lazy graph search algorithms are efficient at solving motion planning problems where edge evaluation is the computational bottleneck.
We formulate this problem as a Markov Decision Process (MDP) on the state of the search problem.
We show that we can compute oracular selectors that can solve the MDP during training.
- Score: 37.75223642505171
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Lazy graph search algorithms are efficient at solving motion planning
problems where edge evaluation is the computational bottleneck. These
algorithms work by lazily computing the shortest potentially feasible path,
evaluating edges along that path, and repeating until a feasible path is found.
The order in which edges are selected is critical to minimizing the total
number of edge evaluations: a good edge selector chooses edges that are not
only likely to be invalid, but also eliminates future paths from consideration.
We wish to learn such a selector by leveraging prior experience. We formulate
this problem as a Markov Decision Process (MDP) on the state of the search
problem. While solving this large MDP is generally intractable, we show that we
can compute oracular selectors that can solve the MDP during training. With
access to such oracles, we use imitation learning to find effective policies.
If new search problems are sufficiently similar to problems solved during
training, the learned policy will choose a good edge evaluation ordering and
solve the motion planning problem quickly. We evaluate our algorithms on a wide
range of 2D and 7D problems and show that the learned selector outperforms
baseline commonly used heuristics. We further provide a novel theoretical
analysis of lazy search in a Bayesian framework as well as regret guarantees on
our imitation learning based approach to motion planning.
Related papers
- Learning the Positions in CountSketch [49.57951567374372]
We consider sketching algorithms which first compress data by multiplication with a random sketch matrix, and then apply the sketch to quickly solve an optimization problem.
In this work, we propose the first learning-based algorithms that also optimize the locations of the non-zero entries.
arXiv Detail & Related papers (2023-06-11T07:28:35Z) - Fast Line Search for Multi-Task Learning [0.0]
We propose a novel idea for line search algorithms in multi-task learning.
The idea is to use latent representation space instead of parameter space for finding step size.
We compare this idea with classical backtracking and gradient methods with a constant learning rate on MNIST, CIFAR-10, Cityscapes tasks.
arXiv Detail & Related papers (2021-10-02T21:02:29Z) - (Machine) Learning to Improve the Empirical Performance of Discrete
Algorithms [0.0]
We train machine learning methods to select the optimal algorithm for given data without human expert opinion.
Our framework recommends various pivot rules that improve overall total performance over just using a fixed default pivot rule.
For the all-pairs shortest path problem, the models trained made a large improvement and our selection is on average.07 percent away from the optimal choice.
arXiv Detail & Related papers (2021-09-29T08:33:09Z) - Machine Learning for Online Algorithm Selection under Censored Feedback [71.6879432974126]
In online algorithm selection (OAS), instances of an algorithmic problem class are presented to an agent one after another, and the agent has to quickly select a presumably best algorithm from a fixed set of candidate algorithms.
For decision problems such as satisfiability (SAT), quality typically refers to the algorithm's runtime.
In this work, we revisit multi-armed bandit algorithms for OAS and discuss their capability of dealing with the problem.
We adapt them towards runtime-oriented losses, allowing for partially censored data while keeping a space- and time-complexity independent of the time horizon.
arXiv Detail & Related papers (2021-09-13T18:10:52Z) - Learning to Sparsify Travelling Salesman Problem Instances [0.5985204759362747]
We use a pruning machine learning as a pre-processing step followed by an exact Programming approach to sparsify the travelling salesman problem.
Our learning approach requires very little training data and is amenable to mathematical analysis.
arXiv Detail & Related papers (2021-04-19T14:35:14Z) - Towards Time-Optimal Any-Angle Path Planning With Dynamic Obstacles [1.370633147306388]
Path finding is a well-studied problem in AI, which is often framed as graph search.
We present two algorithms, grounded in the same idea, that can obtain provably optimal solutions to the considered problem.
arXiv Detail & Related papers (2021-04-14T07:59:53Z) - Towards Optimally Efficient Tree Search with Deep Learning [76.64632985696237]
This paper investigates the classical integer least-squares problem which estimates signals integer from linear models.
The problem is NP-hard and often arises in diverse applications such as signal processing, bioinformatics, communications and machine learning.
We propose a general hyper-accelerated tree search (HATS) algorithm by employing a deep neural network to estimate the optimal estimation for the underlying simplified memory-bounded A* algorithm.
arXiv Detail & Related papers (2021-01-07T08:00:02Z) - Towards Meta-Algorithm Selection [78.13985819417974]
Instance-specific algorithm selection (AS) deals with the automatic selection of an algorithm from a fixed set of candidates.
We show that meta-algorithm-selection can indeed prove beneficial in some cases.
arXiv Detail & Related papers (2020-11-17T17:27:33Z) - Run2Survive: A Decision-theoretic Approach to Algorithm Selection based
on Survival Analysis [75.64261155172856]
survival analysis (SA) naturally supports censored data and offers appropriate ways to use such data for learning distributional models of algorithm runtime.
We leverage such models as a basis of a sophisticated decision-theoretic approach to algorithm selection, which we dub Run2Survive.
In an extensive experimental study with the standard benchmark ASlib, our approach is shown to be highly competitive and in many cases even superior to state-of-the-art AS approaches.
arXiv Detail & Related papers (2020-07-06T15:20:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.