Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints
- URL: http://arxiv.org/abs/2407.01991v2
- Date: Tue, 16 Jul 2024 11:41:27 GMT
- Title: Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints
- Authors: Kazumi Kasaura,
- Abstract summary: We show that the proposed method outperforms existing methods on both local and global path planning tasks.
We prove the soundness of our approach and show experimentally that the proposed method outperforms existing methods on both local and global path planning tasks.
- Score: 1.6317061277457001
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: To find the shortest paths for all pairs on manifolds with infinitesimally defined metrics, we propose to generate them by predicting midpoints recursively and an actor-critic method to learn midpoint prediction. We prove the soundness of our approach and show experimentally that the proposed method outperforms existing methods on both local and global path planning tasks.
Related papers
- Spatial Clustering Approach for Vessel Path Identification [3.2230949286556627]
We propose a spatial clustering approach for labeling the vessel paths by using only position information.
We develop a path clustering framework employing two methods: a distance-based path modeling and a likelihood estimation method.
arXiv Detail & Related papers (2024-03-09T03:21:18Z) - LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on
Curves [8.037214110171123]
Lane detection plays a critical role in the field of autonomous driving.
We propose a novel approach, LanePtrNet, which treats lane detection as a process of point voting and grouping on ordered sets.
We conduct comprehensive experiments to validate the effectiveness of our proposed approach, demonstrating its superior performance.
arXiv Detail & Related papers (2024-03-08T08:45:42Z) - An approach to robust ICP initialization [77.45039118761837]
We propose an approach to initialize the Iterative Closest Point (ICP) algorithm to match unlabelled point clouds related by rigid transformations.
We derive bounds on the robustness of our approach to noise and numerical experiments confirm our theoretical findings.
arXiv Detail & Related papers (2022-12-10T16:27:25Z) - S$^*$: A Heuristic Information-Based Approximation Framework for
Multi-Goal Path Finding [0.0]
We develop a novel framework for a Multi-Goal Path Finding (MGPF) problem that provides a 2-approximation guarantee.
MGPF aims to find a least-cost path from an origin to a destination such that each node in a given set of goals is visited at least once along the path.
We present numerical results to illustrate the advantages of our framework over conventional alternates in terms of the number of expanded nodes and run time.
arXiv Detail & Related papers (2021-03-15T06:27:37Z) - A Survey on Deep Semi-supervised Learning [51.26862262550445]
We first present a taxonomy for deep semi-supervised learning that categorizes existing methods.
We then offer a detailed comparison of these methods in terms of the type of losses, contributions, and architecture differences.
arXiv Detail & Related papers (2021-02-28T16:22:58Z) - Point-Level Temporal Action Localization: Bridging Fully-supervised
Proposals to Weakly-supervised Losses [84.2964408497058]
Point-level temporal action localization (PTAL) aims to localize actions in untrimmed videos with only one timestamp annotation for each action instance.
Existing methods adopt the frame-level prediction paradigm to learn from the sparse single-frame labels.
This paper attempts to explore the proposal-based prediction paradigm for point-level annotations.
arXiv Detail & Related papers (2020-12-15T12:11:48Z) - Pathwise Conditioning of Gaussian Processes [72.61885354624604]
Conventional approaches for simulating Gaussian process posteriors view samples as draws from marginal distributions of process values at finite sets of input locations.
This distribution-centric characterization leads to generative strategies that scale cubically in the size of the desired random vector.
We show how this pathwise interpretation of conditioning gives rise to a general family of approximations that lend themselves to efficiently sampling Gaussian process posteriors.
arXiv Detail & Related papers (2020-11-08T17:09:37Z) - A maximum-entropy approach to off-policy evaluation in average-reward
MDPs [54.967872716145656]
This work focuses on off-policy evaluation (OPE) with function approximation in infinite-horizon undiscounted Markov decision processes (MDPs)
We provide the first finite-sample OPE error bound, extending existing results beyond the episodic and discounted cases.
We show that this results in an exponential-family distribution whose sufficient statistics are the features, paralleling maximum-entropy approaches in supervised learning.
arXiv Detail & Related papers (2020-06-17T18:13:37Z) - CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus [62.86856923633923]
We present a robust estimator for fitting multiple parametric models of the same form to noisy measurements.
In contrast to previous works, which resorted to hand-crafted search strategies for multiple model detection, we learn the search strategy from data.
For self-supervised learning of the search, we evaluate the proposed algorithm on multi-homography estimation and demonstrate an accuracy that is superior to state-of-the-art methods.
arXiv Detail & Related papers (2020-01-08T17:37:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.