Neural representation of a time optimal, constant acceleration
rendezvous
- URL: http://arxiv.org/abs/2203.15490v1
- Date: Tue, 29 Mar 2022 12:40:50 GMT
- Title: Neural representation of a time optimal, constant acceleration
rendezvous
- Authors: Dario Izzo and Sebastien Origer
- Abstract summary: We train neural models to represent both the optimal policy (i.e. the optimal thrust direction) and the value function (i.e. the time of flight) for a time optimal, constant acceleration low-thrust rendezvous.
We achieve, in all cases, accuracies resulting in successful rendezvous and time of flight predictions.
- Score: 10.191757341020216
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We train neural models to represent both the optimal policy (i.e. the optimal
thrust direction) and the value function (i.e. the time of flight) for a time
optimal, constant acceleration low-thrust rendezvous. In both cases we develop
and make use of the data augmentation technique we call backward generation of
optimal examples. We are thus able to produce and work with large dataset and
to fully exploit the benefit of employing a deep learning framework. We
achieve, in all cases, accuracies resulting in successful rendezvous (simulated
following the learned policy) and time of flight predictions (using the learned
value function). We find that residuals as small as a few m/s, thus well within
the possibility of a spacecraft navigation $\Delta V$ budget, are achievable
for the velocity at rendezvous. We also find that, on average, the absolute
error to predict the optimal time of flight to rendezvous from any orbit in the
asteroid belt to an Earth-like orbit is small (less than 4\%) and thus also of
interest for practical uses, for example, during preliminary mission design
phases.
Related papers
- Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach [51.76826149868971]
Policy evaluation via Monte Carlo simulation is at the core of many MC Reinforcement Learning (RL) algorithms.
We propose as a quality index a surrogate of the mean squared error of a return estimator that uses trajectories of different lengths.
We present an adaptive algorithm called Robust and Iterative Data collection strategy Optimization (RIDO)
arXiv Detail & Related papers (2024-10-17T11:47:56Z) - Revisiting Space Mission Planning: A Reinforcement Learning-Guided Approach for Multi-Debris Rendezvous [15.699822139827916]
The aim is to optimize the sequence in which all the given debris should be visited to get the least total time for rendezvous for the entire mission.
A neural network (NN) policy is developed, trained on simulated space missions with varying debris fields.
The reinforcement learning approach demonstrates a significant improvement in planning efficiency.
arXiv Detail & Related papers (2024-09-25T12:50:01Z) - OPUS: Occupancy Prediction Using a Sparse Set [64.60854562502523]
We present a framework to simultaneously predict occupied locations and classes using a set of learnable queries.
OPUS incorporates a suite of non-trivial strategies to enhance model performance.
Our lightest model achieves superior RayIoU on the Occ3D-nuScenes dataset at near 2x FPS, while our heaviest model surpasses previous best results by 6.1 RayIoU.
arXiv Detail & Related papers (2024-09-14T07:44:22Z) - Pre-training on Synthetic Driving Data for Trajectory Prediction [61.520225216107306]
We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting.
We adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them.
We conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies.
arXiv Detail & Related papers (2023-09-18T19:49:22Z) - Instance-Conditional Timescales of Decay for Non-Stationary Learning [11.90763787610444]
Slow concept drift is a ubiquitous, yet under-studied problem in machine learning systems.
We propose an optimization-driven approach towards balancing instance importance over large training windows.
Experiments on a large real-world dataset of 39M photos over a 9 year period show upto 15% relative gains in accuracy.
arXiv Detail & Related papers (2022-12-12T14:16:26Z) - Globally Optimal Event-Based Divergence Estimation for Ventral Landing [55.29096494880328]
Event sensing is a major component in bio-inspired flight guidance and control systems.
We explore the usage of event cameras for predicting time-to-contact with the surface during ventral landing.
This is achieved by estimating divergence (inverse TTC), which is the rate of radial optic flow, from the event stream generated during landing.
Our core contributions are a novel contrast maximisation formulation for event-based divergence estimation, and a branch-and-bound algorithm to exactly maximise contrast and find the optimal divergence value.
arXiv Detail & Related papers (2022-09-27T06:00:52Z) - Time-Optimal Planning for Quadrotor Waypoint Flight [50.016821506107455]
Planning time-optimal trajectories at the actuation limit of a quadrotor is an open problem.
We propose a solution while exploiting the full quadrotor's actuator potential.
We validate our method in real-world flights in one of the world's largest motion-capture systems.
arXiv Detail & Related papers (2021-08-10T09:26:43Z) - Autonomous Drone Racing with Deep Reinforcement Learning [39.757652701917166]
In many robotic tasks, such as drone racing, the goal is to travel through a set of waypoints as fast as possible.
A key challenge is planning the minimum-time trajectory, which is typically solved by assuming perfect knowledge of the waypoints to pass in advance.
In this work, a new approach to minimum-time trajectory generation for quadrotors is presented.
arXiv Detail & Related papers (2021-03-15T18:05:49Z) - FLOT: Scene Flow on Point Clouds Guided by Optimal Transport [82.86743909483312]
We propose and study a method called FLOT that estimates scene flow on point clouds.
Inspired by recent works on graph matching, we build a method to find these correspondences by borrowing tools from optimal transport.
Our main finding is that FLOT can perform as well as the best existing methods on synthetic and real-world datasets.
arXiv Detail & Related papers (2020-07-22T00:15:30Z) - Real-Time Optimal Guidance and Control for Interplanetary Transfers
Using Deep Networks [10.191757341020216]
Imitation learning of optimal examples is used as a network training paradigm.
G&CNETs are suitable for an on-board, real-time, implementation of the optimal guidance and control system of the spacecraft.
arXiv Detail & Related papers (2020-02-20T23:37:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.