Flight Validation of Learning-Based Trajectory Optimization for the Astrobee Free-Flyer
- URL: http://arxiv.org/abs/2505.05588v1
- Date: Thu, 08 May 2025 18:42:36 GMT
- Title: Flight Validation of Learning-Based Trajectory Optimization for the Astrobee Free-Flyer
- Authors: Somrita Banerjee, Abhishek Cauligi, Marco Pavone,
- Abstract summary: We present flight results from experiments with the Astrobee free-flying robot on board the International Space Station.<n>We demonstrate how machine learning can accelerate on-board trajectory optimization while preserving theoretical solver guarantees.
- Score: 17.306347323545985
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Although widely used in commercial and industrial robotics, trajectory optimization has seen limited use in space applications due to its high computational demands. In this work, we present flight results from experiments with the Astrobee free-flying robot on board the International Space Station (ISS), that demonstrate how machine learning can accelerate on-board trajectory optimization while preserving theoretical solver guarantees. To the best of the authors' knowledge, this is the first-ever demonstration of learning-based control on the ISS. Our approach leverages the GuSTO sequential convex programming framework and uses a neural network, trained offline, to map problem parameters to effective initial ``warm-start'' trajectories, paving the way for faster real-time optimization on resource-constrained space platforms.
Related papers
- DRL-based Dolph-Tschebyscheff Beamforming in Downlink Transmission for Mobile Users [52.9870460238443]
We propose a deep reinforcement learning-based blind beamforming technique using a learnable Dolph-Tschebyscheff antenna array.<n>Our simulation results show that the proposed method can support data rates very close to the best possible values.
arXiv Detail & Related papers (2025-02-03T11:50:43Z) - Space for Improvement: Navigating the Design Space for Federated Learning in Satellite Constellations [0.8437187555622164]
We develop a method for space-ification of existing FL algorithms, evaluated on FLySTacK, our novel satellite constellation design and hardware aware testing platform.
We introduce AutoFLSat, a generalized, hierarchical, autonomous FL algorithm for space that provides a 12.5% to 37.5% reduction in model training time than leading alternatives.
arXiv Detail & Related papers (2024-10-31T23:49:36Z) - Learning-Initialized Trajectory Planning in Unknown Environments [4.2960463890487555]
Planning for autonomous flight in unknown environments requires precise planning for both the spatial and temporal trajectories.
We introduce a novel approach that guides optimization using a Neural-d Trajectory Planner.
We propose a framework that supports robust online replanning with tolerance to planning latency.
arXiv Detail & Related papers (2023-09-19T15:07:26Z) - Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation [72.24964965882783]
Reinforcement learning (RL) is a promising approach for robotic navigation, allowing robots to learn through trial and error.<n>Real-world robotic tasks often suffer from sparse rewards, leading to inefficient exploration and suboptimal policies.<n>We introduce Confidence-Controlled Exploration (CCE), a novel method that improves sample efficiency in RL-based robotic navigation without modifying the reward function.
arXiv Detail & Related papers (2023-06-09T18:45:15Z) - Optimality Principles in Spacecraft Neural Guidance and Control [16.59877059263942]
We argue that end-to-end neural guidance and control architectures (here called G&CNets) allow transferring onboard the burden of acting upon optimality principles.
In this way, the sensor information is transformed in real time into optimal plans thus increasing the mission autonomy and robustness.
We discuss the main results obtained in training such neural architectures in simulation for interplanetary transfers, landings and close proximity operations.
arXiv Detail & Related papers (2023-05-22T14:48:58Z) - Gradient-Based Trajectory Optimization With Learned Dynamics [80.41791191022139]
We use machine learning techniques to learn a differentiable dynamics model of the system from data.
We show that a neural network can model highly nonlinear behaviors accurately for large time horizons.
In our hardware experiments, we demonstrate that our learned model can represent complex dynamics for both the Spot and Radio-controlled (RC) car.
arXiv Detail & Related papers (2022-04-09T22:07:34Z) - Bayesian Optimization and Deep Learning forsteering wheel angle
prediction [58.720142291102135]
This work aims to obtain an accurate model for the prediction of the steering angle in an automated driving system.
BO was able to identify, within a limited number of trials, a model -- namely BOST-LSTM -- which resulted, the most accurate when compared to classical end-to-end driving models.
arXiv Detail & Related papers (2021-10-22T15:25:14Z) - Time-Optimal Planning for Quadrotor Waypoint Flight [50.016821506107455]
Planning time-optimal trajectories at the actuation limit of a quadrotor is an open problem.
We propose a solution while exploiting the full quadrotor's actuator potential.
We validate our method in real-world flights in one of the world's largest motion-capture systems.
arXiv Detail & Related papers (2021-08-10T09:26:43Z) - Reinforcement Learning for Low-Thrust Trajectory Design of
Interplanetary Missions [77.34726150561087]
This paper investigates the use of reinforcement learning for the robust design of interplanetary trajectories in presence of severe disturbances.
An open-source implementation of the state-of-the-art algorithm Proximal Policy Optimization is adopted.
The resulting Guidance and Control Network provides both a robust nominal trajectory and the associated closed-loop guidance law.
arXiv Detail & Related papers (2020-08-19T15:22:15Z) - First Steps: Latent-Space Control with Semantic Constraints for
Quadruped Locomotion [73.37945453998134]
Traditional approaches to quadruped control employ simplified, hand-derived models.
This significantly reduces the capability of the robot since its effective kinematic range is curtailed.
In this work, these challenges are addressed by framing quadruped control as optimisation in a structured latent space.
A deep generative model captures a statistical representation of feasible joint configurations, whilst complex dynamic and terminal constraints are expressed via high-level, semantic indicators.
We validate the feasibility of locomotion trajectories optimised using our approach both in simulation and on a real-worldmal quadruped.
arXiv Detail & Related papers (2020-07-03T07:04:18Z) - Real-Time Optimal Guidance and Control for Interplanetary Transfers
Using Deep Networks [10.191757341020216]
Imitation learning of optimal examples is used as a network training paradigm.
G&CNETs are suitable for an on-board, real-time, implementation of the optimal guidance and control system of the spacecraft.
arXiv Detail & Related papers (2020-02-20T23:37:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.