Learning Efficient Navigation in Vortical Flow Fields
- URL: http://arxiv.org/abs/2102.10536v1
- Date: Sun, 21 Feb 2021 07:25:03 GMT
- Title: Learning Efficient Navigation in Vortical Flow Fields
- Authors: Peter Gunnarson, Ioannis Mandralis, Guido Novati, Petros Koumoutsakos,
John O. Dabiri
- Abstract summary: We apply a novel Reinforcement Learning algorithm to steer a fixed-speed swimmer through an unsteady two-dimensional flow field.
The algorithm entails inputting environmental cues into a deep neural network that determines the swimmer's actions.
A velocity sensing approach outperformed a bio-mimetic vorticity sensing approach by nearly two-fold in success rate.
- Score: 6.585044528359311
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Efficient point-to-point navigation in the presence of a background flow
field is important for robotic applications such as ocean surveying. In such
applications, robots may only have knowledge of their immediate surroundings or
be faced with time-varying currents, which limits the use of optimal control
techniques for planning trajectories. Here, we apply a novel Reinforcement
Learning algorithm to discover time-efficient navigation policies to steer a
fixed-speed swimmer through an unsteady two-dimensional flow field. The
algorithm entails inputting environmental cues into a deep neural network that
determines the swimmer's actions, and deploying Remember and Forget Experience
replay. We find that the resulting swimmers successfully exploit the background
flow to reach the target, but that this success depends on the type of sensed
environmental cue. Surprisingly, a velocity sensing approach outperformed a
bio-mimetic vorticity sensing approach by nearly two-fold in success rate.
Equipped with local velocity measurements, the reinforcement learning algorithm
achieved near 100% success in reaching the target locations while approaching
the time-efficiency of paths found by a global optimal control planner.
Related papers
- MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation [3.9974562667271507]
Visual odometry (VO) is essential for enabling accurate point-goal navigation of embodied agents in indoor environments.
Recent deep-learned VO methods show robust performance but suffer from sample inefficiency during training.
We propose a robust and sample-efficient VO pipeline based on motion priors available while an agent is navigating an environment.
arXiv Detail & Related papers (2024-11-07T15:36:49Z) - LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning [91.95362946266577]
Path planning is a fundamental scientific problem in robotics and autonomous navigation.
Traditional algorithms like A* and its variants are capable of ensuring path validity but suffer from significant computational and memory inefficiencies as the state space grows.
We propose a new LLM based route planning method that synergistically combines the precise pathfinding capabilities of A* with the global reasoning capability of LLMs.
This hybrid approach aims to enhance pathfinding efficiency in terms of time and space complexity while maintaining the integrity of path validity, especially in large-scale scenarios.
arXiv Detail & Related papers (2024-06-20T01:24:30Z) - Histogram Layer Time Delay Neural Networks for Passive Sonar
Classification [58.720142291102135]
A novel method combines a time delay neural network and histogram layer to incorporate statistical contexts for improved feature learning and underwater acoustic target classification.
The proposed method outperforms the baseline model, demonstrating the utility in incorporating statistical contexts for passive sonar target recognition.
arXiv Detail & Related papers (2023-07-25T19:47:26Z) - CCE: Sample Efficient Sparse Reward Policy Learning for Robotic Navigation via Confidence-Controlled Exploration [72.24964965882783]
Confidence-Controlled Exploration (CCE) is designed to enhance the training sample efficiency of reinforcement learning algorithms for sparse reward settings such as robot navigation.
CCE is based on a novel relationship we provide between gradient estimation and policy entropy.
We demonstrate through simulated and real-world experiments that CCE outperforms conventional methods that employ constant trajectory lengths and entropy regularization.
arXiv Detail & Related papers (2023-06-09T18:45:15Z) - Visual-Language Navigation Pretraining via Prompt-based Environmental
Self-exploration [83.96729205383501]
We introduce prompt-based learning to achieve fast adaptation for language embeddings.
Our model can adapt to diverse vision-language navigation tasks, including VLN and REVERIE.
arXiv Detail & Related papers (2022-03-08T11:01:24Z) - Deep Learning Aided Packet Routing in Aeronautical Ad-Hoc Networks
Relying on Real Flight Data: From Single-Objective to Near-Pareto
Multi-Objective Optimization [79.96177511319713]
We invoke deep learning (DL) to assist routing in aeronautical ad-hoc networks (AANETs)
A deep neural network (DNN) is conceived for mapping the local geographic information observed by the forwarding node into the information required for determining the optimal next hop.
We extend the DL-aided routing algorithm to a multi-objective scenario, where we aim for simultaneously minimizing the delay, maximizing the path capacity, and maximizing the path lifetime.
arXiv Detail & Related papers (2021-10-28T14:18:22Z) - Integrating Deep Reinforcement and Supervised Learning to Expedite
Indoor Mapping [0.0]
We show that combining the two methods can shorten the mapping time, compared to frontier-based motion planning, by up to 75%.
One is the use of deep reinforcement learning to train the motion planner.
The second is the inclusion of a pre-trained generative deep neural network, acting as a map predictor.
arXiv Detail & Related papers (2021-09-17T12:07:07Z) - Autonomous Drone Racing with Deep Reinforcement Learning [39.757652701917166]
In many robotic tasks, such as drone racing, the goal is to travel through a set of waypoints as fast as possible.
A key challenge is planning the minimum-time trajectory, which is typically solved by assuming perfect knowledge of the waypoints to pass in advance.
In this work, a new approach to minimum-time trajectory generation for quadrotors is presented.
arXiv Detail & Related papers (2021-03-15T18:05:49Z) - Optimal control of point-to-point navigation in turbulent time-dependent
flows using Reinforcement Learning [0.0]
We present theoretical and numerical results concerning the problem to find the path that minimizes time to navigate between two given points in a complex fluid.
We show that ActorCritic algorithms are able to find quasi-optimal solutions in the presence of either time-independent or chaotically evolving flow configurations.
arXiv Detail & Related papers (2021-02-27T21:31:18Z) - End-to-end Learning for Inter-Vehicle Distance and Relative Velocity
Estimation in ADAS with a Monocular Camera [81.66569124029313]
We propose a camera-based inter-vehicle distance and relative velocity estimation method based on end-to-end training of a deep neural network.
The key novelty of our method is the integration of multiple visual clues provided by any two time-consecutive monocular frames.
We also propose a vehicle-centric sampling mechanism to alleviate the effect of perspective distortion in the motion field.
arXiv Detail & Related papers (2020-06-07T08:18:31Z) - Enhance the performance of navigation: A two-stage machine learning
approach [13.674463804942837]
Real time traffic navigation is an important capability in smart transportation technologies.
In this paper, we adopt the ideas of ensemble learning and develop a two-stage machine learning model to give accurate navigation results.
arXiv Detail & Related papers (2020-04-02T08:55:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.