Cooperative guidance of multiple missiles: a hybrid co-evolutionary
approach
- URL: http://arxiv.org/abs/2208.07156v2
- Date: Sat, 15 Apr 2023 03:11:00 GMT
- Title: Cooperative guidance of multiple missiles: a hybrid co-evolutionary
approach
- Authors: Xuejing Lan, Junda Chen, Zhijia Zhao, Tao Zou
- Abstract summary: Cooperative guidance of multiple missiles is a challenging task with rigorous constraints of time and space consensus.
This paper develops a novel natural co-evolutionary strategy (NCES) to address the issues of non-stationarity and continuous control faced by cooperative guidance.
A hybrid co-evolutionary cooperative guidance law (HCCGL) is proposed by integrating the highly scalable co-evolutionary mechanism and the traditional guidance strategy.
- Score: 0.9176056742068814
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cooperative guidance of multiple missiles is a challenging task with rigorous
constraints of time and space consensus, especially when attacking dynamic
targets. In this paper, the cooperative guidance task is described as a
distributed multi-objective cooperative optimization problem. To address the
issues of non-stationarity and continuous control faced by cooperative
guidance, the natural evolutionary strategy (NES) is improved along with an
elitist adaptive learning technique to develop a novel natural co-evolutionary
strategy (NCES). The gradients of original evolutionary strategy are rescaled
to reduce the estimation bias caused by the interaction between the multiple
missiles. Then, a hybrid co-evolutionary cooperative guidance law (HCCGL) is
proposed by integrating the highly scalable co-evolutionary mechanism and the
traditional guidance strategy. Finally, three simulations under different
conditions demonstrate the effectiveness and superiority of this guidance law
in solving cooperative guidance tasks with high accuracy. The proposed
co-evolutionary approach has great prospects not only in cooperative guidance,
but also in other application scenarios of multi-objective optimization,
dynamic optimization and distributed control.
Related papers
- Multi-task multi-constraint differential evolution with elite-guided knowledge transfer for coal mine integrated energy system dispatching [9.050846217690856]
Existing constrained multiobjective evolutionary algorithms struggle with locating multiple small and irregular feasible regions.
We here develop a multitask evolutionary algorithm framework that incorporates the dispatch correlated domain knowledge.
The proposed algorithm in feasibility, convergence, and diversity is demonstrated in a case study of a coal mine integrated energy system.
arXiv Detail & Related papers (2024-06-29T10:00:16Z) - Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback [58.049113055986375]
We develop a single stage approach named Alignment with Integrated Human Feedback (AIHF) to train reward models and the policy.
The proposed approach admits a suite of efficient algorithms, which can easily reduce to, and leverage, popular alignment algorithms.
We demonstrate the efficiency of the proposed solutions with extensive experiments involving alignment problems in LLMs and robotic control problems in MuJoCo.
arXiv Detail & Related papers (2024-06-11T01:20:53Z) - Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation [55.65482030032804]
Social robot navigation can be helpful in various contexts of daily life but requires safe human-robot interactions and efficient trajectory planning.
We propose a systematic relational reasoning approach with explicit inference of the underlying dynamically evolving relational structures.
Our approach infers dynamically evolving relation graphs and hypergraphs to capture the evolution of relations, which the trajectory predictor employs to generate future states.
arXiv Detail & Related papers (2024-01-22T18:58:22Z) - Gradient Based Hybridization of PSO [1.1059341532498634]
Particle Swarm Optimization (PSO) has emerged as a powerful metaheuristic global optimization approach over the past three decades.
PSO faces challenges, such as premature stagnation in single-objective scenarios and the need to strike a balance between exploration and exploitation.
Hybridizing PSO by integrating its cooperative nature with established optimization techniques from diverse paradigms offers a promising solution.
arXiv Detail & Related papers (2023-12-15T11:26:36Z) - Context-Aware Bayesian Network Actor-Critic Methods for Cooperative
Multi-Agent Reinforcement Learning [7.784991832712813]
We introduce a Bayesian network to inaugurate correlations between agents' action selections in their joint policy.
We develop practical algorithms to learn the context-aware Bayesian network policies.
Empirical results on a range of MARL benchmarks show the benefits of our approach.
arXiv Detail & Related papers (2023-06-02T21:22:27Z) - Stateful active facilitator: Coordination and Environmental
Heterogeneity in Cooperative Multi-Agent Reinforcement Learning [71.53769213321202]
We formalize the notions of coordination level and heterogeneity level of an environment.
We present HECOGrid, a suite of multi-agent environments that facilitates empirical evaluation of different MARL approaches.
We propose a Training Decentralized Execution learning approach that enables agents to work efficiently in high-coordination and high-heterogeneity environments.
arXiv Detail & Related papers (2022-10-04T18:17:01Z) - Formation control with connectivity assurance for missile swarm: a
natural co-evolutionary strategy approach [0.0]
We present a metaheuristic approach that leverages a natural co-evolutionary strategy to solve the formation control problem for a swarm of missiles.
Focusing on the issue of local optimum and unstable evolution, we incorporate a novel model-based policy constraint.
We show that it is feasible to treat generic formation control problem as Markov Decision Process(MDP) and solve it through iterative learning.
arXiv Detail & Related papers (2022-08-24T07:54:06Z) - Accelerated Federated Learning with Decoupled Adaptive Optimization [53.230515878096426]
federated learning (FL) framework enables clients to collaboratively learn a shared model while keeping privacy of training data on clients.
Recently, many iterations efforts have been made to generalize centralized adaptive optimization methods, such as SGDM, Adam, AdaGrad, etc., to federated settings.
This work aims to develop novel adaptive optimization methods for FL from the perspective of dynamics of ordinary differential equations (ODEs)
arXiv Detail & Related papers (2022-07-14T22:46:43Z) - Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle
Coordination by Multi-Critic Policy Gradient Optimization [16.6182621419268]
In agriculture, disaster management, search and rescue operations, commercial and military applications, the advantage of applying a fleet of drones originates from their ability to cooperate autonomously.
We propose a Multi-Agent Reinforcement Learning approach that achieves a stable policy network update and similarity in reward signal development for an increasing number of agents.
arXiv Detail & Related papers (2020-12-31T07:00:44Z) - EOS: a Parallel, Self-Adaptive, Multi-Population Evolutionary Algorithm
for Constrained Global Optimization [68.8204255655161]
EOS is a global optimization algorithm for constrained and unconstrained problems of real-valued variables.
It implements a number of improvements to the well-known Differential Evolution (DE) algorithm.
Results prove that EOSis capable of achieving increased performance compared to state-of-the-art single-population self-adaptive DE algorithms.
arXiv Detail & Related papers (2020-07-09T10:19:22Z) - Decentralized MCTS via Learned Teammate Models [89.24858306636816]
We present a trainable online decentralized planning algorithm based on decentralized Monte Carlo Tree Search.
We show that deep learning and convolutional neural networks can be employed to produce accurate policy approximators.
arXiv Detail & Related papers (2020-03-19T13:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.