STDPG: A Spatio-Temporal Deterministic Policy Gradient Agent for Dynamic
Routing in SDN
- URL: http://arxiv.org/abs/2004.09783v1
- Date: Tue, 21 Apr 2020 07:19:07 GMT
- Title: STDPG: A Spatio-Temporal Deterministic Policy Gradient Agent for Dynamic
Routing in SDN
- Authors: Juan Chen, Zhiwen Xiao, Huanlai Xing, Penglin Dai, Shouxi Luo,
Muhammad Azhar Iqbal
- Abstract summary: Dynamic routing in software-defined networking (SDN) can be viewed as a centralized decision-making problem.
We propose a novel model-free framework for dynamic routing in SDN, which is referred to as SDN-temporal deterministic policy gradient (STDPG) agent.
STDPG achieves better routing solutions in terms of average end-to-end delay.
- Score: 6.27420060051673
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Dynamic routing in software-defined networking (SDN) can be viewed as a
centralized decision-making problem. Most of the existing deep reinforcement
learning (DRL) agents can address it, thanks to the deep neural network
(DNN)incorporated. However, fully-connected feed-forward neural network (FFNN)
is usually adopted, where spatial correlation and temporal variation of traffic
flows are ignored. This drawback usually leads to significantly high
computational complexity due to large number of training parameters. To
overcome this problem, we propose a novel model-free framework for dynamic
routing in SDN, which is referred to as spatio-temporal deterministic policy
gradient (STDPG) agent. Both the actor and critic networks are based on
identical DNN structure, where a combination of convolutional neural network
(CNN) and long short-term memory network (LSTM) with temporal attention
mechanism, CNN-LSTM-TAM, is devised. By efficiently exploiting spatial and
temporal features, CNNLSTM-TAM helps the STDPG agent learn better from the
experience transitions. Furthermore, we employ the prioritized experience
replay (PER) method to accelerate the convergence of model training. The
experimental results show that STDPG can automatically adapt for current
network environment and achieve robust convergence. Compared with a number
state-ofthe-art DRL agents, STDPG achieves better routing solutions in terms of
the average end-to-end delay.
Related papers
Err
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.