Team Deep Mixture of Experts for Distributed Power Control
- URL: http://arxiv.org/abs/2007.14147v1
- Date: Tue, 28 Jul 2020 12:01:06 GMT
- Title: Team Deep Mixture of Experts for Distributed Power Control
- Authors: Matteo Zecchin, David Gesbert, Marios Kountouris
- Abstract summary: We propose an architecture inspired from the well-known Mixture of Experts (MoE) model.
We show the ability of the so called Team-DMoE model to efficiently track time-varying statistical scenarios.
- Score: 23.612400109629544
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In the context of wireless networking, it was recently shown that multiple
DNNs can be jointly trained to offer a desired collaborative behaviour capable
of coping with a broad range of sensing uncertainties. In particular, it was
established that DNNs can be used to derive policies that are robust with
respect to the information noise statistic affecting the local information
(e.g. CSI in a wireless network) used by each agent (e.g. transmitter) to make
its decision. While promising, a major challenge in the implementation of such
method is that information noise statistics may differ from agent to agent and,
more importantly, that such statistics may not be available at the time of
training or may evolve over time, making burdensome retraining necessary. This
situation makes it desirable to devise a "universal" machine learning model,
which can be trained once for all so as to allow for decentralized cooperation
in any future feedback noise environment. With this goal in mind, we propose an
architecture inspired from the well-known Mixture of Experts (MoE) model, which
was previously used for non-linear regression and classification tasks in
various contexts, such as computer vision and speech recognition. We consider
the decentralized power control problem as an example to showcase the validity
of the proposed model and to compare it against other power control algorithms.
We show the ability of the so called Team-DMoE model to efficiently track
time-varying statistical scenarios.
Related papers
- Federated Learning for Misbehaviour Detection with Variational Autoencoders and Gaussian Mixture Models [0.2999888908665658]
Federated Learning (FL) has become an attractive approach to collaboratively train Machine Learning (ML) models.
This work proposes a novel unsupervised FL approach for the identification of potential misbehavior in vehicular environments.
We leverage the computing capabilities of public cloud services for model aggregation purposes.
arXiv Detail & Related papers (2024-05-16T08:49:50Z) - Optimal design of experiments in the context of machine-learning inter-atomic potentials: improving the efficiency and transferability of kernel based methods [0.7234862895932991]
Data-driven, machine learning (ML) models of atomistic interactions can relate nuanced aspects of atomic arrangements into predictions of energies and forces.
The main challenge stems from the fact that descriptors of chemical environments are often sparse high-dimensional objects without a well-defined continuous metric.
We will demonstrate that classical concepts of statistical planning of experiments and optimal design can help to mitigate such problems at a relatively low computational cost.
arXiv Detail & Related papers (2024-05-14T14:14:23Z) - Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks [94.2860766709971]
We address the challenge of sampling and remote estimation for autoregressive Markovian processes in a wireless network with statistically-identical agents.
Our goal is to minimize time-average estimation error and/or age of information with decentralized scalable sampling and transmission policies.
arXiv Detail & Related papers (2024-04-04T06:24:11Z) - Enhancing Multiple Reliability Measures via Nuisance-extended
Information Bottleneck [77.37409441129995]
In practical scenarios where training data is limited, many predictive signals in the data can be rather from some biases in data acquisition.
We consider an adversarial threat model under a mutual information constraint to cover a wider class of perturbations in training.
We propose an autoencoder-based training to implement the objective, as well as practical encoder designs to facilitate the proposed hybrid discriminative-generative training.
arXiv Detail & Related papers (2023-03-24T16:03:21Z) - Denoised MDPs: Learning World Models Better Than the World Itself [94.74665254213588]
This work categorizes information out in the wild into four types based on controllability and relation with reward, and formulates useful information as that which is both controllable and reward-relevant.
Experiments on variants of DeepMind Control Suite and RoboDesk demonstrate superior performance of our denoised world model over using raw observations alone.
arXiv Detail & Related papers (2022-06-30T17:59:49Z) - Robust, Deep, and Reinforcement Learning for Management of Communication
and Power Networks [6.09170287691728]
The present thesis first develops principled methods to make generic machine learning models robust against distributional uncertainties and adversarial data.
We then build on this robust framework to design robust semi-supervised learning over graph methods.
The second part of this thesis aspires to fully unleash the potential of next-generation wired and wireless networks.
arXiv Detail & Related papers (2022-02-08T05:49:06Z) - Learning Connectivity for Data Distribution in Robot Teams [96.39864514115136]
We propose a task-agnostic, decentralized, low-latency method for data distribution in ad-hoc networks using Graph Neural Networks (GNN)
Our approach enables multi-agent algorithms based on global state information to function by ensuring it is available at each robot.
We train the distributed GNN communication policies via reinforcement learning using the average Age of Information as the reward function and show that it improves training stability compared to task-specific reward functions.
arXiv Detail & Related papers (2021-03-08T21:48:55Z) - Collective Awareness for Abnormality Detection in Connected Autonomous
Vehicles [4.659696262995864]
This article presents a novel approach to develop an initial level of collective awareness in a network of intelligent agents.
A specific collective self awareness functionality is considered, namely, agent centered detection of abnormal situations.
The impact is also evaluated by the communication channel used by the network to share the data sensed in a distributed way by each agent of the network.
arXiv Detail & Related papers (2020-10-28T12:11:36Z) - Dif-MAML: Decentralized Multi-Agent Meta-Learning [54.39661018886268]
We propose a cooperative multi-agent meta-learning algorithm, referred to as MAML or Dif-MAML.
We show that the proposed strategy allows a collection of agents to attain agreement at a linear rate and to converge to a stationary point of the aggregate MAML.
Simulation results illustrate the theoretical findings and the superior performance relative to the traditional non-cooperative setting.
arXiv Detail & Related papers (2020-10-06T16:51:09Z) - Plausible Counterfactuals: Auditing Deep Learning Classifiers with
Realistic Adversarial Examples [84.8370546614042]
Black-box nature of Deep Learning models has posed unanswered questions about what they learn from data.
Generative Adversarial Network (GAN) and multi-objectives are used to furnish a plausible attack to the audited model.
Its utility is showcased within a human face classification task, unveiling the enormous potential of the proposed framework.
arXiv Detail & Related papers (2020-03-25T11:08:56Z) - Resource Management in Wireless Networks via Multi-Agent Deep
Reinforcement Learning [15.091308167639815]
We propose a mechanism for distributed resource management and interference mitigation in wireless networks using multi-agent deep reinforcement learning (RL)
We equip each transmitter in the network with a deep RL agent that receives delayed observations from its associated users, while also exchanging observations with its neighboring agents.
Our proposed framework enables agents to make decisions simultaneously and in a distributed manner, unaware of the concurrent decisions of other agents.
arXiv Detail & Related papers (2020-02-14T19:01:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.