Distributed Traffic Synthesis and Classification in Edge Networks: A
Federated Self-supervised Learning Approach
- URL: http://arxiv.org/abs/2302.00207v1
- Date: Wed, 1 Feb 2023 03:23:11 GMT
- Title: Distributed Traffic Synthesis and Classification in Edge Networks: A
Federated Self-supervised Learning Approach
- Authors: Yong Xiao, Rong Xia, Yingyu Li, Guangming Shi, Diep N. Nguyen, Dinh
Thai Hoang, Dusit Niyato, Marwan Krunz
- Abstract summary: This paper proposes FS-GAN to support automatic traffic analysis and synthesis over a large number of heterogeneous datasets.
FS-GAN is composed of multiple distributed Generative Adversarial Networks (GANs)
FS-GAN can classify data of unknown types of service and create synthetic samples that capture the traffic distribution of the unknown types.
- Score: 83.2160310392168
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the rising demand for wireless services and increased awareness of the
need for data protection, existing network traffic analysis and management
architectures are facing unprecedented challenges in classifying and
synthesizing the increasingly diverse services and applications. This paper
proposes FS-GAN, a federated self-supervised learning framework to support
automatic traffic analysis and synthesis over a large number of heterogeneous
datasets. FS-GAN is composed of multiple distributed Generative Adversarial
Networks (GANs), with a set of generators, each being designed to generate
synthesized data samples following the distribution of an individual service
traffic, and each discriminator being trained to differentiate the synthesized
data samples and the real data samples of a local dataset. A federated
learning-based framework is adopted to coordinate local model training
processes of different GANs across different datasets. FS-GAN can classify data
of unknown types of service and create synthetic samples that capture the
traffic distribution of the unknown types. We prove that FS-GAN can minimize
the Jensen-Shannon Divergence (JSD) between the distribution of real data
across all the datasets and that of the synthesized data samples. FS-GAN also
maximizes the JSD among the distributions of data samples created by different
generators, resulting in each generator producing synthetic data samples that
follow the same distribution as one particular service type. Extensive
simulation results show that the classification accuracy of FS-GAN achieves
over 20% improvement in average compared to the state-of-the-art
clustering-based traffic analysis algorithms. FS-GAN also has the capability to
synthesize highly complex mixtures of traffic types without requiring any
human-labeled data samples.
Related papers
- Tackling Data Heterogeneity in Federated Time Series Forecasting [61.021413959988216]
Time series forecasting plays a critical role in various real-world applications, including energy consumption prediction, disease transmission monitoring, and weather forecasting.
Most existing methods rely on a centralized training paradigm, where large amounts of data are collected from distributed devices to a central cloud server.
We propose a novel framework, Fed-TREND, to address data heterogeneity by generating informative synthetic data as auxiliary knowledge carriers.
arXiv Detail & Related papers (2024-11-24T04:56:45Z) - FLIGAN: Enhancing Federated Learning with Incomplete Data using GAN [1.5749416770494706]
Federated Learning (FL) provides a privacy-preserving mechanism for distributed training of machine learning models on networked devices.
We propose FLIGAN, a novel approach to address the issue of data incompleteness in FL.
Our methodology adheres to FL's privacy requirements by generating synthetic data in a federated manner without sharing the actual data in the process.
arXiv Detail & Related papers (2024-03-25T16:49:38Z) - PFL-GAN: When Client Heterogeneity Meets Generative Models in
Personalized Federated Learning [55.930403371398114]
We propose a novel generative adversarial network (GAN) sharing and aggregation strategy for personalized learning (PFL)
PFL-GAN addresses the client heterogeneity in different scenarios. More specially, we first learn the similarity among clients and then develop an weighted collaborative data aggregation.
The empirical results through the rigorous experimentation on several well-known datasets demonstrate the effectiveness of PFL-GAN.
arXiv Detail & Related papers (2023-08-23T22:38:35Z) - Revisiting the Evaluation of Image Synthesis with GANs [55.72247435112475]
This study presents an empirical investigation into the evaluation of synthesis performance, with generative adversarial networks (GANs) as a representative of generative models.
In particular, we make in-depth analyses of various factors, including how to represent a data point in the representation space, how to calculate a fair distance using selected samples, and how many instances to use from each set.
arXiv Detail & Related papers (2023-04-04T17:54:32Z) - Creating Synthetic Datasets for Collaborative Filtering Recommender
Systems using Generative Adversarial Networks [1.290382979353427]
Research and education in machine learning needs diverse, representative, and open datasets to handle the necessary training, validation, and testing tasks.
To feed this research variety, it is necessary and convenient to reinforce the existing datasets with synthetic ones.
This paper proposes a Generative Adversarial Network (GAN)-based method to generate collaborative filtering datasets.
arXiv Detail & Related papers (2023-03-02T14:23:27Z) - Federated Learning with GAN-based Data Synthesis for Non-IID Clients [8.304185807036783]
Federated learning (FL) has recently emerged as a popular privacy-preserving collaborative learning paradigm.
We propose a novel framework, named Synthetic Data Aided Federated Learning (SDA-FL), to resolve this non-IID challenge by sharing synthetic data.
arXiv Detail & Related papers (2022-06-11T11:43:25Z) - CAFE: Learning to Condense Dataset by Aligning Features [72.99394941348757]
We propose a novel scheme to Condense dataset by Aligning FEatures (CAFE)
At the heart of our approach is an effective strategy to align features from the real and synthetic data across various scales.
We validate the proposed CAFE across various datasets, and demonstrate that it generally outperforms the state of the art.
arXiv Detail & Related papers (2022-03-03T05:58:49Z) - Federated Traffic Synthesizing and Classification Using Generative
Adversarial Networks [30.686118264562598]
This paper introduces a novel framework, Federated Generative Adversarial Networks and Automatic Classification (FGAN-AC)
FGAN-AC is able to synthesize and classify multiple types of service data traffic from decentralized local datasets without requiring a large volume of manually labeled dataset or causing any data leakage.
arXiv Detail & Related papers (2021-04-21T08:10:46Z) - Lessons Learned from the Training of GANs on Artificial Datasets [0.0]
Generative Adversarial Networks (GANs) have made great progress in synthesizing realistic images in recent years.
GANs are prone to underfitting or overfitting, making the analysis of them difficult and constrained.
We train them on artificial datasets where there are infinitely many samples and the real data distributions are simple.
We find that training mixtures of GANs leads to more performance gain compared to increasing the network depth or width.
arXiv Detail & Related papers (2020-07-13T14:51:02Z) - Brainstorming Generative Adversarial Networks (BGANs): Towards
Multi-Agent Generative Models with Distributed Private Datasets [70.62568022925971]
generative adversarial networks (GANs) must be fed by large datasets that adequately represent the data space.
In many scenarios, the available datasets may be limited and distributed across multiple agents, each of which is seeking to learn the distribution of the data on its own.
In this paper, a novel brainstorming GAN (BGAN) architecture is proposed using which multiple agents can generate real-like data samples while operating in a fully distributed manner.
arXiv Detail & Related papers (2020-02-02T02:58:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.