Federated Traffic Synthesizing and Classification Using Generative
Adversarial Networks
- URL: http://arxiv.org/abs/2104.10400v1
- Date: Wed, 21 Apr 2021 08:10:46 GMT
- Title: Federated Traffic Synthesizing and Classification Using Generative
Adversarial Networks
- Authors: Chenxin Xu, Rong Xia, Yong Xiao, Yingyu Li, Guangming Shi, Kwang-cheng
Chen
- Abstract summary: This paper introduces a novel framework, Federated Generative Adversarial Networks and Automatic Classification (FGAN-AC)
FGAN-AC is able to synthesize and classify multiple types of service data traffic from decentralized local datasets without requiring a large volume of manually labeled dataset or causing any data leakage.
- Score: 30.686118264562598
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With the fast growing demand on new services and applications as well as the
increasing awareness of data protection, traditional centralized traffic
classification approaches are facing unprecedented challenges. This paper
introduces a novel framework, Federated Generative Adversarial Networks and
Automatic Classification (FGAN-AC), which integrates decentralized data
synthesizing with traffic classification. FGAN-AC is able to synthesize and
classify multiple types of service data traffic from decentralized local
datasets without requiring a large volume of manually labeled dataset or
causing any data leakage. Two types of data synthesizing approaches have been
proposed and compared: computation-efficient FGAN
(FGAN-\uppercase\expandafter{\romannumeral1}) and communication-efficient FGAN
(FGAN-\uppercase\expandafter{\romannumeral2}). The former only implements a
single CNN model for processing each local dataset and the later only requires
coordination of intermediate model training parameters. An automatic data
classification and model updating framework has been proposed to automatically
identify unknown traffic from the synthesized data samples and create new
pseudo-labels for model training. Numerical results show that our proposed
framework has the ability to synthesize highly mixed service data traffic and
can significantly improve the traffic classification performance compared to
existing solutions.
Related papers
- Tackling Data Heterogeneity in Federated Time Series Forecasting [61.021413959988216]
Time series forecasting plays a critical role in various real-world applications, including energy consumption prediction, disease transmission monitoring, and weather forecasting.
Most existing methods rely on a centralized training paradigm, where large amounts of data are collected from distributed devices to a central cloud server.
We propose a novel framework, Fed-TREND, to address data heterogeneity by generating informative synthetic data as auxiliary knowledge carriers.
arXiv Detail & Related papers (2024-11-24T04:56:45Z) - XG-NID: Dual-Modality Network Intrusion Detection using a Heterogeneous Graph Neural Network and Large Language Model [5.298018090482744]
"XG-NID" is the first to fuse flow-level and packet-level data within a heterogeneous graph structure.
XG-NID uniquely enables real-time inference while effectively capturing the intricate relationships between flow and packet payload data.
arXiv Detail & Related papers (2024-08-27T01:14:34Z) - An Aggregation-Free Federated Learning for Tackling Data Heterogeneity [50.44021981013037]
Federated Learning (FL) relies on the effectiveness of utilizing knowledge from distributed datasets.
Traditional FL methods adopt an aggregate-then-adapt framework, where clients update local models based on a global model aggregated by the server from the previous training round.
We introduce FedAF, a novel aggregation-free FL algorithm.
arXiv Detail & Related papers (2024-04-29T05:55:23Z) - Federated Learning with Projected Trajectory Regularization [65.6266768678291]
Federated learning enables joint training of machine learning models from distributed clients without sharing their local data.
One key challenge in federated learning is to handle non-identically distributed data across the clients.
We propose a novel federated learning framework with projected trajectory regularization (FedPTR) for tackling the data issue.
arXiv Detail & Related papers (2023-12-22T02:12:08Z) - FedPNN: One-shot Federated Classification via Evolving Clustering Method
and Probabilistic Neural Network hybrid [4.241208172557663]
We propose a two-stage federated learning approach toward the objective of privacy protection.
In the first stage, the synthetic dataset is generated by employing two different distributions as noise.
In the second stage, the Federated Probabilistic Neural Network (FedPNN) is developed and employed for building globally shared classification model.
arXiv Detail & Related papers (2023-04-09T03:23:37Z) - Generative Adversarial Classification Network with Application to
Network Traffic Classification [25.93711502488151]
We propose a joint data imputation and data classification method, termed generative adversarial classification network (GACN)
For the scenario where some data samples are unlabeled, we propose an extension termed semi-supervised GACN (SSGACN), which is able to use the partially labeled data to improve classification accuracy.
We conduct experiments with real-world network traffic data traces, which demonstrate that GACN and SS-GACN can more accurately impute data features that are more important for classification, and they outperform existing methods in terms of classification accuracy.
arXiv Detail & Related papers (2023-03-19T15:00:47Z) - Distributed Traffic Synthesis and Classification in Edge Networks: A
Federated Self-supervised Learning Approach [83.2160310392168]
This paper proposes FS-GAN to support automatic traffic analysis and synthesis over a large number of heterogeneous datasets.
FS-GAN is composed of multiple distributed Generative Adversarial Networks (GANs)
FS-GAN can classify data of unknown types of service and create synthetic samples that capture the traffic distribution of the unknown types.
arXiv Detail & Related papers (2023-02-01T03:23:11Z) - DANCE: DAta-Network Co-optimization for Efficient Segmentation Model
Training and Inference [85.02494022662505]
DANCE is an automated simultaneous data-network co-optimization for efficient segmentation model training and inference.
It integrates automated data slimming which adaptively downsamples/drops input images and controls their corresponding contribution to the training loss guided by the images' spatial complexity.
Experiments and ablating studies demonstrate that DANCE can achieve "all-win" towards efficient segmentation.
arXiv Detail & Related papers (2021-07-16T04:58:58Z) - Pre-Trained Models for Heterogeneous Information Networks [57.78194356302626]
We propose a self-supervised pre-training and fine-tuning framework, PF-HIN, to capture the features of a heterogeneous information network.
PF-HIN consistently and significantly outperforms state-of-the-art alternatives on each of these tasks, on four datasets.
arXiv Detail & Related papers (2020-07-07T03:36:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.