Toward Generative Data Augmentation for Traffic Classification
- URL: http://arxiv.org/abs/2310.13935v1
- Date: Sat, 21 Oct 2023 08:08:37 GMT
- Title: Toward Generative Data Augmentation for Traffic Classification
- Authors: Chao Wang, Alessandro Finamore, Pietro Michiardi, Massimo Gallo, Dario
Rossi
- Abstract summary: Data Augmentation (DA)-augmenting training data with synthetic samples-is wildly adopted in Computer Vision (CV) to improve models performance.
DA has not been yet popularized in networking use cases, including Traffic Classification (TC)
- Score: 54.92823760790628
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Data Augmentation (DA)-augmenting training data with synthetic samples-is
wildly adopted in Computer Vision (CV) to improve models performance.
Conversely, DA has not been yet popularized in networking use cases, including
Traffic Classification (TC). In this work, we present a preliminary study of 14
hand-crafted DAs applied on the MIRAGE19 dataset. Our results (i) show that DA
can reap benefits previously unexplored in TC and (ii) foster a research agenda
on the use of generative models to automate DA design.
Related papers
- Generalized Group Data Attribution [28.056149996461286]
Data Attribution methods quantify the influence of individual training data points on model outputs.
Existing DA methods are often computationally intensive, limiting their applicability to large-scale machine learning models.
We introduce the Generalized Group Data Attribution (GGDA) framework, which computationally simplifies DA by attributing to groups of training points instead of individual ones.
arXiv Detail & Related papers (2024-10-13T17:51:21Z) - DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception [78.26734070960886]
Current perceptive models heavily depend on resource-intensive datasets.
We introduce perception-aware loss (P.A. loss) through segmentation, improving both quality and controllability.
Our method customizes data augmentation by extracting and utilizing perception-aware attribute (P.A. Attr) during generation.
arXiv Detail & Related papers (2024-03-20T04:58:03Z) - Data Augmentation for Traffic Classification [54.92823760790628]
Data Augmentation (DA) is a technique widely adopted in Computer Vision (CV) and Natural Language Processing (NLP) tasks.
DA has struggled to gain traction in networking contexts, particularly in Traffic Classification (TC) tasks.
arXiv Detail & Related papers (2024-01-19T15:25:09Z) - Defect Classification in Additive Manufacturing Using CNN-Based Vision
Processing [76.72662577101988]
This paper examines two scenarios: first, using convolutional neural networks (CNNs) to accurately classify defects in an image dataset from AM and second, applying active learning techniques to the developed classification model.
This allows the construction of a human-in-the-loop mechanism to reduce the size of the data required to train and generate training data.
arXiv Detail & Related papers (2023-07-14T14:36:58Z) - Learning Better with Less: Effective Augmentation for Sample-Efficient
Visual Reinforcement Learning [57.83232242068982]
Data augmentation (DA) is a crucial technique for enhancing the sample efficiency of visual reinforcement learning (RL) algorithms.
It remains unclear which attributes of DA account for its effectiveness in achieving sample-efficient visual RL.
This work conducts comprehensive experiments to assess the impact of DA's attributes on its efficacy.
arXiv Detail & Related papers (2023-05-25T15:46:20Z) - Instance-Conditioned GAN Data Augmentation for Representation Learning [29.36473147430433]
We introduce DA_IC-GAN, a learnable data augmentation module that can be used off-the-shelf in conjunction with most state-of-the-art training recipes.
We show that DA_IC-GAN can boost accuracy to between 1%p and 2%p with the highest capacity models.
We additionally couple DA_IC-GAN with a self-supervised training recipe and show that we can also achieve an improvement of 1%p in accuracy in some settings.
arXiv Detail & Related papers (2023-03-16T22:45:43Z) - Models in the Loop: Aiding Crowdworkers with Generative Annotation
Assistants [41.9785159975426]
We introduce Generative Assistants (GAAs) that provide real-time suggestions that annotators can approve, modify, or reject entirely.
GAAs provide significant efficiency benefits in terms of annotation speed, while leading to improved model fooling rates.
arXiv Detail & Related papers (2021-12-16T17:59:39Z) - On Data Augmentation for GAN Training [39.074761323958406]
We propose Data Augmentation Optimized for GAN (DAG) to enable the use of augmented data in GAN training.
We conduct experiments to apply DAG to different GAN models.
When DAG is used in some GAN models, the system establishes state-of-the-art Frechet Inception Distance (FID) scores.
arXiv Detail & Related papers (2020-06-09T15:19:26Z) - Generative Data Augmentation for Commonsense Reasoning [75.26876609249197]
G-DAUGC is a novel generative data augmentation method that aims to achieve more accurate and robust learning in the low-resource setting.
G-DAUGC consistently outperforms existing data augmentation methods based on back-translation.
Our analysis demonstrates that G-DAUGC produces a diverse set of fluent training examples, and that its selection and training approaches are important for performance.
arXiv Detail & Related papers (2020-04-24T06:12:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.