Active Learning for Network Traffic Classification: A Technical Survey
- URL: http://arxiv.org/abs/2106.06933v1
- Date: Sun, 13 Jun 2021 06:37:50 GMT
- Title: Active Learning for Network Traffic Classification: A Technical Survey
- Authors: Amin Shahraki, Mahmoud Abbasi, Amir Taherkordi and Anca Delia Jurcut
- Abstract summary: This study investigates the applicability of an active form of ML, called Active Learning (AL), which reduces the need for a high number of labeled examples.
The study first provides an overview of NTC and its fundamental challenges along with surveying the literature in the field of using ML techniques in NTC.
Further, challenges and open issues in the use of AL for NTC are discussed.
- Score: 1.942265343737899
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Network Traffic Classification (NTC) has become an important component in a
wide variety of network management operations, e.g., Quality of Service (QoS)
provisioning and security purposes. Machine Learning (ML) algorithms as a
common approach for NTC methods can achieve reasonable accuracy and handle
encrypted traffic. However, ML-based NTC techniques suffer from the shortage of
labeled traffic data which is the case in many real-world applications. This
study investigates the applicability of an active form of ML, called Active
Learning (AL), which reduces the need for a high number of labeled examples by
actively choosing the instances that should be labeled. The study first
provides an overview of NTC and its fundamental challenges along with surveying
the literature in the field of using ML techniques in NTC. Then, it introduces
the concepts of AL, discusses it in the context of NTC, and review the
literature in this field. Further, challenges and open issues in the use of AL
for NTC are discussed. Additionally, as a technical survey, some experiments
are conducted to show the broad applicability of AL in NTC. The simulation
results show that AL can achieve high accuracy with a small amount of data.
Related papers
- Supervised Gradual Machine Learning for Aspect Category Detection [0.9857683394266679]
Aspect Category Detection (ACD) aims to identify implicit and explicit aspects in a given review sentence.
We propose a novel approach to tackle the ACD task by combining Deep Neural Networks (DNNs) with Gradual Machine Learning (GML) in a supervised setting.
arXiv Detail & Related papers (2024-04-08T07:21:46Z) - Robust Analysis of Multi-Task Learning Efficiency: New Benchmarks on Light-Weighed Backbones and Effective Measurement of Multi-Task Learning Challenges by Feature Disentanglement [69.51496713076253]
In this paper, we focus on the aforementioned efficiency aspects of existing MTL methods.
We first carry out large-scale experiments of the methods with smaller backbones and on a the MetaGraspNet dataset as a new test ground.
We also propose Feature Disentanglement measure as a novel and efficient identifier of the challenges in MTL.
arXiv Detail & Related papers (2024-02-05T22:15:55Z) - SoK: Pragmatic Assessment of Machine Learning for Network Intrusion
Detection [3.8759572154478343]
This paper aims to reduce the practitioners' skepticism towards ML for NID by "changing" the evaluation methodology adopted in research.
We propose the notion of "pragmatic assessment", which enable practitioners to gauge the real value of ML methods for NID.
arXiv Detail & Related papers (2023-04-30T18:38:27Z) - Active Learning Framework to Automate NetworkTraffic Classification [0.0]
The paper presents a novel ActiveLearning Framework (ALF) to address this topic.
ALF provides components that can be used to deploy an activelearning loop and maintain an ALF instance that continuouslyevolves a dataset and ML model.
The resultingsolution is deployable for IP flow-based analysis of high-speed(100 Gb/s) networks.
arXiv Detail & Related papers (2022-10-26T10:15:18Z) - KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in
Few-Shot NLP [68.43279384561352]
Existing data augmentation algorithms leverage task-independent rules or fine-tune general-purpose pre-trained language models.
These methods have trivial task-specific knowledge and are limited to yielding low-quality synthetic data for weak baselines in simple tasks.
We propose the Knowledge Mixture Data Augmentation Model (KnowDA): an encoder-decoder LM pretrained on a mixture of diverse NLP tasks.
arXiv Detail & Related papers (2022-06-21T11:34:02Z) - A Lightweight, Efficient and Explainable-by-Design Convolutional Neural
Network for Internet Traffic Classification [9.365794791156972]
This paper introduces a new Lightweight, Efficient and eXplainable-by-design convolutional neural network (LEXNet) for Internet traffic classification.
LEXNet relies on a new residual block (for lightweight and efficiency purposes) and prototype layer (for explainability)
Based on a commercial-grade dataset, our evaluation shows that LEXNet succeeds to maintain the same accuracy as the best performing state-of-the-art neural network.
arXiv Detail & Related papers (2022-02-11T10:21:34Z) - Learning to Detect Critical Nodes in Sparse Graphs via Feature Importance Awareness [53.351863569314794]
The critical node problem (CNP) aims to find a set of critical nodes from a network whose deletion maximally degrades the pairwise connectivity of the residual network.
This work proposes a feature importance-aware graph attention network for node representation.
It combines it with dueling double deep Q-network to create an end-to-end algorithm to solve CNP for the first time.
arXiv Detail & Related papers (2021-12-03T14:23:05Z) - Meta-Learning with Fewer Tasks through Task Interpolation [67.03769747726666]
Current meta-learning algorithms require a large number of meta-training tasks, which may not be accessible in real-world scenarios.
By meta-learning with task gradient (MLTI), our approach effectively generates additional tasks by randomly sampling a pair of tasks and interpolating the corresponding features and labels.
Empirically, in our experiments on eight datasets from diverse domains, we find that the proposed general MLTI framework is compatible with representative meta-learning algorithms and consistently outperforms other state-of-the-art strategies.
arXiv Detail & Related papers (2021-06-04T20:15:34Z) - Deep Learning and Traffic Classification: Lessons learned from a
commercial-grade dataset with hundreds of encrypted and zero-day applications [72.02908263225919]
We share our experience on a commercial-grade DL traffic classification engine.
We identify known applications from encrypted traffic, as well as unknown zero-day applications.
We propose a novel technique, tailored for DL models, that is significantly more accurate and light-weight than the state of the art.
arXiv Detail & Related papers (2021-04-07T15:21:22Z) - Dif-MAML: Decentralized Multi-Agent Meta-Learning [54.39661018886268]
We propose a cooperative multi-agent meta-learning algorithm, referred to as MAML or Dif-MAML.
We show that the proposed strategy allows a collection of agents to attain agreement at a linear rate and to converge to a stationary point of the aggregate MAML.
Simulation results illustrate the theoretical findings and the superior performance relative to the traditional non-cooperative setting.
arXiv Detail & Related papers (2020-10-06T16:51:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.