NetGPT: Generative Pretrained Transformer for Network Traffic
- URL: http://arxiv.org/abs/2304.09513v2
- Date: Wed, 17 May 2023 11:23:35 GMT
- Title: NetGPT: Generative Pretrained Transformer for Network Traffic
- Authors: Xuying Meng, Chungang Lin, Yequan Wang, Yujun Zhang
- Abstract summary: Pretrained models for network traffic can utilize large-scale raw data to learn the essential characteristics of network traffic.
In this paper, we make the first attempt to provide a generative pretrained model NetGPT for both traffic understanding and generation tasks.
- Score: 4.205009931131087
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: All data on the Internet are transferred by network traffic, thus accurately
modeling network traffic can help improve network services quality and protect
data privacy. Pretrained models for network traffic can utilize large-scale raw
data to learn the essential characteristics of network traffic, and generate
distinguishable results for input traffic without considering specific
downstream tasks. Effective pretrained models can significantly optimize the
training efficiency and effectiveness of downstream tasks, such as application
classification, attack detection and traffic generation. Despite the great
success of pretraining in natural language processing, there is no work in the
network field. Considering the diverse demands and characteristics of network
traffic and network tasks, it is non-trivial to build a pretrained model for
network traffic and we face various challenges, especially the heterogeneous
headers and payloads in the multi-pattern network traffic and the different
dependencies for contexts of diverse downstream network tasks.
To tackle these challenges, in this paper, we make the first attempt to
provide a generative pretrained model NetGPT for both traffic understanding and
generation tasks. We propose the multi-pattern network traffic modeling to
construct unified text inputs and support both traffic understanding and
generation tasks. We further optimize the adaptation effect of the pretrained
model to diversified tasks by shuffling header fields, segmenting packets in
flows, and incorporating diverse task labels with prompts. With diverse traffic
datasets from encrypted software, DNS, private industrial protocols and
cryptocurrency mining, expensive experiments demonstrate the effectiveness of
our NetGPT in a range of traffic understanding and generation tasks on traffic
datasets, and outperform state-of-the-art baselines by a wide margin.
Related papers
- Towards a graph-based foundation model for network traffic analysis [3.0558245652654907]
Foundation models can grasp the complexities of network traffic dynamics and adapt to any specific task or environment with minimal fine-tuning.
Previous approaches have used tokenized hex-level packet data.
We propose a new, efficient graph-based alternative at the flow-level.
arXiv Detail & Related papers (2024-09-12T15:04:34Z) - NetBench: A Large-Scale and Comprehensive Network Traffic Benchmark Dataset for Foundation Models [15.452625276982987]
In computer networking, network traffic refers to the amount of data transmitted in the form of packets between internetworked computers or Cyber-Physical Systems.
We introduce the NetBench, a large-scale and comprehensive benchmark dataset for assessing machine learning models, especially foundation models, in both network traffic classification and generation tasks.
arXiv Detail & Related papers (2024-03-15T14:09:54Z) - BjTT: A Large-scale Multimodal Dataset for Traffic Prediction [49.93028461584377]
Traditional traffic prediction methods rely on historical traffic data to predict traffic trends.
In this work, we explore how generative models combined with text describing the traffic system can be applied for traffic generation.
We propose ChatTraffic, the first diffusion model for text-to-traffic generation.
arXiv Detail & Related papers (2024-03-08T04:19:56Z) - A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in
Next-gen Networks [1.1586742546971471]
Next-gen networks require automation and adaptively adjust network configuration based on traffic dynamics.
Traditional techniques that decide traffic policies are usually based on hand-crafted programming optimization and algorithms.
We develop a deep reinforcement learning (DRL) approach for adaptive traffic routing.
arXiv Detail & Related papers (2024-02-07T01:48:29Z) - Lens: A Foundation Model for Network Traffic [19.3652490585798]
Lens is a foundation model for network traffic that leverages the T5 architecture to learn the pre-trained representations from large-scale unlabeled data.
We design a novel loss that combines three distinct tasks: Masked Span Prediction (MSP), Packet Order Prediction (POP), and Homologous Traffic Prediction (HTP)
arXiv Detail & Related papers (2024-02-06T02:45:13Z) - Towards Intelligent Network Management: Leveraging AI for Network
Service Detection [0.0]
This study focuses on leveraging Machine Learning methodologies to create an advanced network traffic classification system.
We introduce a novel data-driven approach that excels in identifying various network service types in real-time.
Our system demonstrates a remarkable accuracy in distinguishing the network services.
arXiv Detail & Related papers (2023-10-14T16:06:11Z) - Road Network Guided Fine-Grained Urban Traffic Flow Inference [108.64631590347352]
Accurate inference of fine-grained traffic flow from coarse-grained one is an emerging yet crucial problem.
We propose a novel Road-Aware Traffic Flow Magnifier (RATFM) that exploits the prior knowledge of road networks.
Our method can generate high-quality fine-grained traffic flow maps.
arXiv Detail & Related papers (2021-09-29T07:51:49Z) - Optimal transport in multilayer networks [68.8204255655161]
We propose a model where optimal flows on different layers contribute differently to the total cost to be minimized.
As an application, we consider transportation networks, where each layer is associated to a different transportation system.
We show an example of this result on the real 2-layer network of the city of Bordeaux with bus and tram, where in certain regimes the presence of the tram network significantly unburdens the traffic on the road network.
arXiv Detail & Related papers (2021-06-14T07:33:09Z) - Graph-Based Neural Network Models with Multiple Self-Supervised
Auxiliary Tasks [79.28094304325116]
Graph Convolutional Networks are among the most promising approaches for capturing relationships among structured data points.
We propose three novel self-supervised auxiliary tasks to train graph-based neural network models in a multi-task fashion.
arXiv Detail & Related papers (2020-11-14T11:09:51Z) - Pre-Trained Models for Heterogeneous Information Networks [57.78194356302626]
We propose a self-supervised pre-training and fine-tuning framework, PF-HIN, to capture the features of a heterogeneous information network.
PF-HIN consistently and significantly outperforms state-of-the-art alternatives on each of these tasks, on four datasets.
arXiv Detail & Related papers (2020-07-07T03:36:28Z) - Curriculum By Smoothing [52.08553521577014]
Convolutional Neural Networks (CNNs) have shown impressive performance in computer vision tasks such as image classification, detection, and segmentation.
We propose an elegant curriculum based scheme that smoothes the feature embedding of a CNN using anti-aliasing or low-pass filters.
As the amount of information in the feature maps increases during training, the network is able to progressively learn better representations of the data.
arXiv Detail & Related papers (2020-03-03T07:27:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.