Towards an Ensemble Regressor Model for Anomalous ISP Traffic Prediction
- URL: http://arxiv.org/abs/2205.01300v1
- Date: Tue, 3 May 2022 04:37:37 GMT
- Title: Towards an Ensemble Regressor Model for Anomalous ISP Traffic Prediction
- Authors: Sajal Saha, Anwar Haque, and Greg Sidebottom
- Abstract summary: We show that outlier detection and mitigation assist the regression model in learning the general trend and making better predictions.
Our ensemble regression model achieved the minimum average gap of 5.04% between actual and predicted traffic with nine outlier-adjusted inputs.
- Score: 3.689539481706835
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Prediction of network traffic behavior is significant for the effective
management of modern telecommunication networks. However, the intuitive
approach of predicting network traffic using administrative experience and
market analysis data is inadequate for an efficient forecast framework. As a
result, many different mathematical models have been studied to capture the
general trend of the network traffic and predict accordingly. But the
comprehensive performance analysis of varying regression models and their
ensemble has not been studied before for analyzing real-world anomalous
traffic. In this paper, several regression models such as Extra Gradient Boost
(XGBoost), Light Gradient Boosting Machine (LightGBM), Stochastic Gradient
Descent (SGD), Gradient Boosting Regressor (GBR), and CatBoost Regressor were
analyzed to predict real traffic without and with outliers and show the
significance of outlier detection in real-world traffic prediction. Also, we
showed the outperformance of the ensemble regression model over the individual
prediction model. We compared the performance of different regression models
based on five different feature sets of lengths 6, 9, 12, 15, and 18. Our
ensemble regression model achieved the minimum average gap of 5.04% between
actual and predicted traffic with nine outlier-adjusted inputs. In general, our
experimental results indicate that the outliers in the data can significantly
impact the quality of the prediction. Thus, outlier detection and mitigation
assist the regression model in learning the general trend and making better
predictions.
Related papers
- Learning Augmentation Policies from A Model Zoo for Time Series Forecasting [58.66211334969299]
We introduce AutoTSAug, a learnable data augmentation method based on reinforcement learning.
By augmenting the marginal samples with a learnable policy, AutoTSAug substantially improves forecasting performance.
arXiv Detail & Related papers (2024-09-10T07:34:19Z) - Causally-Aware Spatio-Temporal Multi-Graph Convolution Network for Accurate and Reliable Traffic Prediction [5.200012764049096]
This study focuses on an instance of--temporal problem--traffic prediction--to demonstrate an advanced deep learning model for making accurate and reliable forecast.
We propose an end-to-end traffic prediction framework that leverages three primary components to accurate and reliable traffic predictions.
Experimental results on two real-world traffic datasets demonstrate that the method outperforms several state-of-the-art models in prediction accuracy.
arXiv Detail & Related papers (2024-08-23T14:35:54Z) - Towards Generalizable and Interpretable Motion Prediction: A Deep
Variational Bayes Approach [54.429396802848224]
This paper proposes an interpretable generative model for motion prediction with robust generalizability to out-of-distribution cases.
For interpretability, the model achieves the target-driven motion prediction by estimating the spatial distribution of long-term destinations.
Experiments on motion prediction datasets validate that the fitted model can be interpretable and generalizable.
arXiv Detail & Related papers (2024-03-10T04:16:04Z) - A Bayesian approach to quantifying uncertainties and improving
generalizability in traffic prediction models [0.0]
We propose a Bayesian recurrent neural network framework for uncertainty in traffic prediction with higher generalizability.
We show that normalization alters the training process of deep neural networks by controlling the model's complexity.
Our findings are especially relevant to traffic management applications, where predicting traffic conditions across multiple locations is the goal.
arXiv Detail & Related papers (2023-07-12T06:23:31Z) - Learning Sample Difficulty from Pre-trained Models for Reliable
Prediction [55.77136037458667]
We propose to utilize large-scale pre-trained models to guide downstream model training with sample difficulty-aware entropy regularization.
We simultaneously improve accuracy and uncertainty calibration across challenging benchmarks.
arXiv Detail & Related papers (2023-04-20T07:29:23Z) - Deep Sequence Modeling for Anomalous ISP Traffic Prediction [3.689539481706835]
We investigated and evaluated the performance of different deep sequence models for anomalous traffic prediction.
LSTM_Encoder_Decoder (LSTM_En_De) is the best prediction model in our experiment, reducing the deviation between actual and predicted traffic by more than 11% after adjusting the outliers.
arXiv Detail & Related papers (2022-05-03T17:01:45Z) - An Empirical Study on Internet Traffic Prediction Using Statistical
Rolling Model [3.689539481706835]
The seasonality of our traffic has been explicitly modeled using SARIMA, which reduces the rolling prediction Mean Average Percentage Error (MAPE) by more than 4%.
We further improved traffic prediction using SARIMAX to learn different factors extracted from the original traffic, which yielded the best rolling prediction results with a MAPE of 6.83%.
arXiv Detail & Related papers (2022-05-03T16:15:00Z) - Transforming Model Prediction for Tracking [109.08417327309937]
Transformers capture global relations with little inductive bias, allowing it to learn the prediction of more powerful target models.
We train the proposed tracker end-to-end and validate its performance by conducting comprehensive experiments on multiple tracking datasets.
Our tracker sets a new state of the art on three benchmarks, achieving an AUC of 68.5% on the challenging LaSOT dataset.
arXiv Detail & Related papers (2022-03-21T17:59:40Z) - Back2Future: Leveraging Backfill Dynamics for Improving Real-time
Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z) - Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing
Regressions In NLP Model Updates [68.09049111171862]
This work focuses on quantifying, reducing and analyzing regression errors in the NLP model updates.
We formulate the regression-free model updates into a constrained optimization problem.
We empirically analyze how model ensemble reduces regression.
arXiv Detail & Related papers (2021-05-07T03:33:00Z) - A Locally Adaptive Interpretable Regression [7.4267694612331905]
Linear regression is one of the most interpretable prediction models.
In this work, we introduce a locally adaptive interpretable regression (LoAIR)
Our model achieves comparable or better predictive performance than the other state-of-the-art baselines.
arXiv Detail & Related papers (2020-05-07T09:26:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.