Multivariate Forecasting of Bitcoin Volatility with Gradient Boosting: Deterministic, Probabilistic, and Feature Importance Perspectives
- URL: http://arxiv.org/abs/2511.20105v1
- Date: Tue, 25 Nov 2025 09:26:13 GMT
- Title: Multivariate Forecasting of Bitcoin Volatility with Gradient Boosting: Deterministic, Probabilistic, and Feature Importance Perspectives
- Authors: Grzegorz Dudek, Mateusz Kasprzyk, Paweł Pełka,
- Abstract summary: This study investigates the application of the Light Gradient Boosting Machine (LGBM) model for both deterministic and probabilistic forecasting of Bitcoin realized volatility.<n>We evaluate the performance of LGBM-based models and compare them with both econometric and machine learning baselines.
- Score: 1.8352113484137627
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This study investigates the application of the Light Gradient Boosting Machine (LGBM) model for both deterministic and probabilistic forecasting of Bitcoin realized volatility. Utilizing a comprehensive set of 69 predictors -- encompassing market, behavioral, and macroeconomic indicators -- we evaluate the performance of LGBM-based models and compare them with both econometric and machine learning baselines. For probabilistic forecasting, we explore two quantile-based approaches: direct quantile regression using the pinball loss function, and a residual simulation method that transforms point forecasts into predictive distributions. To identify the main drivers of volatility, we employ gain-based and permutation feature importance techniques, consistently highlighting the significance of trading volume, lagged volatility measures, investor attention, and market capitalization. The results demonstrate that LGBM models effectively capture the nonlinear and high-variance characteristics of cryptocurrency markets while providing interpretable insights into the underlying volatility dynamics.
Related papers
- Exploring the Interpretability of Forecasting Models for Energy Balancing Market [43.548887305614585]
The balancing market in the energy sector plays a critical role in physically and financially balancing the supply and demand.<n>While complex machine learning models can achieve high accuracy, their black-box nature severely limits the model interpretability.<n>This paper explores the trade-off between model accuracy and interpretability for the energy balancing market.
arXiv Detail & Related papers (2026-01-19T12:56:41Z) - Partial multivariate transformer as a tool for cryptocurrencies time series prediction [0.0]
We show that a partial-multivariate strategy achieves significant statistical accuracy, effectively balancing informative signals with noise.<n>Lower prediction error did not consistently translate to higher financial returns in simulations.<n>This finding challenges the reliance on traditional error metrics and highlights the need to develop evaluation criteria more aligned with real-world financial objectives.
arXiv Detail & Related papers (2025-11-22T21:59:32Z) - Auditing Algorithmic Bias in Transformer-Based Trading [10.235738752130803]
We employ a transformer model for prediction, and introduce a metric based on Partial Information Decomposition (PID) to measure the influence of each asset on the model's decision making.<n>Our analysis reveals two key observations: first, the model disregards data volatility entirely, and second, it is biased toward data with lower-frequency price movements.
arXiv Detail & Related papers (2025-10-01T21:20:26Z) - Probabilistic Forecasting Cryptocurrencies Volatility: From Point to Quantile Forecasts [1.8352113484137627]
This paper introduces probabilistic forecasting methods that leverage point forecasts from a wide range of base models.<n>To the best of our knowledge, this is the first study in the literature to propose and systematically evaluate probabilistic forecasts of variance in cryptocurrency markets.<n>Our empirical results for Bitcoin demonstrate that the Quantile Estimation through Residual Simulation (QRS) method consistently outperforms more sophisticated alternatives.
arXiv Detail & Related papers (2025-08-21T18:42:11Z) - Autoencoder Enhanced Realised GARCH on Volatility Forecasting [2.1902930328664914]
This thesis aims to synthesise the impact of various realised volatility measures on volatility forecasting.
We propose an extension of the Realised GARCH model that incorporates an autoencoder-generated synthetic realised measure.
arXiv Detail & Related papers (2024-11-26T06:05:44Z) - Diffusion Variational Autoencoder for Tackling Stochasticity in
Multi-Step Regression Stock Price Prediction [54.21695754082441]
Multi-step stock price prediction over a long-term horizon is crucial for forecasting its volatility.
Current solutions to multi-step stock price prediction are mostly designed for single-step, classification-based predictions.
We combine a deep hierarchical variational-autoencoder (VAE) and diffusion probabilistic techniques to do seq2seq stock prediction.
Our model is shown to outperform state-of-the-art solutions in terms of its prediction accuracy and variance.
arXiv Detail & Related papers (2023-08-18T16:21:15Z) - Deep Learning Enhanced Realized GARCH [6.211385208178938]
We propose a new approach to volatility modeling by combining deep learning (LSTM) and realized volatility measures.
This LSTM-enhanced realized GARCH framework incorporates and distills modeling advances from financial econometrics, high frequency trading data and deep learning.
arXiv Detail & Related papers (2023-02-16T00:20:43Z) - Evaluating Probabilistic Classifiers: The Triptych [62.997667081978825]
We propose and study a triptych of diagnostic graphics that focus on distinct and complementary aspects of forecast performance.
The reliability diagram addresses calibration, the receiver operating characteristic (ROC) curve diagnoses discrimination ability, and the Murphy diagram visualizes overall predictive performance and value.
arXiv Detail & Related papers (2023-01-25T19:35:23Z) - DeepVol: Volatility Forecasting from High-Frequency Data with Dilated Causal Convolutions [53.37679435230207]
We propose DeepVol, a model based on Dilated Causal Convolutions that uses high-frequency data to forecast day-ahead volatility.
Our empirical results suggest that the proposed deep learning-based approach effectively learns global features from high-frequency data.
arXiv Detail & Related papers (2022-09-23T16:13:47Z) - Bayesian Bilinear Neural Network for Predicting the Mid-price Dynamics
in Limit-Order Book Markets [84.90242084523565]
Traditional time-series econometric methods often appear incapable of capturing the true complexity of the multi-level interactions driving the price dynamics.
By adopting a state-of-the-art second-order optimization algorithm, we train a Bayesian bilinear neural network with temporal attention.
By addressing the use of predictive distributions to analyze errors and uncertainties associated with the estimated parameters and model forecasts, we thoroughly compare our Bayesian model with traditional ML alternatives.
arXiv Detail & Related papers (2022-03-07T18:59:54Z) - A Sentiment Analysis Approach to the Prediction of Market Volatility [62.997667081978825]
We have explored the relationship between sentiment extracted from financial news and tweets and FTSE100 movements.
The sentiment captured from news headlines could be used as a signal to predict market returns; the same does not apply for volatility.
We developed an accurate classifier for the prediction of market volatility in response to the arrival of new information.
arXiv Detail & Related papers (2020-12-10T01:15:48Z) - A generative adversarial network approach to calibration of local
stochastic volatility models [2.1485350418225244]
We propose a fully data-driven approach to calibrate local volatility (LSV) models.
We parametrize the leverage function by a family of feed-forward neural networks and learn their parameters directly from the available market option prices.
This should be seen in the context of neural SDEs and (causal) generative adversarial networks.
arXiv Detail & Related papers (2020-05-05T21:26:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.