Conservative Predictions on Noisy Financial Data
- URL: http://arxiv.org/abs/2310.11815v1
- Date: Wed, 18 Oct 2023 09:14:19 GMT
- Title: Conservative Predictions on Noisy Financial Data
- Authors: Omkar Nabar, Gautam Shroff
- Abstract summary: Price movements in financial markets are well known to be very noisy.
Traditional rule-learning techniques would seek only high precision rules and refrain from making predictions where their antecedents did not apply.
We apply a similar approach, where a model abstains from making a prediction on data points that it is uncertain on.
- Score: 6.300716661852326
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Price movements in financial markets are well known to be very noisy. As a
result, even if there are, on occasion, exploitable patterns that could be
picked up by machine-learning algorithms, these are obscured by feature and
label noise rendering the predictions less useful, and risky in practice.
Traditional rule-learning techniques developed for noisy data, such as CN2,
would seek only high precision rules and refrain from making predictions where
their antecedents did not apply. We apply a similar approach, where a model
abstains from making a prediction on data points that it is uncertain on.
During training, a cascade of such models are learned in sequence, similar to
rule lists, with each model being trained only on data on which the previous
model(s) were uncertain. Similar pruning of data takes place at test-time, with
(higher accuracy) predictions being made albeit only on a fraction (support) of
test-time data. In a financial prediction setting, such an approach allows
decisions to be taken only when the ensemble model is confident, thereby
reducing risk. We present results using traditional MLPs as well as
differentiable decision trees, on synthetic data as well as real financial
market data, to predict fixed-term returns using commonly used features. We
submit that our approach is likely to result in better overall returns at a
lower level of risk. In this context we introduce an utility metric to measure
the average gain per trade, as well as the return adjusted for downside risk,
both of which are improved significantly by our approach.
Related papers
- Enhancing Mean-Reverting Time Series Prediction with Gaussian Processes:
Functional and Augmented Data Structures in Financial Forecasting [0.0]
We explore the application of Gaussian Processes (GPs) for predicting mean-reverting time series with an underlying structure.
GPs offer the potential to forecast not just the average prediction but the entire probability distribution over a future trajectory.
This is particularly beneficial in financial contexts, where accurate predictions alone may not suffice if incorrect volatility assessments lead to capital losses.
arXiv Detail & Related papers (2024-02-23T06:09:45Z) - Best of Many in Both Worlds: Online Resource Allocation with Predictions under Unknown Arrival Model [16.466711636334587]
Online decision-makers often obtain predictions on future variables, such as arrivals, demands, and so on.
Prediction accuracy is unknown to decision-makers a priori, hence blindly following the predictions can be harmful.
We develop algorithms that utilize predictions in a manner that is robust to the unknown prediction accuracy.
arXiv Detail & Related papers (2024-02-21T04:57:32Z) - Distribution-free risk assessment of regression-based machine learning
algorithms [6.507711025292814]
We focus on regression algorithms and the risk-assessment task of computing the probability of the true label lying inside an interval defined around the model's prediction.
We solve the risk-assessment problem using the conformal prediction approach, which provides prediction intervals that are guaranteed to contain the true label with a given probability.
arXiv Detail & Related papers (2023-10-05T13:57:24Z) - Feature Selection with Annealing for Forecasting Financial Time Series [2.44755919161855]
This study provides a comprehensive method for forecasting financial time series based on tactical input output feature mapping techniques using machine learning (ML) models.
Experiments indicate that the FSA algorithm increased the performance of ML models, regardless of problem type.
arXiv Detail & Related papers (2023-03-03T21:33:38Z) - DeepVol: Volatility Forecasting from High-Frequency Data with Dilated Causal Convolutions [53.37679435230207]
We propose DeepVol, a model based on Dilated Causal Convolutions that uses high-frequency data to forecast day-ahead volatility.
Our empirical results suggest that the proposed deep learning-based approach effectively learns global features from high-frequency data.
arXiv Detail & Related papers (2022-09-23T16:13:47Z) - Leveraging Unlabeled Data to Predict Out-of-Distribution Performance [63.740181251997306]
Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions.
In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data.
We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on the model's confidence, predicting accuracy as the fraction of unlabeled examples.
arXiv Detail & Related papers (2022-01-11T23:01:12Z) - Imputation-Free Learning from Incomplete Observations [73.15386629370111]
We introduce the importance of guided gradient descent (IGSGD) method to train inference from inputs containing missing values without imputation.
We employ reinforcement learning (RL) to adjust the gradients used to train the models via back-propagation.
Our imputation-free predictions outperform the traditional two-step imputation-based predictions using state-of-the-art imputation methods.
arXiv Detail & Related papers (2021-07-05T12:44:39Z) - Quantifying sources of uncertainty in drug discovery predictions with
probabilistic models [0.0]
Knowing the uncertainty in a prediction is critical when making expensive investment decisions and when patient safety is paramount.
Machine learning (ML) models in drug discovery typically provide only a single best estimate and ignore all sources of uncertainty.
Probabilistic predictive models (PPMs) can incorporate uncertainty in both the data and model, and return a distribution of predicted values.
arXiv Detail & Related papers (2021-05-18T18:54:54Z) - SLOE: A Faster Method for Statistical Inference in High-Dimensional
Logistic Regression [68.66245730450915]
We develop an improved method for debiasing predictions and estimating frequentist uncertainty for practical datasets.
Our main contribution is SLOE, an estimator of the signal strength with convergence guarantees that reduces the computation time of estimation and inference by orders of magnitude.
arXiv Detail & Related papers (2021-03-23T17:48:56Z) - Unlabelled Data Improves Bayesian Uncertainty Calibration under
Covariate Shift [100.52588638477862]
We develop an approximate Bayesian inference scheme based on posterior regularisation.
We demonstrate the utility of our method in the context of transferring prognostic models of prostate cancer across globally diverse populations.
arXiv Detail & Related papers (2020-06-26T13:50:19Z) - Super-App Behavioral Patterns in Credit Risk Models: Financial,
Statistical and Regulatory Implications [110.54266632357673]
We present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models.
Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals.
arXiv Detail & Related papers (2020-05-09T01:32:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.