Interpretable ML-driven Strategy for Automated Trading Pattern
Extraction
- URL: http://arxiv.org/abs/2103.12419v1
- Date: Tue, 23 Mar 2021 09:55:46 GMT
- Title: Interpretable ML-driven Strategy for Automated Trading Pattern
Extraction
- Authors: Artur Sokolovsky, Luca Arnaboldi, Jaume Bacardit, Thomas Gross
- Abstract summary: We propose a volume-based data pre-processing method for financial time series analysis.
We use a statistical approach for assessing the performance of the method.
Our analysis shows that the proposed method allows successful classification of the financial time series patterns.
- Score: 2.7910505923792646
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Financial markets are a source of non-stationary multidimensional time series
which has been drawing attention for decades. Each financial instrument has its
specific changing over time properties, making their analysis a complex task.
Improvement of understanding and development of methods for financial time
series analysis is essential for successful operation on financial markets. In
this study we propose a volume-based data pre-processing method for making
financial time series more suitable for machine learning pipelines. We use a
statistical approach for assessing the performance of the method. Namely, we
formally state the hypotheses, set up associated classification tasks, compute
effect sizes with confidence intervals, and run statistical tests to validate
the hypotheses. We additionally assess the trading performance of the proposed
method on historical data and compare it to a previously published approach.
Our analysis shows that the proposed volume-based method allows successful
classification of the financial time series patterns, and also leads to better
classification performance than a price action-based method, excelling
specifically on more liquid financial instruments. Finally, we propose an
approach for obtaining feature interactions directly from tree-based models on
example of CatBoost estimator, as well as formally assess the relatedness of
the proposed approach and SHAP feature interactions with a positive outcome.
Related papers
- Context is Key: A Benchmark for Forecasting with Essential Textual Information [87.3175915185287]
"Context is Key" (CiK) is a time series forecasting benchmark that pairs numerical data with diverse types of carefully crafted textual context.
We evaluate a range of approaches, including statistical models, time series foundation models, and LLM-based forecasters.
Our experiments highlight the importance of incorporating contextual information, demonstrate surprising performance when using LLM-based forecasting models, and also reveal some of their critical shortcomings.
arXiv Detail & Related papers (2024-10-24T17:56:08Z) - AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework [48.3060010653088]
We release AlphaFin datasets, combining traditional research datasets, real-time financial data, and handwritten chain-of-thought (CoT) data.
We then use AlphaFin datasets to benchmark a state-of-the-art method, called Stock-Chain, for effectively tackling the financial analysis task.
arXiv Detail & Related papers (2024-03-19T09:45:33Z) - Reinforcement Learning for Financial Index Tracking [0.4297070083645049]
We propose the first discrete-time infinite-horizon dynamic formulation of the financial index tracking problem under both return-based tracking error and value-based tracking error.
The proposed method outperforms a benchmark method in terms of tracking accuracy and has the potential for earning extra profit through cash withdraw strategy.
arXiv Detail & Related papers (2023-08-05T08:34:52Z) - Bayesian Bilinear Neural Network for Predicting the Mid-price Dynamics
in Limit-Order Book Markets [84.90242084523565]
Traditional time-series econometric methods often appear incapable of capturing the true complexity of the multi-level interactions driving the price dynamics.
By adopting a state-of-the-art second-order optimization algorithm, we train a Bayesian bilinear neural network with temporal attention.
By addressing the use of predictive distributions to analyze errors and uncertainties associated with the estimated parameters and model forecasts, we thoroughly compare our Bayesian model with traditional ML alternatives.
arXiv Detail & Related papers (2022-03-07T18:59:54Z) - Leveraging Unlabeled Data to Predict Out-of-Distribution Performance [63.740181251997306]
Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions.
In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data.
We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on the model's confidence, predicting accuracy as the fraction of unlabeled examples.
arXiv Detail & Related papers (2022-01-11T23:01:12Z) - A Hybrid Learning Approach to Detecting Regime Switches in Financial
Markets [0.0]
We present a novel framework for the detection of regime switches within the US financial markets.
Using a combination of cluster analysis and classification, we identify regimes in financial markets based on publicly available economic data.
We display the efficacy of the framework by constructing and assessing the performance of two trading strategies based on detected regimes.
arXiv Detail & Related papers (2021-08-05T01:15:19Z) - How to Identify Investor's types in real financial markets by means of
agent based simulation [0.0]
The paper proposes a computational adaptation of the principles underlying principal component analysis with agent based simulation.
The proposed methodology is to find a reduced set of investor s models which is able to approximate or explain a target financial time series.
arXiv Detail & Related papers (2020-12-31T16:22:30Z) - Deep Portfolio Optimization via Distributional Prediction of Residual
Factors [3.9189409002585562]
We propose a novel method of constructing a portfolio based on predicting the distribution of a financial quantity called residual factors.
We demonstrate the efficacy of our method on U.S. and Japanese stock market data.
arXiv Detail & Related papers (2020-12-14T04:09:52Z) - Evaluating data augmentation for financial time series classification [85.38479579398525]
We evaluate several augmentation methods applied to stocks datasets using two state-of-the-art deep learning models.
For a relatively small dataset augmentation methods achieve up to $400%$ improvement in risk adjusted return performance.
For a larger stock dataset augmentation methods achieve up to $40%$ improvement.
arXiv Detail & Related papers (2020-10-28T17:53:57Z) - Gaussian process imputation of multiple financial series [71.08576457371433]
Multiple time series such as financial indicators, stock prices and exchange rates are strongly coupled due to their dependence on the latent state of the market.
We focus on learning the relationships among financial time series by modelling them through a multi-output Gaussian process.
arXiv Detail & Related papers (2020-02-11T19:18:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.