Improving CNN-base Stock Trading By Considering Data Heterogeneity and
Burst
- URL: http://arxiv.org/abs/2303.09407v1
- Date: Tue, 14 Mar 2023 01:05:17 GMT
- Title: Improving CNN-base Stock Trading By Considering Data Heterogeneity and
Burst
- Authors: Keer Yang, Guanqun Zhang, Chuan Bi, Qiang Guan, Hailu Xu, Shuai Xu
- Abstract summary: We propose to use CNN as the core functionality of such framework, because it is able to learn the spatial dependency (i.e., between rows and columns) of the input data.
We then develop novel normalization process to prepare the stock data.
Experiment results show that our approach can outperform other comparing approaches.
- Score: 1.6637373649145604
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In recent years, there have been quite a few attempts to apply intelligent
techniques to financial trading, i.e., constructing automatic and intelligent
trading framework based on historical stock price. Due to the unpredictable,
uncertainty and volatile nature of financial market, researchers have also
resorted to deep learning to construct the intelligent trading framework. In
this paper, we propose to use CNN as the core functionality of such framework,
because it is able to learn the spatial dependency (i.e., between rows and
columns) of the input data. However, different with existing deep
learning-based trading frameworks, we develop novel normalization process to
prepare the stock data. In particular, we first empirically observe that the
stock data is intrinsically heterogeneous and bursty, and then validate the
heterogeneity and burst nature of stock data from a statistical perspective.
Next, we design the data normalization method in a way such that the data
heterogeneity is preserved and bursty events are suppressed. We verify out
developed CNN-based trading framework plus our new normalization method on 29
stocks. Experiment results show that our approach can outperform other
comparing approaches.
Related papers
- Trading through Earnings Seasons using Self-Supervised Contrastive Representation Learning [1.6574413179773761]
Contrastive Earnings Transformer (CET) is a self-supervised learning approach rooted in Contrastive Predictive Coding (CPC)
Our research delves deep into the intricacies of stock data, evaluating how various models handle the rapidly changing relevance of earnings data over time and over different sectors.
CET's foundation on CPC allows for a nuanced understanding, facilitating consistent stock predictions even as the earnings data ages.
arXiv Detail & Related papers (2024-09-25T22:09:59Z) - Overcoming Overconfidence for Active Learning [1.2776312584227847]
We present two novel methods to address the problem of overconfidence that arises in the active learning scenario.
The first is an augmentation strategy named Cross-Mix-and-Mix (CMaM), which aims to calibrate the model by expanding the limited training distribution.
The second is a selection strategy named Ranked Margin Sampling (RankedMS), which prevents choosing data that leads to overly confident predictions.
arXiv Detail & Related papers (2023-08-21T09:04:54Z) - Efficient Integration of Multi-Order Dynamics and Internal Dynamics in
Stock Movement Prediction [20.879245331384794]
Recent deep neural network (DNN) methods capture multi-order dynamics using hypergraphs, but rely on the Fourier basis in the convolution.
We propose a framework for stock movement prediction to overcome the above issues.
Our framework outperforms state-of-the-art methods in terms of profit and stability.
arXiv Detail & Related papers (2022-11-11T01:58:18Z) - Rethinking Data Heterogeneity in Federated Learning: Introducing a New
Notion and Standard Benchmarks [65.34113135080105]
We show that not only the issue of data heterogeneity in current setups is not necessarily a problem but also in fact it can be beneficial for the FL participants.
Our observations are intuitive.
Our code is available at https://github.com/MMorafah/FL-SC-NIID.
arXiv Detail & Related papers (2022-09-30T17:15:19Z) - DeepVol: Volatility Forecasting from High-Frequency Data with Dilated Causal Convolutions [53.37679435230207]
We propose DeepVol, a model based on Dilated Causal Convolutions that uses high-frequency data to forecast day-ahead volatility.
Our empirical results suggest that the proposed deep learning-based approach effectively learns global features from high-frequency data.
arXiv Detail & Related papers (2022-09-23T16:13:47Z) - Augmented Bilinear Network for Incremental Multi-Stock Time-Series
Classification [83.23129279407271]
We propose a method to efficiently retain the knowledge available in a neural network pre-trained on a set of securities.
In our method, the prior knowledge encoded in a pre-trained neural network is maintained by keeping existing connections fixed.
This knowledge is adjusted for the new securities by a set of augmented connections, which are optimized using the new data.
arXiv Detail & Related papers (2022-07-23T18:54:10Z) - Bayesian Bilinear Neural Network for Predicting the Mid-price Dynamics
in Limit-Order Book Markets [84.90242084523565]
Traditional time-series econometric methods often appear incapable of capturing the true complexity of the multi-level interactions driving the price dynamics.
By adopting a state-of-the-art second-order optimization algorithm, we train a Bayesian bilinear neural network with temporal attention.
By addressing the use of predictive distributions to analyze errors and uncertainties associated with the estimated parameters and model forecasts, we thoroughly compare our Bayesian model with traditional ML alternatives.
arXiv Detail & Related papers (2022-03-07T18:59:54Z) - Leveraging Unlabeled Data to Predict Out-of-Distribution Performance [63.740181251997306]
Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions.
In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data.
We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on the model's confidence, predicting accuracy as the fraction of unlabeled examples.
arXiv Detail & Related papers (2022-01-11T23:01:12Z) - Super-App Behavioral Patterns in Credit Risk Models: Financial,
Statistical and Regulatory Implications [110.54266632357673]
We present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models.
Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals.
arXiv Detail & Related papers (2020-05-09T01:32:03Z) - Sentiment and Knowledge Based Algorithmic Trading with Deep
Reinforcement Learning [0.0]
Algorithmic trading, due to its inherent nature, is a difficult problem to tackle; there are too many variables involved in the real world which make it almost impossible to have reliable algorithms for automated stock trading.
The lack of reliable labelled data that considers physical and physiological factors that dictate the ups and downs of the market, has hindered the supervised learning attempts for dependable predictions.
We formulate an approach using reinforcement learning which uses traditional time series stock price data and combines it with news headline sentiments, while leveraging knowledge graphs for exploiting news about implicit relationships.
arXiv Detail & Related papers (2020-01-26T05:27:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.