Bi-LSTM Price Prediction based on Attention Mechanism
- URL: http://arxiv.org/abs/2212.03443v2
- Date: Sun, 18 Jun 2023 15:04:13 GMT
- Title: Bi-LSTM Price Prediction based on Attention Mechanism
- Authors: Jiashu Lou, Leyi Cui, Ye Li
- Abstract summary: We propose a bidirectional LSTM neural network based on an attention mechanism, which is based on two popular assets, gold and bitcoin.
Using the forecast results, we achieved a return of 1089.34% in two years.
We also compare the attention Bi-LSTM model proposed in this paper with the traditional model, and the results show that our model has the best performance in this data set.
- Score: 2.455751370157653
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the increasing enrichment and development of the financial derivatives
market, the frequency of transactions is also faster and faster. Due to human
limitations, algorithms and automatic trading have recently become the focus of
discussion. In this paper, we propose a bidirectional LSTM neural network based
on an attention mechanism, which is based on two popular assets, gold and
bitcoin. In terms of Feature Engineering, on the one hand, we add traditional
technical factors, and at the same time, we combine time series models to
develop factors. In the selection of model parameters, we finally chose a
two-layer deep learning network. According to AUC measurement, the accuracy of
bitcoin and gold is 71.94% and 73.03% respectively. Using the forecast results,
we achieved a return of 1089.34% in two years. At the same time, we also
compare the attention Bi-LSTM model proposed in this paper with the traditional
model, and the results show that our model has the best performance in this
data set. Finally, we discuss the significance of the model and the
experimental results, as well as the possible improvement direction in the
future.
Related papers
- Margin Matching Preference Optimization: Enhanced Model Alignment with Granular Feedback [64.67540769692074]
Large language models (LLMs) fine-tuned with alignment techniques, such as reinforcement learning from human feedback, have been instrumental in developing some of the most capable AI systems to date.
We introduce an approach called Margin Matching Preference Optimization (MMPO), which incorporates relative quality margins into optimization, leading to improved LLM policies and reward models.
Experiments with both human and AI feedback data demonstrate that MMPO consistently outperforms baseline methods, often by a substantial margin, on popular benchmarks including MT-bench and RewardBench.
arXiv Detail & Related papers (2024-10-04T04:56:11Z) - An Evaluation of Deep Learning Models for Stock Market Trend Prediction [0.3277163122167433]
This study investigates the efficacy of advanced deep learning models for short-term trend forecasting using daily and hourly closing prices from the S&P 500 index and the Brazilian ETF EWZ.
We introduce the Extended Long Short-Term Memory for Time Series (xLSTM-TS) model, an xLSTM adaptation optimised for time series prediction.
Among the models tested, xLSTM-TS consistently outperformed others. For example, it achieved a test accuracy of 72.82% and an F1 score of 73.16% on the EWZ daily dataset.
arXiv Detail & Related papers (2024-08-22T13:58:55Z) - AI-Powered Energy Algorithmic Trading: Integrating Hidden Markov Models with Neural Networks [0.0]
This study introduces a new approach that combines Hidden Markov Models (HMM) and neural networks, integrated with Black-Litterman portfolio optimization.
During the COVID period ( 2019-2022), this dual-model approach achieved a 83% return with a Sharpe ratio of 0.77.
arXiv Detail & Related papers (2024-07-29T10:26:52Z) - Gated recurrent neural network with TPE Bayesian optimization for enhancing stock index prediction accuracy [0.0]
The aim is to improve the prediction accuracy of the next day's closing price of the NIFTY 50 index, a prominent Indian stock market index.
A combination of eight influential factors is carefully chosen from fundamental stock data, technical indicators, crude oil price, and macroeconomic data to train the models.
arXiv Detail & Related papers (2024-06-02T06:39:01Z) - Comparative Study of Bitcoin Price Prediction [0.0]
We employ five-fold cross-validation to enhance generalization and utilize L2 regularization to reduce overfitting and noise.
Our study demonstrates that the GRUs models offer better accuracy than LSTMs model for predicting Bitcoin's price.
arXiv Detail & Related papers (2024-05-13T18:10:34Z) - Bayesian Bilinear Neural Network for Predicting the Mid-price Dynamics
in Limit-Order Book Markets [84.90242084523565]
Traditional time-series econometric methods often appear incapable of capturing the true complexity of the multi-level interactions driving the price dynamics.
By adopting a state-of-the-art second-order optimization algorithm, we train a Bayesian bilinear neural network with temporal attention.
By addressing the use of predictive distributions to analyze errors and uncertainties associated with the estimated parameters and model forecasts, we thoroughly compare our Bayesian model with traditional ML alternatives.
arXiv Detail & Related papers (2022-03-07T18:59:54Z) - Sparse MoEs meet Efficient Ensembles [49.313497379189315]
We study the interplay of two popular classes of such models: ensembles of neural networks and sparse mixture of experts (sparse MoEs)
We present Efficient Ensemble of Experts (E$3$), a scalable and simple ensemble of sparse MoEs that takes the best of both classes of models, while using up to 45% fewer FLOPs than a deep ensemble.
arXiv Detail & Related papers (2021-10-07T11:58:35Z) - Markdowns in E-Commerce Fresh Retail: A Counterfactual Prediction and
Multi-Period Optimization Approach [29.11201102550876]
We build a semi-parametric structural model to learn individual price elasticity and predict counterfactual demand.
We propose a multi-period dynamic pricing algorithm to maximize the overall profit of a perishable product over its finite selling horizon.
The proposed framework has been successfully deployed to the well-known e-commerce fresh retail scenario - Freshippo.
arXiv Detail & Related papers (2021-05-18T07:01:37Z) - ANNETTE: Accurate Neural Network Execution Time Estimation with Stacked
Models [56.21470608621633]
We propose a time estimation framework to decouple the architectural search from the target hardware.
The proposed methodology extracts a set of models from micro- kernel and multi-layer benchmarks and generates a stacked model for mapping and network execution time estimation.
We compare estimation accuracy and fidelity of the generated mixed models, statistical models with the roofline model, and a refined roofline model for evaluation.
arXiv Detail & Related papers (2021-05-07T11:39:05Z) - Deep Stock Predictions [58.720142291102135]
We consider the design of a trading strategy that performs portfolio optimization using Long Short Term Memory (LSTM) neural networks.
We then customize the loss function used to train the LSTM to increase the profit earned.
We find the LSTM model with the customized loss function to have an improved performance in the training bot over a regressive baseline such as ARIMA.
arXiv Detail & Related papers (2020-06-08T23:37:47Z) - Deep Probabilistic Modelling of Price Movements for High-Frequency
Trading [0.0]
We propose a deep recurrent architecture for the probabilistic modelling of high-frequency market prices.
The resulting deep mixture models simultaneously address several practical challenges important in the development of automated high-frequency trading strategies.
We show that our model outperforms the benchmark models in both a metric-based test and in a simulated trading scenario.
arXiv Detail & Related papers (2020-03-31T19:25:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.