Can we imitate stock price behavior to reinforcement learn option price?
- URL: http://arxiv.org/abs/2105.11376v1
- Date: Mon, 24 May 2021 16:08:58 GMT
- Title: Can we imitate stock price behavior to reinforcement learn option price?
- Authors: Xin Jin
- Abstract summary: This paper presents a framework of imitating the price behavior of the underlying stock for reinforcement learning option price.
We use accessible features of the equities pricing data to construct a non-deterministic Markov decision process.
Our algorithm then maps imitative principal investor's decisions to simulated stock price paths by a Bayesian deep neural network.
- Score: 7.771514118651335
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents a framework of imitating the price behavior of the
underlying stock for reinforcement learning option price. We use accessible
features of the equities pricing data to construct a non-deterministic Markov
decision process for modeling stock price behavior driven by principal
investor's decision making. However, low signal-to-noise ratio and instability
that appear immanent in equity markets pose challenges to determine the state
transition (price change) after executing an action (principal investor's
decision) as well as decide an action based on current state (spot price). In
order to conquer these challenges, we resort to a Bayesian deep neural network
for computing the predictive distribution of the state transition led by an
action. Additionally, instead of exploring a state-action relationship to
formulate a policy, we seek for an episode based visible-hidden state-action
relationship to probabilistically imitate principal investor's successive
decision making. Our algorithm then maps imitative principal investor's
decisions to simulated stock price paths by a Bayesian deep neural network.
Eventually the optimal option price is reinforcement learned through maximizing
the cumulative risk-adjusted return of a dynamically hedged portfolio over
simulated price paths of the underlying.
Related papers
- Deep State-Space Model for Predicting Cryptocurrency Price [16.871928140625332]
We tackle the challenging problem of predicting day-ahead crypto-currency prices.
Our approach keeps the probabilistic formulation of the state-space model.
We call the proposed approach the deep state-space model.
arXiv Detail & Related papers (2023-11-21T08:49:55Z) - Cryptocurrency Portfolio Optimization by Neural Networks [81.20955733184398]
This paper proposes an effective algorithm based on neural networks to take advantage of these investment products.
A deep neural network, which outputs the allocation weight of each asset at a time interval, is trained to maximize the Sharpe ratio.
A novel loss term is proposed to regulate the network's bias towards a specific asset, thus enforcing the network to learn an allocation strategy that is close to a minimum variance strategy.
arXiv Detail & Related papers (2023-10-02T12:33:28Z) - Diffusion Variational Autoencoder for Tackling Stochasticity in
Multi-Step Regression Stock Price Prediction [54.21695754082441]
Multi-step stock price prediction over a long-term horizon is crucial for forecasting its volatility.
Current solutions to multi-step stock price prediction are mostly designed for single-step, classification-based predictions.
We combine a deep hierarchical variational-autoencoder (VAE) and diffusion probabilistic techniques to do seq2seq stock prediction.
Our model is shown to outperform state-of-the-art solutions in terms of its prediction accuracy and variance.
arXiv Detail & Related papers (2023-08-18T16:21:15Z) - Contextual Dynamic Pricing with Strategic Buyers [93.97401997137564]
We study the contextual dynamic pricing problem with strategic buyers.
Seller does not observe the buyer's true feature, but a manipulated feature according to buyers' strategic behavior.
We propose a strategic dynamic pricing policy that incorporates the buyers' strategic behavior into the online learning to maximize the seller's cumulative revenue.
arXiv Detail & Related papers (2023-07-08T23:06:42Z) - HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and
Regime-Switch VAE [113.47287249524008]
It is still an open question to build a factor model that can conduct stock prediction in an online and adaptive setting.
We propose the first deep learning based online and adaptive factor model, HireVAE, at the core of which is a hierarchical latent space that embeds the relationship between the market situation and stock-wise latent factors.
Across four commonly used real stock market benchmarks, the proposed HireVAE demonstrate superior performance in terms of active returns over previous methods.
arXiv Detail & Related papers (2023-06-05T12:58:13Z) - Autoregressive Bandits [58.46584210388307]
We propose a novel online learning setting, Autoregressive Bandits, in which the observed reward is governed by an autoregressive process of order $k$.
We show that, under mild assumptions on the reward process, the optimal policy can be conveniently computed.
We then devise a new optimistic regret minimization algorithm, namely, AutoRegressive Upper Confidence Bound (AR-UCB), that suffers sublinear regret of order $widetildemathcalO left( frac(k+1)3/2sqrtnT (1-G
arXiv Detail & Related papers (2022-12-12T21:37:36Z) - Deep Hedging of Derivatives Using Reinforcement Learning [0.3313576045747072]
We show how reinforcement learning can be used to derive optimal hedging strategies for derivatives when there are transaction costs.
We find that a hybrid approach involving the use of an accounting P&L approach that incorporates a relatively simple valuation model works well.
arXiv Detail & Related papers (2021-03-29T07:43:30Z) - Deep Hedging: Learning Risk-Neutral Implied Volatility Dynamics [0.0]
numerically efficient approach for learning a risk-neutral measure for paths of simulated spot and option prices.
We show that market dynamics are free from "statistical arbitrage" in the absence of transaction costs if and only if they follow a risk-neutral measure.
arXiv Detail & Related papers (2021-03-22T15:38:25Z) - Deep Reinforcement Learning for Stock Portfolio Optimization [0.0]
We will formulate the problem such that we can apply Reinforcement Learning for the task properly.
To maintain a realistic assumption about the market, we will incorporate transaction cost and risk factor into the state as well.
We will present the end-to-end solution for the task with Minimum Variance Portfolio for stock subset selection, and Wavelet Transform for extracting multi-frequency data pattern.
arXiv Detail & Related papers (2020-12-09T10:19:12Z) - Upper Confidence Primal-Dual Reinforcement Learning for CMDP with
Adversarial Loss [145.54544979467872]
We consider online learning for episodically constrained Markov decision processes (CMDPs)
We propose a new emphupper confidence primal-dual algorithm, which only requires the trajectories sampled from the transition model.
Our analysis incorporates a new high-probability drift analysis of Lagrange multiplier processes into the celebrated regret analysis of upper confidence reinforcement learning.
arXiv Detail & Related papers (2020-03-02T05:02:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.