Unravelling the Probabilistic Forest: Arbitrage in Prediction Markets
- URL: http://arxiv.org/abs/2508.03474v1
- Date: Tue, 05 Aug 2025 14:06:50 GMT
- Title: Unravelling the Probabilistic Forest: Arbitrage in Prediction Markets
- Authors: Oriol Saguillo, Vahid Ghafouri, Lucianna Kiffer, Guillermo Suarez-Tangil,
- Abstract summary: Polymarket is a prediction market platform where users can speculate on future events by shares tied to specific outcomes, known as conditions.<n>Despite this design, Polymarket exhibits cases where dependent assets are mispriced.<n>This phenomenon, known as arbitrage, could enable sophisticated participants to exploit such inconsistencies.
- Score: 2.9311414545087366
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Polymarket is a prediction market platform where users can speculate on future events by trading shares tied to specific outcomes, known as conditions. Each market is associated with a set of one or more such conditions. To ensure proper market resolution, the condition set must be exhaustive -- collectively accounting for all possible outcomes -- and mutually exclusive -- only one condition may resolve as true. Thus, the collective prices of all related outcomes should be \$1, representing a combined probability of 1 of any outcome. Despite this design, Polymarket exhibits cases where dependent assets are mispriced, allowing for purchasing (or selling) a certain outcome for less than (or more than) \$1, guaranteeing profit. This phenomenon, known as arbitrage, could enable sophisticated participants to exploit such inconsistencies. In this paper, we conduct an empirical arbitrage analysis on Polymarket data to answer three key questions: (Q1) What conditions give rise to arbitrage (Q2) Does arbitrage actually occur on Polymarket and (Q3) Has anyone exploited these opportunities. A major challenge in analyzing arbitrage between related markets lies in the scalability of comparisons across a large number of markets and conditions, with a naive analysis requiring $O(2^{n+m})$ comparisons. To overcome this, we employ a heuristic-driven reduction strategy based on timeliness, topical similarity, and combinatorial relationships, further validated by expert input. Our study reveals two distinct forms of arbitrage on Polymarket: Market Rebalancing Arbitrage, which occurs within a single market or condition, and Combinatorial Arbitrage, which spans across multiple markets. We use on-chain historical order book data to analyze when these types of arbitrage opportunities have existed, and when they have been executed by users. We find a realized estimate of 40 million USD of profit extracted.
Related papers
- Quantifying Arbitrage in Automated Market Makers: An Empirical Study of Ethereum ZK Rollups [6.892626226074608]
This work systematically reviews arbitrage opportunities between Automated Market Makers (AMMs) on ZK rollups, and Centralised Exchanges (CEXs)
We propose a theoretical framework to measure such arbitrage opportunities and derive a formula for the related Maximal Arbitrage Value (MAV)
Overall, the cumulative MAV from July to 2023 on the USDC-ETH SyncSwap pool amounts to $104.96k (0.24% of trading volume)
arXiv Detail & Related papers (2024-03-24T10:26:34Z) - Diffusion Variational Autoencoder for Tackling Stochasticity in
Multi-Step Regression Stock Price Prediction [54.21695754082441]
Multi-step stock price prediction over a long-term horizon is crucial for forecasting its volatility.
Current solutions to multi-step stock price prediction are mostly designed for single-step, classification-based predictions.
We combine a deep hierarchical variational-autoencoder (VAE) and diffusion probabilistic techniques to do seq2seq stock prediction.
Our model is shown to outperform state-of-the-art solutions in terms of its prediction accuracy and variance.
arXiv Detail & Related papers (2023-08-18T16:21:15Z) - Joint Latent Topic Discovery and Expectation Modeling for Financial
Markets [45.758436505779386]
We present a groundbreaking framework for financial market analysis.
This approach is the first to jointly model investor expectations and automatically mine latent stock relationships.
Our model consistently achieves an annual return exceeding 10%.
arXiv Detail & Related papers (2023-06-01T01:36:51Z) - Parity in Markets -- Methods, Costs, and Consequences [109.5267969644294]
We show how market designers can use taxes or subsidies in Fisher markets to ensure that market equilibrium outcomes fall within certain constraints.
We adapt various types of fairness constraints proposed in existing literature to the market case and show who benefits and who loses from these constraints.
arXiv Detail & Related papers (2022-10-05T22:27:44Z) - Quantum computational finance: martingale asset pricing for incomplete
markets [69.73491758935712]
We show that a variety of quantum techniques can be applied to the pricing problem in finance.
We discuss three different methods that are distinct from previous works.
arXiv Detail & Related papers (2022-09-19T09:22:01Z) - Competition, Alignment, and Equilibria in Digital Marketplaces [97.03797129675951]
We study a duopoly market where platform actions are bandit algorithms and the two platforms compete for user participation.
Our main finding is that competition in this market does not perfectly align market outcomes with user utility.
arXiv Detail & Related papers (2022-08-30T17:43:58Z) - Price Interpretability of Prediction Markets: A Convergence Analysis [6.623653707600985]
We show that the limiting price converges to the geometric mean of agent beliefs in exponential utility-based markets.
We also show that the limiting price can be characterized by systems of equations that encapsulate agent beliefs, risk parameters, and wealth.
arXiv Detail & Related papers (2022-05-18T13:15:11Z) - Deep Q-Learning Market Makers in a Multi-Agent Simulated Stock Market [58.720142291102135]
This paper focuses precisely on the study of these markets makers strategies from an agent-based perspective.
We propose the application of Reinforcement Learning (RL) for the creation of intelligent market markers in simulated stock markets.
arXiv Detail & Related papers (2021-12-08T14:55:21Z) - Learning who is in the market from time series: market participant
discovery through adversarial calibration of multi-agent simulators [0.0]
In electronic trading markets only the price or volume time series are directly observable.
We propose a novel two-step method to train a discriminator that is able to distinguish between "real" and "fake" price and volume time series.
arXiv Detail & Related papers (2021-08-02T06:53:37Z) - Deep Learning Statistical Arbitrage [0.0]
We propose a unifying conceptual framework for statistical arbitrage and develop a novel deep learning solution.
We construct arbitrage portfolios of similar assets as residual portfolios from conditional latent asset pricing factors.
We extract the time series signals of these residual portfolios with one of the most powerful machine learning time-series solutions.
arXiv Detail & Related papers (2021-06-08T00:48:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.