Discovering indicators of dark horse of soccer games by deep learning
from sequential trading data
- URL: http://arxiv.org/abs/2008.00682v2
- Date: Tue, 4 Aug 2020 01:59:51 GMT
- Title: Discovering indicators of dark horse of soccer games by deep learning
from sequential trading data
- Authors: Liyao Lu and Qiang Lyu
- Abstract summary: A deep learning model is designed and trained on a real sequential trading data from the real prediction market.
A new loss function is proposed which biases the selection toward matches with high investment return to train our model.
Full investigation of 4669 top soccer league matches showed that our model traded off prediction accuracy for high value return.
- Score: 0.6526824510982799
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: It is not surprise for machine learning models to provide decent prediction
accuracy of soccer games outcomes based on various objective metrics. However,
the performance is not that decent in terms of predicting difficult and
valuable matches. A deep learning model is designed and trained on a real
sequential trading data from the real prediction market, with the assumption
that such trading data contain critical latent information to determine the
game outcomes. A new loss function is proposed which biases the selection
toward matches with high investment return to train our model. Full
investigation of 4669 top soccer league matches showed that our model traded
off prediction accuracy for high value return due to a certain ability to
detect dark horses. A further try is conducted to depict some indicators
discovered by our model for describing key features of big dark horses and
regular hot horses.
Related papers
- Machine Learning for Soccer Match Result Prediction [0.9002260638342727]
This chapter discusses available datasets, the types of models and features, and ways of evaluating model performance.
The aim of this chapter is to give a broad overview of the current state and potential future developments in machine learning for soccer match results prediction.
arXiv Detail & Related papers (2024-03-12T14:00:50Z) - Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines [83.65380507372483]
Large pre-trained models can dramatically reduce the amount of task-specific data required to solve a problem, but they often fail to capture domain-specific nuances out of the box.
This paper shows how to leverage recent advances in NLP and multi-modal learning to augment a pre-trained model with search engine retrieval.
arXiv Detail & Related papers (2023-11-29T05:33:28Z) - Explainable artificial intelligence model for identifying Market Value
in Professional Soccer Players [2.2590064835234913]
Using data from about 12,000 players from Sofifa, the Boruta algorithm streamlined feature selection.
The Gradient Boosting Decision Tree (GBDT) model excelled in predictive accuracy, with an R-squared of 0.901 and a Root Mean Squared Error (RMSE) of 3,221,632.175.
arXiv Detail & Related papers (2023-11-08T11:01:32Z) - Evaluating Soccer Match Prediction Models: A Deep Learning Approach and
Feature Optimization for Gradient-Boosted Trees [0.8009842832476994]
The 2023 Soccer Prediction Challenge required the prediction of match results first in terms of the exact goals scored by each team, and second, in terms of the probabilities for a win, draw, and loss.
A CatBoost model was employed using pi-ratings as the features, which were initially identified as the optimal choice for calculating the win/draw/loss probabilities.
In this study, we aimed to assess the performance of a deep learning model and determine the optimal feature set for a gradient-boosted tree model.
arXiv Detail & Related papers (2023-09-26T10:05:46Z) - Betting the system: Using lineups to predict football scores [0.0]
This paper aims to reduce randomness in football by analysing the role of lineups in final scores.
Football clubs invest millions of dollars on lineups and knowing how individual statistics translate to better outcomes can optimise investments.
Sports betting is growing exponentially and being able to predict the future is profitable and desirable.
arXiv Detail & Related papers (2022-10-12T15:47:42Z) - VisFIS: Visual Feature Importance Supervision with
Right-for-the-Right-Reason Objectives [84.48039784446166]
We show that model FI supervision can meaningfully improve VQA model accuracy as well as performance on several Right-for-the-Right-Reason metrics.
Our best performing method, Visual Feature Importance Supervision (VisFIS), outperforms strong baselines on benchmark VQA datasets.
Predictions are more accurate when explanations are plausible and faithful, and not when they are plausible but not faithful.
arXiv Detail & Related papers (2022-06-22T17:02:01Z) - Learning to Predict Trustworthiness with Steep Slope Loss [69.40817968905495]
We study the problem of predicting trustworthiness on real-world large-scale datasets.
We observe that the trustworthiness predictors trained with prior-art loss functions are prone to view both correct predictions and incorrect predictions to be trustworthy.
We propose a novel steep slope loss to separate the features w.r.t. correct predictions from the ones w.r.t. incorrect predictions by two slide-like curves that oppose each other.
arXiv Detail & Related papers (2021-09-30T19:19:09Z) - Machine learning models for DOTA 2 outcomes prediction [8.388178167818635]
This research paper predominantly focuses on building predictive machine and deep learning models to identify the outcome of the Dota 2 MOBA game.
Three models were investigated and compared: Linear Regression (LR), Neural Networks (NN), and a type of recurrent neural network Long Short-Term Memory (LSTM)
arXiv Detail & Related papers (2021-06-03T12:10:26Z) - Hidden Biases in Unreliable News Detection Datasets [60.71991809782698]
We show that selection bias during data collection leads to undesired artifacts in the datasets.
We observed a significant drop (>10%) in accuracy for all models tested in a clean split with no train/test source overlap.
We suggest future dataset creation include a simple model as a difficulty/bias probe and future model development use a clean non-overlapping site and date split.
arXiv Detail & Related papers (2021-04-20T17:16:41Z) - Injecting Knowledge in Data-driven Vehicle Trajectory Predictors [82.91398970736391]
Vehicle trajectory prediction tasks have been commonly tackled from two perspectives: knowledge-driven or data-driven.
In this paper, we propose to learn a "Realistic Residual Block" (RRB) which effectively connects these two perspectives.
Our proposed method outputs realistic predictions by confining the residual range and taking into account its uncertainty.
arXiv Detail & Related papers (2021-03-08T16:03:09Z) - Value-driven Hindsight Modelling [68.658900923595]
Value estimation is a critical component of the reinforcement learning (RL) paradigm.
Model learning can make use of the rich transition structure present in sequences of observations, but this approach is usually not sensitive to the reward function.
We develop an approach for representation learning in RL that sits in between these two extremes.
This provides tractable prediction targets that are directly relevant for a task, and can thus accelerate learning the value function.
arXiv Detail & Related papers (2020-02-19T18:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.