Biases in Expected Goals Models Confound Finishing Ability
- URL: http://arxiv.org/abs/2401.09940v1
- Date: Thu, 18 Jan 2024 12:41:58 GMT
- Title: Biases in Expected Goals Models Confound Finishing Ability
- Authors: Jesse Davis and Pieter Robberechts
- Abstract summary: Expected Goals (xG) has emerged as a popular tool for evaluating finishing skill in soccer analytics.
This paper aims to address the limitations and nuances surrounding the evaluation of finishing skill using xG statistics.
- Score: 18.67526513350852
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Expected Goals (xG) has emerged as a popular tool for evaluating finishing
skill in soccer analytics. It involves comparing a player's cumulative xG with
their actual goal output, where consistent overperformance indicates strong
finishing ability. However, the assessment of finishing skill in soccer using
xG remains contentious due to players' difficulty in consistently outperforming
their cumulative xG. In this paper, we aim to address the limitations and
nuances surrounding the evaluation of finishing skill using xG statistics.
Specifically, we explore three hypotheses: (1) the deviation between actual and
expected goals is an inadequate metric due to the high variance of shot
outcomes and limited sample sizes, (2) the inclusion of all shots in cumulative
xG calculation may be inappropriate, and (3) xG models contain biases arising
from interdependencies in the data that affect skill measurement. We found that
sustained overperformance of cumulative xG requires both high shot volumes and
exceptional finishing, including all shot types can obscure the finishing
ability of proficient strikers, and that there is a persistent bias that makes
the actual and expected goals closer for excellent finishers than it really is.
Overall, our analysis indicates that we need more nuanced quantitative
approaches for investigating a player's finishing ability, which we achieved
using a technique from AI fairness to learn an xG model that is calibrated for
multiple subgroups of players. As a concrete use case, we show that (1) the
standard biased xG model underestimates Messi's GAX by 17% and (2) Messi's GAX
is 27% higher than the typical elite high-shot-volume attacker, indicating that
Messi is even a more exceptional finisher than people commonly believed.
Related papers
- Infinite Ends from Finite Samples: Open-Ended Goal Inference as Top-Down Bayesian Filtering of Bottom-Up Proposals [48.437581268398866]
We introduce a sequential Monte Carlo model of open-ended goal inference.
We validate this model in a goal inference task called Block Words.
Our experiments highlight the importance of uniting top-down and bottom-up models for explaining the speed, accuracy, and generality of human theory-of-mind.
arXiv Detail & Related papers (2024-07-23T18:04:40Z) - Expected Possession Value of Control and Duel Actions for Soccer Player's Skills Estimation [0.0]
This paper introduces multiple extensions to a widely used model, expected possession value (EPV)
We assign greater weights to events occurring immediately prior to the shot rather than those preceding them (decay effect)
Our model incorporates possession risk more accurately by considering the decay effect and effective playing time.
arXiv Detail & Related papers (2024-06-02T17:29:42Z) - Bayes-xG: Player and Position Correction on Expected Goals (xG) using
Bayesian Hierarchical Approach [55.2480439325792]
This study investigates the influence of player or positional factors in predicting a shot resulting in a goal, measured by the expected goals (xG) metric.
It uses publicly available data from StatsBomb to analyse 10,000 shots from the English Premier League.
The study extends its analysis to data from Spain's La Liga and Germany's Bundesliga, yielding comparable results.
arXiv Detail & Related papers (2023-11-22T21:54:02Z) - ApproxED: Approximate exploitability descent via learned best responses [61.17702187957206]
We study the problem of finding an approximate Nash equilibrium of games with continuous action sets.
We propose two new methods that minimize an approximation of exploitability with respect to the strategy profile.
arXiv Detail & Related papers (2023-01-20T23:55:30Z) - A Machine Learning Approach for Player and Position Adjusted Expected
Goals in Football (Soccer) [0.0]
Expected Goals (xG) allow further insight than just a scoreline.
This paper uses machine learning applications that are developed and applied to Football Event data.
The model successfully predicts xGs probability values for football players based on 15,575 shots.
arXiv Detail & Related papers (2023-01-19T22:17:38Z) - Explainable expected goal models for performance analysis in football
analytics [5.802346990263708]
This paper proposes an accurate expected goal model trained consisting of 315,430 shots from seven seasons between 2014-15 and 2020-21 of the top-five European football leagues.
To best of our knowledge, this is the first paper that demonstrates a practical application of an explainable artificial intelligence tool aggregated profiles.
arXiv Detail & Related papers (2022-06-14T23:56:03Z) - Collusion Detection in Team-Based Multiplayer Games [57.153233321515984]
We propose a system that detects colluding behaviors in team-based multiplayer games.
The proposed method analyzes the players' social relationships paired with their in-game behavioral patterns.
We then automate the detection using Isolation Forest, an unsupervised learning technique specialized in highlighting outliers.
arXiv Detail & Related papers (2022-03-10T02:37:39Z) - CommonsenseQA 2.0: Exposing the Limits of AI through Gamification [126.85096257968414]
We construct benchmarks that test the abilities of modern natural language understanding models.
In this work, we propose gamification as a framework for data construction.
arXiv Detail & Related papers (2022-01-14T06:49:15Z) - Evaluation of soccer team defense based on prediction models of ball
recovery and being attacked [0.8921166277011345]
We propose a method to evaluate team defense based on the prediction of ball recovery and being attacked.
Using data from 45 soccer matches, we examined the relationship between the proposed index and team performance.
arXiv Detail & Related papers (2021-03-17T13:15:41Z) - Unbiased Scene Graph Generation from Biased Training [99.88125954889937]
We present a novel SGG framework based on causal inference but not the conventional likelihood.
We propose to draw the counterfactual causality from the trained graph to infer the effect from the bad bias.
In particular, we use Total Direct Effect (TDE) as the proposed final predicate score for unbiased SGG.
arXiv Detail & Related papers (2020-02-27T07:29:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.