FOOTPASS: A Multi-Modal Multi-Agent Tactical Context Dataset for Play-by-Play Action Spotting in Soccer Broadcast Videos
- URL: http://arxiv.org/abs/2511.16183v1
- Date: Thu, 20 Nov 2025 09:42:28 GMT
- Title: FOOTPASS: A Multi-Modal Multi-Agent Tactical Context Dataset for Play-by-Play Action Spotting in Soccer Broadcast Videos
- Authors: Jeremie Ochin, Raphael Chekroun, Bogdan Stanciulescu, Sotiris Manitsaris,
- Abstract summary: We introduce Footovision Play-by-Play Spot Actionting in Soccer dataset (FOOTPASS)<n>It is the first benchmark for play-by-play action spotting over entire soccer matches in a multi-agent tactical context.<n>It enables the development of methods for player-centric action spotting that exploit both outputs from computer-vision tasks and prior knowledge of soccer.
- Score: 1.264619835497501
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Soccer video understanding has motivated the creation of datasets for tasks such as temporal action localization, spatiotemporal action detection (STAD), or multiobject tracking (MOT). The annotation of structured sequences of events (who does what, when, and where) used for soccer analytics requires a holistic approach that integrates both STAD and MOT. However, current action recognition methods remain insufficient for constructing reliable play-by-play data and are typically used to assist rather than fully automate annotation. Parallel research has advanced tactical modeling, trajectory forecasting, and performance analysis, all grounded in game-state and play-by-play data. This motivates leveraging tactical knowledge as a prior to support computer-vision-based predictions, enabling more automated and reliable extraction of play-by-play data. We introduce Footovision Play-by-Play Action Spotting in Soccer Dataset (FOOTPASS), the first benchmark for play-by-play action spotting over entire soccer matches in a multi-modal, multi-agent tactical context. It enables the development of methods for player-centric action spotting that exploit both outputs from computer-vision tasks (e.g., tracking, identification) and prior knowledge of soccer, including its tactical regularities over long time horizons, to generate reliable play-by-play data streams. These streams form an essential input for data-driven sports analytics.
Related papers
- Hand Held Multi-Object Tracking Dataset in American Football [9.92798361398834]
Multi-Object Tracking (MOT) plays a critical role in analyzing player behavior from videos, enabling performance evaluation.<n>Current MOT methods are often evaluated using publicly available datasets.<n>No standardized dataset has been publicly available, making comparisons between methods difficult.<n>Our results demonstrate that accurate detection and tracking can be achieved even in crowded scenarios.
arXiv Detail & Related papers (2025-11-12T16:15:18Z) - Velocity Completion Task and Method for Event-based Player Positional Data in Soccer [0.9002260638342727]
Event-based positional data lacks continuous temporal information needed to calculate crucial properties such as velocity.<n>We propose a new method to simultaneously complete the velocity of all agents using only the event-based positional data from team sports.
arXiv Detail & Related papers (2025-05-22T04:01:49Z) - Beyond Pixels: Leveraging the Language of Soccer to Improve Spatio-Temporal Action Detection in Broadcast Videos [1.4249472316161877]
State-of-the-art,temporal action detection methods show promising results for extracting events from broadcast videos.<n>Many false positives could be resolved by considering a broader sequence of actions and game-state information.<n>We address this by reasoning at the game level and improving STAD through the addition of a denoising sequence task.
arXiv Detail & Related papers (2025-05-14T15:05:36Z) - Action Anticipation from SoccerNet Football Video Broadcasts [84.87912817065506]
We introduce the task of action anticipation for football broadcast videos.<n>We predict future actions in unobserved future frames within a five- or ten-second anticipation window.<n>Our work will enable applications in automated broadcasting, tactical analysis, and player decision-making.
arXiv Detail & Related papers (2025-04-16T12:24:33Z) - Deep learning for action spotting in association football videos [64.10841325879996]
The SoccerNet initiative organizes yearly challenges, during which participants from all around the world compete to achieve state-of-the-art performances.
This paper traces the history of action spotting in sports, from the creation of the task back in 2018, to the role it plays today in research and the sports industry.
arXiv Detail & Related papers (2024-10-02T07:56:15Z) - Towards Active Learning for Action Spotting in Association Football
Videos [59.84375958757395]
Analyzing football videos is challenging and requires identifying subtle and diverse-temporal patterns.
Current algorithms face significant challenges when learning from limited annotated data.
We propose an active learning framework that selects the most informative video samples to be annotated next.
arXiv Detail & Related papers (2023-04-09T11:50:41Z) - A Graph-Based Method for Soccer Action Spotting Using Unsupervised
Player Classification [75.93186954061943]
Action spotting involves understanding the dynamics of the game, the complexity of events, and the variation of video sequences.
In this work, we focus on the former by (a) identifying and representing the players, referees, and goalkeepers as nodes in a graph, and by (b) modeling their temporal interactions as sequences of graphs.
For the player identification task, our method obtains an overall performance of 57.83% average-mAP by combining it with other modalities.
arXiv Detail & Related papers (2022-11-22T15:23:53Z) - SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in
Soccer Videos [62.686484228479095]
We propose a novel dataset for multiple object tracking composed of 200 sequences of 30s each.
The dataset is fully annotated with bounding boxes and tracklet IDs.
Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved.
arXiv Detail & Related papers (2022-04-14T12:22:12Z) - Temporally-Aware Feature Pooling for Action Spotting in Soccer
Broadcasts [86.56462654572813]
We focus our analysis on action spotting in soccer broadcast, which consists in temporally localizing the main actions in a soccer game.
We propose a novel feature pooling method based on NetVLAD, dubbed NetVLAD++, that embeds temporally-aware knowledge.
We train and evaluate our methodology on the recent large-scale dataset SoccerNet-v2, reaching 53.4% Average-mAP for action spotting.
arXiv Detail & Related papers (2021-04-14T11:09:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.