A Computer Vision Framework for Multi-Class Detection and Tracking in Soccer Broadcast Footage
- URL: http://arxiv.org/abs/2602.18504v1
- Date: Tue, 17 Feb 2026 21:44:09 GMT
- Title: A Computer Vision Framework for Multi-Class Detection and Tracking in Soccer Broadcast Footage
- Authors: Daniel Tshiani,
- Abstract summary: This paper examines whether such data can instead be extracted directly from standard broadcast footage using a single-camera computer vision pipeline.<n>This project develops an end-to-end system that combines a YOLO object detector with the ByteTrack tracking algorithm to identify and track players, referees, goalkeepers, and the ball throughout a match.<n> Experimental results show that the pipeline achieves high performance in detecting and tracking players and officials, with strong precision, recall, and mAP50 scores, while ball detection remains the primary challenge.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Clubs with access to expensive multi-camera setups or GPS tracking systems gain a competitive advantage through detailed data, whereas lower-budget teams are often unable to collect similar information. This paper examines whether such data can instead be extracted directly from standard broadcast footage using a single-camera computer vision pipeline. This project develops an end-to-end system that combines a YOLO object detector with the ByteTrack tracking algorithm to identify and track players, referees, goalkeepers, and the ball throughout a match. Experimental results show that the pipeline achieves high performance in detecting and tracking players and officials, with strong precision, recall, and mAP50 scores, while ball detection remains the primary challenge. Despite this limitation, our findings demonstrate that AI can extract meaningful player-level spatial information from a single broadcast camera. By reducing reliance on specialized hardware, the proposed approach enables colleges, academies, and amateur clubs to adopt scalable, data-driven analysis methods previously accessible only to professional teams, highlighting the potential for affordable computer vision-based soccer analytics.
Related papers
- SoccerNet 2025 Challenges Results [205.71032061537747]
SoccerNet 2025 Challenges mark the fifth annual edition of the SoccerNet open effort, dedicated to advancing computer vision research in football video understanding.<n>This year's challenges span four vision-based tasks: Team Ball Action Spotting, Monocular Depth Estimation, Multi-View Foul Recognition, and Game State Reconstruction.<n>Report presents the results of each challenge, highlights the top-performing solutions, and provides insights into the progress made by the community.
arXiv Detail & Related papers (2025-08-26T16:37:07Z) - Continuous football player tracking from discrete broadcast data [0.6144680854063939]
We present a method that can estimate continuous full-pitch tracking data from discrete data made from broadcast footage.
Such data could be collected by clubs or players at a similar cost to event data, which is widely available down to semi-professional level.
arXiv Detail & Related papers (2023-11-24T18:16:28Z) - SpikeMOT: Event-based Multi-Object Tracking with Sparse Motion Features [52.213656737672935]
SpikeMOT is an event-based multi-object tracker.
SpikeMOT uses spiking neural networks to extract sparsetemporal features from event streams associated with objects.
arXiv Detail & Related papers (2023-09-29T05:13:43Z) - Design and Implementation of A Soccer Ball Detection System with
Multiple Cameras [15.399112952297335]
This paper designed and implemented football detection system under multiple cameras for the detection and capture of targets in real-time matches.
The main work mainly consists of three parts, football detector, single camera detection, and multi-cameras detection.
By testing the system, it shows that the system can accurately detect and capture the moving targets in 3D.
arXiv Detail & Related papers (2023-01-31T22:04:53Z) - Unifying Tracking and Image-Video Object Detection [54.91658924277527]
TrIVD (Tracking and Image-Video Detection) is the first framework that unifies image OD, video OD, and MOT within one end-to-end model.
To handle the discrepancies and semantic overlaps of category labels, TrIVD formulates detection/tracking as grounding and reasons about object categories.
arXiv Detail & Related papers (2022-11-20T20:30:28Z) - Graph-Based Multi-Camera Soccer Player Tracker [1.6244541005112743]
The paper presents a multi-camera tracking method intended for tracking soccer players in long shot video recordings from multiple calibrated cameras installed around the playing field.
The large distance to the camera makes it difficult to visually distinguish individual players, which adversely affects the performance of traditional solutions.
Our method focuses on individual player dynamics and interactions between neighborhood players to improve tracking performance.
arXiv Detail & Related papers (2022-11-03T20:01:48Z) - Scalable and Real-time Multi-Camera Vehicle Detection,
Re-Identification, and Tracking [58.95210121654722]
We propose a real-time city-scale multi-camera vehicle tracking system that handles real-world, low-resolution CCTV instead of idealized and curated video streams.
Our method is ranked among the top five performers on the public leaderboard.
arXiv Detail & Related papers (2022-04-15T12:47:01Z) - SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in
Soccer Videos [62.686484228479095]
We propose a novel dataset for multiple object tracking composed of 200 sequences of 30s each.
The dataset is fully annotated with bounding boxes and tracklet IDs.
Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved.
arXiv Detail & Related papers (2022-04-14T12:22:12Z) - Self-Supervised Small Soccer Player Detection and Tracking [8.851964372308801]
State-of-the-art tracking algorithms achieve impressive results in scenarios on which they have been trained for, but they fail in challenging ones such as soccer games.
This is frequently due to the player small relative size and the similar appearance among players of the same team.
We propose a self-supervised pipeline which is able to detect and track low-resolution soccer players under different recording conditions without any need of ground-truth data.
arXiv Detail & Related papers (2020-11-20T10:57:18Z) - Detection and Tracking Meet Drones Challenge [131.31749447313197]
This paper presents a review of object detection and tracking datasets and benchmarks, and discusses the challenges of collecting large-scale drone-based object detection and tracking datasets with manual annotations.
We describe our VisDrone dataset, which is captured over various urban/suburban areas of 14 different cities across China from North to South.
We provide a detailed analysis of the current state of the field of large-scale object detection and tracking on drones, and conclude the challenge as well as propose future directions.
arXiv Detail & Related papers (2020-01-16T00:11:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.