No Train Yet Gain: Towards Generic Multi-Object Tracking in Sports and Beyond
- URL: http://arxiv.org/abs/2506.01373v1
- Date: Mon, 02 Jun 2025 07:00:15 GMT
- Title: No Train Yet Gain: Towards Generic Multi-Object Tracking in Sports and Beyond
- Authors: Tomasz Stanczyk, Seongro Yoon, Francois Bremond,
- Abstract summary: Multi-object tracking (MOT) is essential for sports analytics, enabling performance evaluation and tactical insights.<n>Traditional tracking-by-detection methods require extensive tuning, while segmentation-based approaches struggle with track processing.<n>We propose McByte, a tracking-by-detection framework that integrates temporally propagated segmentation mask as an association cue to improve robustness without per-video tuning.
- Score: 1.0806835533814036
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-object tracking (MOT) is essential for sports analytics, enabling performance evaluation and tactical insights. However, tracking in sports is challenging due to fast movements, occlusions, and camera shifts. Traditional tracking-by-detection methods require extensive tuning, while segmentation-based approaches struggle with track processing. We propose McByte, a tracking-by-detection framework that integrates temporally propagated segmentation mask as an association cue to improve robustness without per-video tuning. Unlike many existing methods, McByte does not require training, relying solely on pre-trained models and object detectors commonly used in the community. Evaluated on SportsMOT, DanceTrack, SoccerNet-tracking 2022 and MOT17, McByte demonstrates strong performance across sports and general pedestrian tracking. Our results highlight the benefits of mask propagation for a more adaptable and generalizable MOT approach. Code will be made available at https://github.com/tstanczyk95/McByte.
Related papers
- CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking [68.24998698508344]
We introduce CAMEL, a novel association module for Context-Aware Multi-Cue ExpLoitation.<n>Unlike end-to-end detection-by-tracking approaches, our method remains lightweight and fast to train while being able to leverage external off-the-shelf models.<n>Our proposed online tracking pipeline, CAMELTrack, achieves state-of-the-art performance on multiple tracking benchmarks.
arXiv Detail & Related papers (2025-05-02T13:26:23Z) - Masks and Boxes: Combining the Best of Both Worlds for Multi-Object Tracking [1.2145800134384477]
Multi-object tracking (MOT) involves identifying and consistently tracking objects across video sequences.
Traditional tracking-by-detection methods require extensive tuning and lack generalizability.
We propose a novel approach, McByte, which incorporates a temporally propagated segmentation mask as a strong association cue.
arXiv Detail & Related papers (2024-09-21T18:52:07Z) - Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking [52.04679257903805]
Joint Detection and Embedding (JDE) trackers have demonstrated excellent performance in Multi-Object Tracking (MOT) tasks.
Our tracker, named TCBTrack, achieves state-of-the-art performance on multiple public benchmarks.
arXiv Detail & Related papers (2024-07-19T07:48:45Z) - Sports-Traj: A Unified Trajectory Generation Model for Multi-Agent Movement in Sports [53.637837706712794]
We propose a Unified Trajectory Generation model, UniTraj, that processes arbitrary trajectories as masked inputs.<n>Specifically, we introduce a Ghost Spatial Masking (GSM) module, embedded within a Transformer encoder, for spatial feature extraction.<n>We benchmark three practical sports datasets, Basketball-U, Football-U, and Soccer-U, for evaluation.
arXiv Detail & Related papers (2024-05-27T22:15:23Z) - TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-pitch Videos [11.35998213546475]
Multi-object tracking (MOT) is a critical and challenging task in computer vision.
We introduce TeamTrack, a pioneering benchmark dataset specifically designed for MOT in sports.
TeamTrack is an extensive collection of full-pitch video data from various sports, including soccer, basketball, and handball.
arXiv Detail & Related papers (2024-04-22T04:33:40Z) - Tracking Anything in High Quality [63.63653185865726]
HQTrack is a framework for High Quality Tracking anything in videos.
It consists of a video multi-object segmenter (VMOS) and a mask refiner (MR)
arXiv Detail & Related papers (2023-07-26T06:19:46Z) - CoTracker: It is Better to Track Together [70.63040730154984]
CoTracker is a transformer-based model that tracks a large number of 2D points in long video sequences.
We show that joint tracking significantly improves tracking accuracy and robustness, and allows CoTracker to track occluded points and points outside of the camera view.
arXiv Detail & Related papers (2023-07-14T21:13:04Z) - Iterative Scale-Up ExpansionIoU and Deep Features Association for
Multi-Object Tracking in Sports [26.33239898091364]
We propose a novel online and robust multi-object tracking approach named deep ExpansionIoU (Deep-EIoU) for sports scenarios.
Unlike conventional methods, we abandon the use of the Kalman filter and leverage the iterative scale-up ExpansionIoU and deep features for robust tracking in sports scenarios.
Our proposed method demonstrates remarkable effectiveness in tracking irregular motion objects, achieving a score of 77.2% on the SportsMOT dataset and 85.4% on the SoccerNet-Tracking dataset.
arXiv Detail & Related papers (2023-06-22T17:47:08Z) - SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports
Scenes [44.46768991505495]
We present a new large-scale multi-object tracking dataset in diverse sports scenes, coined as emphSportsMOT.
It consists of 240 video sequences, over 150K frames and over 1.6M bounding boxes collected from 3 sports categories, including basketball, volleyball and football.
We propose a new multi-object tracking framework, termed as emphMixSort, introducing a MixFormer-like structure as an auxiliary association model to prevailing tracking-by-detection trackers.
arXiv Detail & Related papers (2023-04-11T12:07:31Z) - Observation Centric and Central Distance Recovery on Sports Player
Tracking [24.396926939889532]
We propose a motionbased tracking algorithm and three post-processing pipelines for three sports including basketball, football, and volleyball.
Our method achieves a HOTA of 73.968, ranking 3rd place on the 2022 Sportsmot workshop final leaderboard.
arXiv Detail & Related papers (2022-09-27T04:48:11Z) - SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in
Soccer Videos [62.686484228479095]
We propose a novel dataset for multiple object tracking composed of 200 sequences of 30s each.
The dataset is fully annotated with bounding boxes and tracklet IDs.
Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved.
arXiv Detail & Related papers (2022-04-14T12:22:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.