SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports
Scenes
- URL: http://arxiv.org/abs/2304.05170v2
- Date: Thu, 13 Apr 2023 12:23:36 GMT
- Title: SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports
Scenes
- Authors: Yutao Cui, Chenkai Zeng, Xiaoyu Zhao, Yichun Yang, Gangshan Wu and
Limin Wang
- Abstract summary: We present a new large-scale multi-object tracking dataset in diverse sports scenes, coined as emphSportsMOT.
It consists of 240 video sequences, over 150K frames and over 1.6M bounding boxes collected from 3 sports categories, including basketball, volleyball and football.
We propose a new multi-object tracking framework, termed as emphMixSort, introducing a MixFormer-like structure as an auxiliary association model to prevailing tracking-by-detection trackers.
- Score: 44.46768991505495
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Multi-object tracking in sports scenes plays a critical role in gathering
players statistics, supporting further analysis, such as automatic tactical
analysis. Yet existing MOT benchmarks cast little attention on the domain,
limiting its development. In this work, we present a new large-scale
multi-object tracking dataset in diverse sports scenes, coined as
\emph{SportsMOT}, where all players on the court are supposed to be tracked. It
consists of 240 video sequences, over 150K frames (almost 15\times MOT17) and
over 1.6M bounding boxes (3\times MOT17) collected from 3 sports categories,
including basketball, volleyball and football. Our dataset is characterized
with two key properties: 1) fast and variable-speed motion and 2) similar yet
distinguishable appearance. We expect SportsMOT to encourage the MOT trackers
to promote in both motion-based association and appearance-based association.
We benchmark several state-of-the-art trackers and reveal the key challenge of
SportsMOT lies in object association. To alleviate the issue, we further
propose a new multi-object tracking framework, termed as \emph{MixSort},
introducing a MixFormer-like structure as an auxiliary association model to
prevailing tracking-by-detection trackers. By integrating the customized
appearance-based association with the original motion-based association,
MixSort achieves state-of-the-art performance on SportsMOT and MOT17. Based on
MixSort, we give an in-depth analysis and provide some profound insights into
SportsMOT. The dataset and code will be available at
https://deeperaction.github.io/datasets/sportsmot.html.
Related papers
- Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs [117.67620297750685]
We introduce Walker, the first self-supervised tracker that learns from videos with sparse bounding box annotations, and no tracking labels.
Walker is the first self-supervised tracker to achieve competitive performance on MOT17, DanceTrack, and BDD100K.
arXiv Detail & Related papers (2024-09-25T18:00:00Z) - TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-pitch Videos [11.35998213546475]
Multi-object tracking (MOT) is a critical and challenging task in computer vision.
We introduce TeamTrack, a pioneering benchmark dataset specifically designed for MOT in sports.
TeamTrack is an extensive collection of full-pitch video data from various sports, including soccer, basketball, and handball.
arXiv Detail & Related papers (2024-04-22T04:33:40Z) - Tracking Anything in High Quality [63.63653185865726]
HQTrack is a framework for High Quality Tracking anything in videos.
It consists of a video multi-object segmenter (VMOS) and a mask refiner (MR)
arXiv Detail & Related papers (2023-07-26T06:19:46Z) - Observation Centric and Central Distance Recovery on Sports Player
Tracking [24.396926939889532]
We propose a motionbased tracking algorithm and three post-processing pipelines for three sports including basketball, football, and volleyball.
Our method achieves a HOTA of 73.968, ranking 3rd place on the 2022 Sportsmot workshop final leaderboard.
arXiv Detail & Related papers (2022-09-27T04:48:11Z) - D$^{\bf{3}}$: Duplicate Detection Decontaminator for Multi-Athlete
Tracking in Sports Videos [44.027619577289144]
The duplicate detection is newly and precisely defined as occlusion misreporting on the same athlete by multiple detection boxes in one frame.
To address this problem, we meticulously design a novel transformer-based Detection Decontaminator (D$3$) for training, and a specific algorithm Rally-Hungarian (RH) for matching.
Our model, which is trained only with volleyball videos, can be applied directly to basketball and soccer videos for MAT.
arXiv Detail & Related papers (2022-09-25T15:46:39Z) - SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in
Soccer Videos [62.686484228479095]
We propose a novel dataset for multiple object tracking composed of 200 sequences of 30s each.
The dataset is fully annotated with bounding boxes and tracklet IDs.
Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved.
arXiv Detail & Related papers (2022-04-14T12:22:12Z) - DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse
Motion [56.1428110894411]
We propose a large-scale dataset for multi-human tracking, where humans have similar appearance, diverse motion and extreme articulation.
As the dataset contains mostly group dancing videos, we name it "DanceTrack"
We benchmark several state-of-the-art trackers on our dataset and observe a significant performance drop on DanceTrack when compared against existing benchmarks.
arXiv Detail & Related papers (2021-11-29T16:49:06Z) - SiamMOT: Siamese Multi-Object Tracking [28.97401838563374]
We introduce a region-based Siamese Multi-Object Tracking network, which we name SiamMOT.
SiamMOT includes a motion model that estimates the instance's movement between two frames such that detected instances are associated.
SiamMOT is efficient, and it runs at 17 FPS for 720P videos on a single modern GPU.
arXiv Detail & Related papers (2021-05-25T01:09:26Z) - MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized
Sports Actions [39.27858380391081]
This paper aims to present a new multi-person dataset of atomic-temporal actions, coined as MultiSports.
We build the dataset of MultiSports v1.0 by selecting 4 sports classes, collecting around 3200 video clips, and annotating around 37790 action instances with 907k bounding boxes.
arXiv Detail & Related papers (2021-05-16T10:40:30Z) - MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking [72.76685780516371]
We present MOTChallenge, a benchmark for single-camera Multiple Object Tracking (MOT)
The benchmark is focused on multiple people tracking, since pedestrians are by far the most studied object in the tracking community.
We provide a categorization of state-of-the-art trackers and a broad error analysis.
arXiv Detail & Related papers (2020-10-15T06:52:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.