A General Framework for Jersey Number Recognition in Sports Video
- URL: http://arxiv.org/abs/2405.13896v1
- Date: Wed, 22 May 2024 18:08:26 GMT
- Title: A General Framework for Jersey Number Recognition in Sports Video
- Authors: Maria Koshkina, James H. Elder,
- Abstract summary: Jersey number recognition is an important task in sports video analysis, partly due to its importance for long-term player tracking.
Here we introduce a novel public jersey number recognition dataset for hockey and study how scene text recognition methods can be adapted to this problem.
We demonstrate high performance on image- and tracklet-level tasks, achieving 91.4% accuracy for hockey images and 87.4% for soccer tracklets.
- Score: 5.985204759362746
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Jersey number recognition is an important task in sports video analysis, partly due to its importance for long-term player tracking. It can be viewed as a variant of scene text recognition. However, there is a lack of published attempts to apply scene text recognition models on jersey number data. Here we introduce a novel public jersey number recognition dataset for hockey and study how scene text recognition methods can be adapted to this problem. We address issues of occlusions and assess the degree to which training on one sport (hockey) can be generalized to another (soccer). For the latter, we also consider how jersey number recognition at the single-image level can be aggregated across frames to yield tracklet-level jersey number labels. We demonstrate high performance on image- and tracklet-level tasks, achieving 91.4% accuracy for hockey images and 87.4% for soccer tracklets. Code, models, and data are available at https://github.com/mkoshkina/jersey-number-pipeline.
Related papers
- Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement [12.058303459124003]
Jersey number recognition (JNR) has always been an important task in sports analytics.
Recent research has addressed these problems using number localization and optical character recognition.
This paper proposes a multi-task learning method called the angle-digit scheme (ADRS), which combines human body orientation angles and digit number clues to recognize athletic jersey numbers.
arXiv Detail & Related papers (2024-06-03T06:35:11Z) - SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap [102.5232204867158]
We formalize the task of Game State Reconstruction and introduce SoccerNet-GSR, a novel Game State Reconstruction dataset focusing on football videos.
SoccerNet-GSR is composed of 200 video sequences of 30 seconds, annotated with 9.37 million line points for pitch localization and camera calibration.
Our experiments show that GSR is a challenging novel task, which opens the field for future research.
arXiv Detail & Related papers (2024-04-17T12:53:45Z) - Domain-Guided Masked Autoencoders for Unique Player Identification [62.87054782745536]
Masked autoencoders (MAEs) have emerged as a superior alternative to conventional feature extractors.
Motivated by human vision, we devise a novel domain-guided masking policy for MAEs termed d-MAE.
We conduct experiments on three large-scale sports datasets.
arXiv Detail & Related papers (2024-03-17T20:14:57Z) - Jersey Number Recognition using Keyframe Identification from
Low-Resolution Broadcast Videos [7.776923607006088]
Player identification is a crucial component in vision-driven soccer analytics, enabling various tasks such as player assessment, in-game analysis, and broadcast evaluations.
Previous methods have shown success in image data but struggle with real-world video data, where jersey numbers are not visible in most frames.
We propose a robust downstream identification module that extracts frames containing essential high-level information about the jersey number.
arXiv Detail & Related papers (2023-09-12T14:43:50Z) - GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for
Real-time Soccer Commentary Generation [75.60413443783953]
We present GOAL, a benchmark of over 8.9k soccer video clips, 22k sentences, and 42k knowledge triples for proposing a challenging new task setting as Knowledge-grounded Video Captioning (KGVC)
Our data and code are available at https://github.com/THU-KEG/goal.
arXiv Detail & Related papers (2023-03-26T08:43:36Z) - P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos [64.57435509822416]
This work consists of 2,721 video clips collected from the broadcasting videos of professional table tennis matches in World Table Tennis Championships and Olympiads.
We formulate two sets of action detection problems -- emphaction localization and emphaction recognition.
The results confirm that TheName is still a challenging task and can be used as a special benchmark for dense action detection from videos.
arXiv Detail & Related papers (2022-07-26T08:34:17Z) - A Survey on Video Action Recognition in Sports: Datasets, Methods and
Applications [60.3327085463545]
We present a survey on video action recognition for sports analytics.
We introduce more than ten types of sports, including team sports, such as football, basketball, volleyball, hockey and individual sports, such as figure skating, gymnastics, table tennis, diving and badminton.
We develop a toolbox using PaddlePaddle, which supports football, basketball, table tennis and figure skating action recognition.
arXiv Detail & Related papers (2022-06-02T13:19:36Z) - SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in
Soccer Videos [62.686484228479095]
We propose a novel dataset for multiple object tracking composed of 200 sequences of 30s each.
The dataset is fully annotated with bounding boxes and tracklet IDs.
Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved.
arXiv Detail & Related papers (2022-04-14T12:22:12Z) - Knock, knock. Who's there? -- Identifying football player jersey numbers
with synthetic data [0.0]
We present a novel approach for jersey number identification in a small, highly imbalanced dataset from the Seattle Seahawks practice videos.
Our results indicate that simple models can achieve an acceptable performance on the jersey number detection task and that synthetic data can improve the performance dramatically.
arXiv Detail & Related papers (2022-03-01T20:44:34Z) - Ice hockey player identification via transformers [11.28395713457468]
We introduce a transformer network for recognizing players through their jersey numbers in broadcast National Hockey League (NHL) videos.
The proposed network performs better than the previous benchmark on the dataset used.
We also utilize player shifts available in the NHL play-by-play data by reading the game time using optical character recognition (OCR) to get the players on the ice rink at a certain game time.
arXiv Detail & Related papers (2021-11-22T21:10:26Z) - Player Identification in Hockey Broadcast Videos [18.616544581429835]
We present a deep convolutional neural network approach to solve the problem of hockey player identification in NHL broadcast.
We employ a secondary 1-dimensional convolutional neural network as a late score-level fusion method to classify the output of the ResNet+LSTM network.
This achieves an overall player identification accuracy score over 87% on the test split of our new dataset.
arXiv Detail & Related papers (2020-09-05T01:30:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.