Player Identification in Hockey Broadcast Videos
- URL: http://arxiv.org/abs/2009.02429v2
- Date: Mon, 14 Sep 2020 01:18:30 GMT
- Title: Player Identification in Hockey Broadcast Videos
- Authors: Alvin Chan, Martin D. Levine, Mehrsan Javan
- Abstract summary: We present a deep convolutional neural network approach to solve the problem of hockey player identification in NHL broadcast.
We employ a secondary 1-dimensional convolutional neural network as a late score-level fusion method to classify the output of the ResNet+LSTM network.
This achieves an overall player identification accuracy score over 87% on the test split of our new dataset.
- Score: 18.616544581429835
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a deep recurrent convolutional neural network (CNN) approach to
solve the problem of hockey player identification in NHL broadcast videos.
Player identification is a difficult computer vision problem mainly because of
the players' similar appearance, occlusion, and blurry facial and physical
features. However, we can observe players' jersey numbers over time by
processing variable length image sequences of players (aka 'tracklets'). We
propose an end-to-end trainable ResNet+LSTM network, with a residual network
(ResNet) base and a long short-term memory (LSTM) layer, to discover
spatio-temporal features of jersey numbers over time and learn long-term
dependencies. For this work, we created a new hockey player tracklet dataset
that contains sequences of hockey player bounding boxes. Additionally, we
employ a secondary 1-dimensional convolutional neural network classifier as a
late score-level fusion method to classify the output of the ResNet+LSTM
network. This achieves an overall player identification accuracy score over 87%
on the test split of our new dataset.
Related papers
- SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap [102.5232204867158]
We formalize the task of Game State Reconstruction and introduce SoccerNet-GSR, a novel Game State Reconstruction dataset focusing on football videos.
SoccerNet-GSR is composed of 200 video sequences of 30 seconds, annotated with 9.37 million line points for pitch localization and camera calibration.
Our experiments show that GSR is a challenging novel task, which opens the field for future research.
arXiv Detail & Related papers (2024-04-17T12:53:45Z) - Domain-Guided Masked Autoencoders for Unique Player Identification [62.87054782745536]
Masked autoencoders (MAEs) have emerged as a superior alternative to conventional feature extractors.
Motivated by human vision, we devise a novel domain-guided masking policy for MAEs termed d-MAE.
We conduct experiments on three large-scale sports datasets.
arXiv Detail & Related papers (2024-03-17T20:14:57Z) - Jersey Number Recognition using Keyframe Identification from
Low-Resolution Broadcast Videos [7.776923607006088]
Player identification is a crucial component in vision-driven soccer analytics, enabling various tasks such as player assessment, in-game analysis, and broadcast evaluations.
Previous methods have shown success in image data but struggle with real-world video data, where jersey numbers are not visible in most frames.
We propose a robust downstream identification module that extracts frames containing essential high-level information about the jersey number.
arXiv Detail & Related papers (2023-09-12T14:43:50Z) - Automated player identification and indexing using two-stage deep
learning network [0.23610495849936355]
We propose a deep learning-based player tracking system to automatically track players and index their participation per play in American football games.
It is a two-stage network design to highlight areas of interest and identify jersey number information with high accuracy.
We demonstrate the effectiveness and reliability of player tracking system by analyzing the qualitative and quantitative results on football videos.
arXiv Detail & Related papers (2022-04-26T02:59:03Z) - SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in
Soccer Videos [62.686484228479095]
We propose a novel dataset for multiple object tracking composed of 200 sequences of 30s each.
The dataset is fully annotated with bounding boxes and tracklet IDs.
Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved.
arXiv Detail & Related papers (2022-04-14T12:22:12Z) - Music Generation Using an LSTM [52.77024349608834]
Long Short-Term Memory (LSTM) network structures have proven to be very useful for making predictions for the next output in a series.
We demonstrate an approach of music generation using Recurrent Neural Networks (RNN)
We provide a brief synopsis of the intuition, theory, and application of LSTMs in music generation, develop and present the network we found to best achieve this goal, identify and address issues and challenges faced, and include potential future improvements for our network.
arXiv Detail & Related papers (2022-03-23T00:13:41Z) - Ice hockey player identification via transformers [11.28395713457468]
We introduce a transformer network for recognizing players through their jersey numbers in broadcast National Hockey League (NHL) videos.
The proposed network performs better than the previous benchmark on the dataset used.
We also utilize player shifts available in the NHL play-by-play data by reading the game time using optical character recognition (OCR) to get the players on the ice rink at a certain game time.
arXiv Detail & Related papers (2021-11-22T21:10:26Z) - Keypoint Message Passing for Video-based Person Re-Identification [106.41022426556776]
Video-based person re-identification (re-ID) is an important technique in visual surveillance systems which aims to match video snippets of people captured by different cameras.
Existing methods are mostly based on convolutional neural networks (CNNs), whose building blocks either process local neighbor pixels at a time, or, when 3D convolutions are used to model temporal information, suffer from the misalignment problem caused by person movement.
In this paper, we propose to overcome the limitations of normal convolutions with a human-oriented graph method. Specifically, features located at person joint keypoints are extracted and connected as a spatial-temporal graph
arXiv Detail & Related papers (2021-11-16T08:01:16Z) - Player Tracking and Identification in Ice Hockey [9.577770317771087]
This paper introduces an automated system to track and identify players in broadcast NHL hockey videos.
The system is composed of three components (1) Player tracking, (2) Team identification and (3) Player identification.
For team identification, the away-team jerseys are grouped into a single class and home-team jerseys are grouped in classes according to their jersey color.
A novel player identification model is introduced that utilizes a temporal one-dimensional convolutional network to identify players from player bounding box sequences.
arXiv Detail & Related papers (2021-10-06T22:37:08Z) - Top-1 Solution of Multi-Moments in Time Challenge 2019 [56.15819266653481]
We conduct several experiments with popular Image-Based action recognition methods TRN, TSN, and TSM.
A novel temporal interlacing network is proposed towards fast and accurate recognition.
We ensemble all the above models and achieve 67.22% on the validation set and 60.77% on the test set, which ranks 1st on the final leaderboard.
arXiv Detail & Related papers (2020-03-12T15:11:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.