Multi-task learning for jersey number recognition in Ice Hockey
- URL: http://arxiv.org/abs/2108.07848v1
- Date: Tue, 17 Aug 2021 19:33:28 GMT
- Title: Multi-task learning for jersey number recognition in Ice Hockey
- Authors: Kanav Vats, Mehrnaz Fani, David A. Clausi and John Zelek
- Abstract summary: We have designed and implemented a multi-task learning network for jersey number recognition.
The proposed network learns both holistic and digit-wise representations through a multi-task loss function.
Experimental results demonstrate that the proposed multi-task learning network performs better than the constituent holistic and digit-wise single-task learning networks.
- Score: 9.316018917078049
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Identifying players in sports videos by recognizing their jersey numbers is a
challenging task in computer vision. We have designed and implemented a
multi-task learning network for jersey number recognition. In order to train a
network to recognize jersey numbers, two output label representations are used
(1) Holistic - considers the entire jersey number as one class, and (2)
Digit-wise - considers the two digits in a jersey number as two separate
classes. The proposed network learns both holistic and digit-wise
representations through a multi-task loss function. We determine the optimal
weights to be assigned to holistic and digit-wise losses through an ablation
study. Experimental results demonstrate that the proposed multi-task learning
network performs better than the constituent holistic and digit-wise
single-task learning networks.
Related papers
- Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement [12.058303459124003]
Jersey number recognition (JNR) has always been an important task in sports analytics.
Recent research has addressed these problems using number localization and optical character recognition.
This paper proposes a multi-task learning method called the angle-digit scheme (ADRS), which combines human body orientation angles and digit number clues to recognize athletic jersey numbers.
arXiv Detail & Related papers (2024-06-03T06:35:11Z) - A General Framework for Jersey Number Recognition in Sports Video [5.985204759362746]
Jersey number recognition is an important task in sports video analysis, partly due to its importance for long-term player tracking.
Here we introduce a novel public jersey number recognition dataset for hockey and study how scene text recognition methods can be adapted to this problem.
We demonstrate high performance on image- and tracklet-level tasks, achieving 91.4% accuracy for hockey images and 87.4% for soccer tracklets.
arXiv Detail & Related papers (2024-05-22T18:08:26Z) - Jersey Number Recognition using Keyframe Identification from
Low-Resolution Broadcast Videos [7.776923607006088]
Player identification is a crucial component in vision-driven soccer analytics, enabling various tasks such as player assessment, in-game analysis, and broadcast evaluations.
Previous methods have shown success in image data but struggle with real-world video data, where jersey numbers are not visible in most frames.
We propose a robust downstream identification module that extracts frames containing essential high-level information about the jersey number.
arXiv Detail & Related papers (2023-09-12T14:43:50Z) - Multitask Learning with No Regret: from Improved Confidence Bounds to
Active Learning [79.07658065326592]
Quantifying uncertainty in the estimated tasks is of pivotal importance for many downstream applications, such as online or active learning.
We provide novel multitask confidence intervals in the challenging setting when neither the similarity between tasks nor the tasks' features are available to the learner.
We propose a novel online learning algorithm that achieves such improved regret without knowing this parameter in advance.
arXiv Detail & Related papers (2023-08-03T13:08:09Z) - Automated player identification and indexing using two-stage deep
learning network [0.23610495849936355]
We propose a deep learning-based player tracking system to automatically track players and index their participation per play in American football games.
It is a two-stage network design to highlight areas of interest and identify jersey number information with high accuracy.
We demonstrate the effectiveness and reliability of player tracking system by analyzing the qualitative and quantitative results on football videos.
arXiv Detail & Related papers (2022-04-26T02:59:03Z) - Multi-Task Learning with Sequence-Conditioned Transporter Networks [67.57293592529517]
We aim to solve multi-task learning through the lens of sequence-conditioning and weighted sampling.
We propose a new suite of benchmark aimed at compositional tasks, MultiRavens, which allows defining custom task combinations.
Second, we propose a vision-based end-to-end system architecture, Sequence-Conditioned Transporter Networks, which augments Goal-Conditioned Transporter Networks with sequence-conditioning and weighted sampling.
arXiv Detail & Related papers (2021-09-15T21:19:11Z) - Multi-script Handwritten Digit Recognition Using Multi-task Learning [2.8698937226234795]
It is not very common for multi-script digit recognition which encourage the development of robust and multipurpose systems.
In this study multi-script handwritten digit recognition using multi-task learning will be investigated.
The handwritten digits of three scripts including Latin, Arabic and Kannada are studied to show that multi-task models with reformulation of the individual tasks have shown promising results.
arXiv Detail & Related papers (2021-06-15T16:30:37Z) - A robot that counts like a child: a developmental model of counting and
pointing [69.26619423111092]
A novel neuro-robotics model capable of counting real items is introduced.
The model allows us to investigate the interaction between embodiment and numerical cognition.
The trained model is able to count a set of items and at the same time points to them.
arXiv Detail & Related papers (2020-08-05T21:06:27Z) - ResNeSt: Split-Attention Networks [86.25490825631763]
We present a modularized architecture, which applies the channel-wise attention on different network branches to leverage their success in capturing cross-feature interactions and learning diverse representations.
Our model, named ResNeSt, outperforms EfficientNet in accuracy and latency trade-off on image classification.
arXiv Detail & Related papers (2020-04-19T20:40:31Z) - MTI-Net: Multi-Scale Task Interaction Networks for Multi-Task Learning [82.62433731378455]
We show that tasks with high affinity at a certain scale are not guaranteed to retain this behaviour at other scales.
We propose a novel architecture, namely MTI-Net, that builds upon this finding.
arXiv Detail & Related papers (2020-01-19T21:02:36Z) - Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based
Object Re-Identification [93.39253443415392]
We propose exploiting the multi-shots of the same identity to guide the feature learning of each individual image.
It consists of a teacher network (T-net) that learns the comprehensive features from multiple images of the same object, and a student network (S-net) that takes a single image as input.
We validate the effectiveness of our approach on the popular vehicle re-id and person re-id datasets.
arXiv Detail & Related papers (2020-01-15T09:39:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.