Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
- URL: http://arxiv.org/abs/2406.01033v1
- Date: Mon, 3 Jun 2024 06:35:11 GMT
- Title: Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
- Authors: Yung-Hui Lin, Yu-Wen Chang, Huang-Chia Shih, Takahiro Ogawa,
- Abstract summary: Jersey number recognition (JNR) has always been an important task in sports analytics.
Recent research has addressed these problems using number localization and optical character recognition.
This paper proposes a multi-task learning method called the angle-digit scheme (ADRS), which combines human body orientation angles and digit number clues to recognize athletic jersey numbers.
- Score: 12.058303459124003
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Jersey number recognition (JNR) has always been an important task in sports analytics. Improving recognition accuracy remains an ongoing challenge because images are subject to blurring, occlusion, deformity, and low resolution. Recent research has addressed these problems using number localization and optical character recognition. Some approaches apply player identification schemes to image sequences, ignoring the impact of human body rotation angles on jersey digit identification. Accurately predicting the number of jersey digits by using a multi-task scheme to recognize each individual digit enables more robust results. Based on the above considerations, this paper proposes a multi-task learning method called the angle-digit refine scheme (ADRS), which combines human body orientation angles and digit number clues to recognize athletic jersey numbers. Based on our experimental results, our approach increases inference information, significantly improving prediction accuracy. Compared to state-of-the-art methods, which can only handle a single type of sport, the proposed method produces a more diverse and practical JNR application. The incorporation of diverse types of team sports such as soccer, football, basketball, volleyball, and baseball into our dataset contributes greatly to generalized JNR in sports analytics. Our accuracy achieves 64.07% on Top-1 and 89.97% on Top-2, with corresponding F1 scores of 67.46% and 90.64%, respectively.
Related papers
- A General Framework for Jersey Number Recognition in Sports Video [5.985204759362746]
Jersey number recognition is an important task in sports video analysis, partly due to its importance for long-term player tracking.
Here we introduce a novel public jersey number recognition dataset for hockey and study how scene text recognition methods can be adapted to this problem.
We demonstrate high performance on image- and tracklet-level tasks, achieving 91.4% accuracy for hockey images and 87.4% for soccer tracklets.
arXiv Detail & Related papers (2024-05-22T18:08:26Z) - Domain-Guided Masked Autoencoders for Unique Player Identification [62.87054782745536]
Masked autoencoders (MAEs) have emerged as a superior alternative to conventional feature extractors.
Motivated by human vision, we devise a novel domain-guided masking policy for MAEs termed d-MAE.
We conduct experiments on three large-scale sports datasets.
arXiv Detail & Related papers (2024-03-17T20:14:57Z) - Whole-body Detection, Recognition and Identification at Altitude and
Range [57.445372305202405]
We propose an end-to-end system evaluated on diverse datasets.
Our approach involves pre-training the detector on common image datasets and fine-tuning it on BRIAR's complex videos and images.
We conduct thorough evaluations under various conditions, such as different ranges and angles in indoor, outdoor, and aerial scenarios.
arXiv Detail & Related papers (2023-11-09T20:20:23Z) - Jersey Number Recognition using Keyframe Identification from
Low-Resolution Broadcast Videos [7.776923607006088]
Player identification is a crucial component in vision-driven soccer analytics, enabling various tasks such as player assessment, in-game analysis, and broadcast evaluations.
Previous methods have shown success in image data but struggle with real-world video data, where jersey numbers are not visible in most frames.
We propose a robust downstream identification module that extracts frames containing essential high-level information about the jersey number.
arXiv Detail & Related papers (2023-09-12T14:43:50Z) - SwinFace: A Multi-task Transformer for Face Recognition, Expression
Recognition, Age Estimation and Attribute Estimation [60.94239810407917]
This paper presents a multi-purpose algorithm for simultaneous face recognition, facial expression recognition, age estimation, and face attribute estimation based on a single Swin Transformer.
To address the conflicts among multiple tasks, a Multi-Level Channel Attention (MLCA) module is integrated into each task-specific analysis.
Experiments show that the proposed model has a better understanding of the face and achieves excellent performance for all tasks.
arXiv Detail & Related papers (2023-08-22T15:38:39Z) - From Global to Local: Multi-scale Out-of-distribution Detection [129.37607313927458]
Out-of-distribution (OOD) detection aims to detect "unknown" data whose labels have not been seen during the in-distribution (ID) training process.
Recent progress in representation learning gives rise to distance-based OOD detection.
We propose Multi-scale OOD DEtection (MODE), a first framework leveraging both global visual information and local region details.
arXiv Detail & Related papers (2023-08-20T11:56:25Z) - Sports Re-ID: Improving Re-Identification Of Players In Broadcast Videos
Of Team Sports [0.0]
This work focuses on player re-identification in broadcast videos of team sports.
Specifically, we focus on identifying the same player in images captured from different camera viewpoints during any given moment of a match.
arXiv Detail & Related papers (2022-06-06T06:06:23Z) - Knock, knock. Who's there? -- Identifying football player jersey numbers
with synthetic data [0.0]
We present a novel approach for jersey number identification in a small, highly imbalanced dataset from the Seattle Seahawks practice videos.
Our results indicate that simple models can achieve an acceptable performance on the jersey number detection task and that synthetic data can improve the performance dramatically.
arXiv Detail & Related papers (2022-03-01T20:44:34Z) - Multi-task learning for jersey number recognition in Ice Hockey [9.316018917078049]
We have designed and implemented a multi-task learning network for jersey number recognition.
The proposed network learns both holistic and digit-wise representations through a multi-task loss function.
Experimental results demonstrate that the proposed multi-task learning network performs better than the constituent holistic and digit-wise single-task learning networks.
arXiv Detail & Related papers (2021-08-17T19:33:28Z) - MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors.
To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network.
Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z) - Diverse Knowledge Distillation for End-to-End Person Search [81.4926655119318]
Person search aims to localize and identify a specific person from a gallery of images.
Recent methods can be categorized into two groups, i.e., two-step and end-to-end approaches.
We propose a simple yet strong end-to-end network with diverse knowledge distillation to break the bottleneck.
arXiv Detail & Related papers (2020-12-21T09:04:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.