Sports Re-ID: Improving Re-Identification Of Players In Broadcast Videos
Of Team Sports
- URL: http://arxiv.org/abs/2206.02373v1
- Date: Mon, 6 Jun 2022 06:06:23 GMT
- Title: Sports Re-ID: Improving Re-Identification Of Players In Broadcast Videos
Of Team Sports
- Authors: Bharath Comandur
- Abstract summary: This work focuses on player re-identification in broadcast videos of team sports.
Specifically, we focus on identifying the same player in images captured from different camera viewpoints during any given moment of a match.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This work focuses on player re-identification in broadcast videos of team
sports. Specifically, we focus on identifying the same player in images
captured from different camera viewpoints during any given moment of a match.
This task differs from traditional applications of person re-id in a few
important ways. Firstly, players from the same team wear highly similar
clothes, thereby making it harder to tell them apart. Secondly, there are only
a few number of samples for each identity, which makes it harder to train a
re-id system. Thirdly, the resolutions of the images are often quite low and
vary a lot. This combined with heavy occlusions and fast movements of players
greatly increase the challenges for re-id. In this paper, we propose a simple
but effective hierarchical data sampling procedure and a centroid loss function
that, when used together, increase the mean average precision (mAP) by 7 - 11.5
and the rank-1 (R1) by 8.8 - 14.9 without any change in the network or
hyper-parameters used. Our data sampling procedure improves the similarity of
the training and test distributions, and thereby aids in creating better
estimates of the centroids of the embeddings (or feature vectors).
Surprisingly, our study shows that in the presence of severely limited data, as
is the case for our application, a simple centroid loss function based on
euclidean distances significantly outperforms the popular triplet-centroid loss
function. We show comparable improvements for both convolutional networks and
vision transformers. Our approach is among the top ranked methods in the
SoccerNet Re-Identification Challenge 2022 leaderboard (test-split) with a mAP
of 86.0 and a R1 of 81.5. On the sequestered challenge split, we achieve an mAP
of 84.9 and a R1 of 80.1. Research on re-id for sports-related applications is
very limited and our work presents one of the first discussions in the
literature on this.
Related papers
- Domain-Guided Masked Autoencoders for Unique Player Identification [62.87054782745536]
Masked autoencoders (MAEs) have emerged as a superior alternative to conventional feature extractors.
Motivated by human vision, we devise a novel domain-guided masking policy for MAEs termed d-MAE.
We conduct experiments on three large-scale sports datasets.
arXiv Detail & Related papers (2024-03-17T20:14:57Z) - AaP-ReID: Improved Attention-Aware Person Re-identification [2.5761958263376745]
AaP-ReID is a more effective method for person ReID that incorporates channel-wise attention into a ResNet-based architecture.
Our method incorporates the Channel-Wise Attention Bottleneck block and can learn discriminating features by dynamically adjusting the importance ofeach channel in the feature maps.
arXiv Detail & Related papers (2023-09-27T16:54:38Z) - CLIP-ReIdent: Contrastive Training for Player Re-Identification [0.0]
We investigate whether it is possible to transfer the out-standing zero-shot performance of pre-trained CLIP models to the domain of player re-identification.
Unlike previous work, our approach is entirely class-agnostic and benefits from large-scale pre-training.
arXiv Detail & Related papers (2023-03-21T13:55:27Z) - A Graph-Based Method for Soccer Action Spotting Using Unsupervised
Player Classification [75.93186954061943]
Action spotting involves understanding the dynamics of the game, the complexity of events, and the variation of video sequences.
In this work, we focus on the former by (a) identifying and representing the players, referees, and goalkeepers as nodes in a graph, and by (b) modeling their temporal interactions as sequences of graphs.
For the player identification task, our method obtains an overall performance of 57.83% average-mAP by combining it with other modalities.
arXiv Detail & Related papers (2022-11-22T15:23:53Z) - D$^{\bf{3}}$: Duplicate Detection Decontaminator for Multi-Athlete
Tracking in Sports Videos [44.027619577289144]
The duplicate detection is newly and precisely defined as occlusion misreporting on the same athlete by multiple detection boxes in one frame.
To address this problem, we meticulously design a novel transformer-based Detection Decontaminator (D$3$) for training, and a specific algorithm Rally-Hungarian (RH) for matching.
Our model, which is trained only with volleyball videos, can be applied directly to basketball and soccer videos for MAT.
arXiv Detail & Related papers (2022-09-25T15:46:39Z) - On the Unreasonable Effectiveness of Centroids in Image Retrieval [0.1933681537640272]
We propose to use the mean centroid representation both during training and retrieval.
As each class is represented by a single embedding - the class centroid - both retrieval time and storage requirements are reduced significantly.
arXiv Detail & Related papers (2021-04-28T08:57:57Z) - Camera-aware Proxies for Unsupervised Person Re-Identification [60.26031011794513]
This paper tackles the purely unsupervised person re-identification (Re-ID) problem that requires no annotations.
We propose to split each single cluster into multiple proxies and each proxy represents the instances coming from the same camera.
Based on the camera-aware proxies, we design both intra- and inter-camera contrastive learning components for our Re-ID model.
arXiv Detail & Related papers (2020-12-19T12:37:04Z) - Unsupervised Pre-training for Person Re-identification [90.98552221699508]
We present a large scale unlabeled person re-identification (Re-ID) dataset "LUPerson"
We make the first attempt of performing unsupervised pre-training for improving the generalization ability of the learned person Re-ID feature representation.
arXiv Detail & Related papers (2020-12-07T14:48:26Z) - Unsupervised Temporal Feature Aggregation for Event Detection in
Unstructured Sports Videos [10.230408415438966]
We study the case of event detection in sports videos for unstructured environments with arbitrary camera angles.
We identify and solve two major problems: unsupervised identification of players in an unstructured setting and generalization of the trained models to pose variations due to arbitrary shooting angles.
arXiv Detail & Related papers (2020-02-19T10:24:22Z) - Intra-Camera Supervised Person Re-Identification [87.88852321309433]
We propose a novel person re-identification paradigm based on an idea of independent per-camera identity annotation.
This eliminates the most time-consuming and tedious inter-camera identity labelling process.
We formulate a Multi-tAsk mulTi-labEl (MATE) deep learning method for Intra-Camera Supervised (ICS) person re-id.
arXiv Detail & Related papers (2020-02-12T15:26:33Z) - Rethinking the Distribution Gap of Person Re-identification with
Camera-based Batch Normalization [90.9485099181197]
This paper rethinks the working mechanism of conventional ReID approaches.
We force the image data of all cameras to fall onto the same subspace, so that the distribution gap between any camera pair is largely shrunk.
Experiments on a wide range of ReID tasks demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2020-01-23T17:22:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.