RMS-Net: Regression and Masking for Soccer Event Spotting
- URL: http://arxiv.org/abs/2102.07624v1
- Date: Mon, 15 Feb 2021 16:04:18 GMT
- Title: RMS-Net: Regression and Masking for Soccer Event Spotting
- Authors: Matteo Tomei, Lorenzo Baraldi, Simone Calderara, Simone Bronzin, Rita
Cucchiara
- Abstract summary: We devise a lightweight and modular network for action spotting, which can simultaneously predict the event label and its temporal offset.
When tested on the SoccerNet dataset and using standard features, our full proposal exceeds the current state of the art by 3 Average-mAP points.
- Score: 52.742046866220484
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The recently proposed action spotting task consists in finding the exact
timestamp in which an event occurs. This task fits particularly well for soccer
videos, where events correspond to salient actions strictly defined by soccer
rules (a goal occurs when the ball crosses the goal line). In this paper, we
devise a lightweight and modular network for action spotting, which can
simultaneously predict the event label and its temporal offset using the same
underlying features. We enrich our model with two training strategies: the
first one for data balancing and uniform sampling, the second for masking
ambiguous frames and keeping the most discriminative visual cues. When tested
on the SoccerNet dataset and using standard features, our full proposal exceeds
the current state of the art by 3 Average-mAP points. Additionally, it reaches
a gain of more than 10 Average-mAP points on the test set when fine-tuned in
combination with a strong 2D backbone.
Related papers
- Towards Active Learning for Action Spotting in Association Football
Videos [59.84375958757395]
Analyzing football videos is challenging and requires identifying subtle and diverse-temporal patterns.
Current algorithms face significant challenges when learning from limited annotated data.
We propose an active learning framework that selects the most informative video samples to be annotated next.
arXiv Detail & Related papers (2023-04-09T11:50:41Z) - Dual Memory Aggregation Network for Event-Based Object Detection with
Learnable Representation [79.02808071245634]
Event-based cameras are bio-inspired sensors that capture brightness change of every pixel in an asynchronous manner.
Event streams are divided into grids in the x-y-t coordinates for both positive and negative polarity, producing a set of pillars as 3D tensor representation.
Long memory is encoded in the hidden state of adaptive convLSTMs while short memory is modeled by computing spatial-temporal correlation between event pillars.
arXiv Detail & Related papers (2023-03-17T12:12:41Z) - A Graph-Based Method for Soccer Action Spotting Using Unsupervised
Player Classification [75.93186954061943]
Action spotting involves understanding the dynamics of the game, the complexity of events, and the variation of video sequences.
In this work, we focus on the former by (a) identifying and representing the players, referees, and goalkeepers as nodes in a graph, and by (b) modeling their temporal interactions as sequences of graphs.
For the player identification task, our method obtains an overall performance of 57.83% average-mAP by combining it with other modalities.
arXiv Detail & Related papers (2022-11-22T15:23:53Z) - Spotting Temporally Precise, Fine-Grained Events in Video [23.731838969934206]
We introduce the task of spotting temporally precise, fine-grained events in video.
Models must reason globally about the full-time scale of actions and locally to identify subtle frame-to-frame appearance and motion differences.
We propose E2E-Spot, a compact, end-to-end model that performs well on the precise spotting task and can be trained quickly on a single GPU.
arXiv Detail & Related papers (2022-07-20T22:15:07Z) - Temporally Precise Action Spotting in Soccer Videos Using Dense
Detection Anchors [1.6114012813668934]
We present a model for temporally precise action spotting in videos, which uses a dense set of detection anchors, predicting a detection confidence and corresponding fine-grained temporal displacement for each anchor.
We achieve a new state-of-the-art on SoccerNet-v2, the largest soccer video dataset of its kind, with marked improvements in temporal localization.
arXiv Detail & Related papers (2022-05-20T22:14:02Z) - Bridging the Gap between Events and Frames through Unsupervised Domain
Adaptation [57.22705137545853]
We propose a task transfer method that allows models to be trained directly with labeled images and unlabeled event data.
We leverage the generative event model to split event features into content and motion features.
Our approach unlocks the vast amount of existing image datasets for the training of event-based neural networks.
arXiv Detail & Related papers (2021-09-06T17:31:37Z) - A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games [3.7111751305143654]
We present a universal taxonomy that covers a wide range of low and high-level events for invasion games.
We release two multi-modal datasets comprising video and positional data with gold-standard annotations to foster research in fine-grained and ball-centered event spotting.
arXiv Detail & Related papers (2021-08-25T10:09:28Z) - Dense Scene Multiple Object Tracking with Box-Plane Matching [73.54369833671772]
Multiple Object Tracking (MOT) is an important task in computer vision.
We propose the Box-Plane Matching (BPM) method to improve the MOT performacne in dense scenes.
With the effectiveness of the three modules, our team achieves the 1st place on the Track-1 leaderboard in the ACM MM Grand Challenge HiEve 2020.
arXiv Detail & Related papers (2020-07-30T16:39:22Z) - Event detection in coarsely annotated sports videos via parallel multi
receptive field 1D convolutions [14.30009544149561]
In problems such as sports video analytics, it is difficult to obtain accurate frame level annotations and exact event duration.
We propose the task of event detection in coarsely annotated videos.
We introduce a multi-tower temporal convolutional network architecture for the proposed task.
arXiv Detail & Related papers (2020-04-13T19:51:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.