Slicing and dicing soccer: automatic detection of complex events from
spatio-temporal data
- URL: http://arxiv.org/abs/2004.04147v2
- Date: Fri, 10 Apr 2020 07:30:55 GMT
- Title: Slicing and dicing soccer: automatic detection of complex events from
spatio-temporal data
- Authors: Lia Morra, Francesco Manigrasso, Giuseppe Canto, Claudio Gianfrate,
Enrico Guarino, Fabrizio Lamberti
- Abstract summary: The automatic detection of events in sport videos has im-portant applications for data analytics, as well as for broadcasting andmedia companies.
This paper presents a comprehensive approach for de-tecting a wide range of complex events in soccer videos starting frompositional data.
- Score: 4.652872227256855
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The automatic detection of events in sport videos has im-portant applications
for data analytics, as well as for broadcasting andmedia companies. This paper
presents a comprehensive approach for de-tecting a wide range of complex events
in soccer videos starting frompositional data. The event detector is designed
as a two-tier system thatdetectsatomicandcomplex events. Atomic events are
detected basedon temporal and logical combinations of the detected objects,
their rel-ative distances, as well as spatio-temporal features such as velocity
andacceleration. Complex events are defined as temporal and logical
com-binations of atomic and complex events, and are expressed by meansof a
declarative Interval Temporal Logic (ITL). The effectiveness of theproposed
approach is demonstrated over 16 different events, includingcomplex situations
such as tackles and filtering passes. By formalizingevents based on principled
ITL, it is possible to easily perform reason-ing tasks, such as understanding
which passes or crosses result in a goalbeing scored. To counterbalance the
lack of suitable, annotated publicdatasets, we built on an open source soccer
simulation engine to re-lease the synthetic SoccER (Soccer Event Recognition)
dataset, whichincludes complete positional data and annotations for more than
1.6 mil-lion atomic events and 9,000 complex events. The dataset and code
areavailable at https://gitlab.com/grains2/slicing-and-dicing-soccer
Related papers
- Grounding Partially-Defined Events in Multimodal Data [61.0063273919745]
We introduce a multimodal formulation for partially-defined events and cast the extraction of these events as a three-stage span retrieval task.
We propose a benchmark for this task, MultiVENT-G, that consists of 14.5 hours of densely annotated current event videos and 1,168 text documents, containing 22.8K labeled event-centric entities.
Results illustrate the challenges that abstract event understanding poses and demonstrates promise in event-centric video-language systems.
arXiv Detail & Related papers (2024-10-07T17:59:48Z) - Document-Level Multi-Event Extraction with Event Proxy Nodes and
Hausdorff Distance Minimization [31.065768513381414]
Document-level multi-event extraction aims to extract structural information from a given document automatically.
We propose an alternative approach for document-level multi-event extraction with event proxy nodes and Hausdorff distance minimization.
Our model outperforms previous state-of-the-art method in F1-score on two datasets with only a fraction of training time.
arXiv Detail & Related papers (2023-05-30T10:33:05Z) - Abnormal Event Detection via Hypergraph Contrastive Learning [54.80429341415227]
Abnormal event detection plays an important role in many real applications.
In this paper, we study the unsupervised abnormal event detection problem in Attributed Heterogeneous Information Network.
A novel hypergraph contrastive learning method, named AEHCL, is proposed to fully capture abnormal event patterns.
arXiv Detail & Related papers (2023-04-02T08:23:20Z) - Dual Memory Aggregation Network for Event-Based Object Detection with
Learnable Representation [79.02808071245634]
Event-based cameras are bio-inspired sensors that capture brightness change of every pixel in an asynchronous manner.
Event streams are divided into grids in the x-y-t coordinates for both positive and negative polarity, producing a set of pillars as 3D tensor representation.
Long memory is encoded in the hidden state of adaptive convLSTMs while short memory is modeled by computing spatial-temporal correlation between event pillars.
arXiv Detail & Related papers (2023-03-17T12:12:41Z) - eCDT: Event Clustering for Simultaneous Feature Detection and Tracking- [4.094848360328624]
Event Clustering-based Detection Tracking (eCDT)
Our method 30% better feature tracking ages compared with the state-of-the-art approach.
arXiv Detail & Related papers (2022-07-19T07:39:04Z) - Event Transformer [43.193463048148374]
Event camera's low power consumption and ability to capture microsecond brightness make it attractive for various computer vision tasks.
Existing event representation methods typically convert events into frames, voxel grids, or spikes for deep neural networks (DNNs)
This work introduces a novel token-based event representation, where each event is considered a fundamental processing unit termed an event-token.
arXiv Detail & Related papers (2022-04-11T15:05:06Z) - Learning Temporal Rules from Noisy Timeseries Data [72.93572292157593]
We focus on uncovering the underlying atomic events and their relations that lead to the composite events within a noisy temporal data setting.
We propose a Neural Temporal Logic Programming (Neural TLP) which first learns implicit temporal relations between atomic events and then lifts logic rules for supervision.
arXiv Detail & Related papers (2022-02-11T01:29:02Z) - Learning Constraints and Descriptive Segmentation for Subevent Detection [74.48201657623218]
We propose an approach to learning and enforcing constraints that capture dependencies between subevent detection and EventSeg prediction.
We adopt Rectifier Networks for constraint learning and then convert the learned constraints to a regularization term in the loss function of the neural model.
arXiv Detail & Related papers (2021-09-13T20:50:37Z) - RMS-Net: Regression and Masking for Soccer Event Spotting [52.742046866220484]
We devise a lightweight and modular network for action spotting, which can simultaneously predict the event label and its temporal offset.
When tested on the SoccerNet dataset and using standard features, our full proposal exceeds the current state of the art by 3 Average-mAP points.
arXiv Detail & Related papers (2021-02-15T16:04:18Z) - Automatic Pass Annotation from Soccer VideoStreams Based on Object
Detection and LSTM [6.87782863484826]
PassNet is a method to recognize the most frequent events in soccer, i.e., passes, from video streams.
Our results show good results and significant improvement in the accuracy of pass detection.
PassNet is the first step towards an automated event annotation system.
arXiv Detail & Related papers (2020-07-13T16:14:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.