MMSys'22 Grand Challenge on AI-based Video Production for Soccer
- URL: http://arxiv.org/abs/2202.01031v1
- Date: Wed, 2 Feb 2022 13:53:42 GMT
- Title: MMSys'22 Grand Challenge on AI-based Video Production for Soccer
- Authors: Cise Midoglu, Steven A. Hicks, Vajira Thambawita, Tomas Kupka, P{\aa}l
Halvorsen
- Abstract summary: This challenge aims to assist the automation of such a production pipeline using AI.
In particular, we focus on the enhancement operations that take place after an event has been detected.
- Score: 2.14475390920102
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Soccer has a considerable market share of the global sports industry, and the
interest in viewing videos from soccer games continues to grow. In this
respect, it is important to provide game summaries and highlights of the main
game events. However, annotating and producing events and summaries often
require expensive equipment and a lot of tedious, cumbersome, manual labor.
Therefore, automating the video production pipeline providing fast game
highlights at a much lower cost is seen as the "holy grail". In this context,
recent developments in Artificial Intelligence (AI) technology have shown great
potential. Still, state-of-the-art approaches are far from being adequate for
practical scenarios that have demanding real-time requirements, as well as
strict performance criteria (where at least the detection of official events
such as goals and cards must be 100% accurate). In addition, event detection
should be thoroughly enhanced by annotation and classification, proper
clipping, generating short descriptions, selecting appropriate thumbnails for
highlight clips, and finally, combining the event highlights into an overall
game summary, similar to what is commonly aired during sports news. Even though
the event tagging operation has by far received the most attention, an
end-to-end video production pipeline also includes various other operations
which serve the overall purpose of automated soccer analysis. This challenge
aims to assist the automation of such a production pipeline using AI. In
particular, we focus on the enhancement operations that take place after an
event has been detected, namely event clipping (Task 1), thumbnail selection
(Task 2), and game summarization (Task 3). Challenge website:
https://mmsys2022.ie/authors/grand-challenge.
Related papers
- Deep learning for action spotting in association football videos [64.10841325879996]
The SoccerNet initiative organizes yearly challenges, during which participants from all around the world compete to achieve state-of-the-art performances.
This paper traces the history of action spotting in sports, from the creation of the task back in 2018, to the role it plays today in research and the sports industry.
arXiv Detail & Related papers (2024-10-02T07:56:15Z) - Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex
and Professional Sports [90.79212954022218]
We introduce the first dataset, named Sports-QA, specifically designed for the sports VideoQA task.
Sports-QA dataset includes various types of questions, such as descriptions, chronologies, causalities, and counterfactual conditions.
We propose a new Auto-Focus Transformer (AFT) capable of automatically focusing on particular scales of temporal information for question answering.
arXiv Detail & Related papers (2024-01-03T02:22:34Z) - GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for
Real-time Soccer Commentary Generation [75.60413443783953]
We present GOAL, a benchmark of over 8.9k soccer video clips, 22k sentences, and 42k knowledge triples for proposing a challenging new task setting as Knowledge-grounded Video Captioning (KGVC)
Our data and code are available at https://github.com/THU-KEG/goal.
arXiv Detail & Related papers (2023-03-26T08:43:36Z) - A Graph-Based Method for Soccer Action Spotting Using Unsupervised
Player Classification [75.93186954061943]
Action spotting involves understanding the dynamics of the game, the complexity of events, and the variation of video sequences.
In this work, we focus on the former by (a) identifying and representing the players, referees, and goalkeepers as nodes in a graph, and by (b) modeling their temporal interactions as sequences of graphs.
For the player identification task, our method obtains an overall performance of 57.83% average-mAP by combining it with other modalities.
arXiv Detail & Related papers (2022-11-22T15:23:53Z) - A Multi-stage deep architecture for summary generation of soccer videos [11.41978608521222]
We propose a method to generate the summary of a soccer match exploiting both the audio and the event metadata.
The results show that our method can detect the actions of the match, identify which of these actions should belong to the summary and then propose multiple candidate summaries.
arXiv Detail & Related papers (2022-05-02T07:26:35Z) - Smart Director: An Event-Driven Directing System for Live Broadcasting [110.30675947733167]
Smart Director aims at mimicking the typical human-in-the-loop broadcasting process to automatically create near-professional broadcasting programs in real-time.
Our system is the first end-to-end automated directing system for multi-camera sports broadcasting.
arXiv Detail & Related papers (2022-01-11T16:14:41Z) - A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games [3.7111751305143654]
We present a universal taxonomy that covers a wide range of low and high-level events for invasion games.
We release two multi-modal datasets comprising video and positional data with gold-standard annotations to foster research in fine-grained and ball-centered event spotting.
arXiv Detail & Related papers (2021-08-25T10:09:28Z) - Feature Combination Meets Attention: Baidu Soccer Embeddings and
Transformer based Temporal Detection [3.7709686875144337]
We present a two-stage paradigm to detect what and when events happen in soccer broadcast videos.
Specifically, we fine-tune multiple action recognition models on soccer data to extract high-level semantic features.
This approach achieved the state-of-the-art performance in both two tasks, i.e., action spotting and replay grounding, in the SoccerNet-v2 Challenge.
arXiv Detail & Related papers (2021-06-28T08:00:21Z) - SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of
Broadcast Soccer Videos [71.72665910128975]
SoccerNet-v2 is a novel large-scale corpus of manual annotations for the SoccerNet video dataset.
We release around 300k annotations within SoccerNet's 500 untrimmed broadcast soccer videos.
We extend current tasks in the realm of soccer to include action spotting, camera shot segmentation with boundary detection.
arXiv Detail & Related papers (2020-11-26T16:10:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.