SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts
Commentaries
- URL: http://arxiv.org/abs/2304.04565v1
- Date: Mon, 10 Apr 2023 13:08:03 GMT
- Title: SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts
Commentaries
- Authors: Hassan Mkhallati and Anthony Cioppa and Silvio Giancola and Bernard
Ghanem and Marc Van Droogenbroeck
- Abstract summary: We propose a novel task of dense video captioning focusing on the generation of textual commentaries anchored with single timestamps.
We present a challenging dataset consisting of almost 37k timestamped commentaries across 715.9 hours of soccer broadcast videos.
- Score: 71.44210436913029
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Soccer is more than just a game - it is a passion that transcends borders and
unites people worldwide. From the roar of the crowds to the excitement of the
commentators, every moment of a soccer match is a thrill. Yet, with so many
games happening simultaneously, fans cannot watch them all live. Notifications
for main actions can help, but lack the engagement of live commentary, leaving
fans feeling disconnected. To fulfill this need, we propose in this paper a
novel task of dense video captioning focusing on the generation of textual
commentaries anchored with single timestamps. To support this task, we
additionally present a challenging dataset consisting of almost 37k timestamped
commentaries across 715.9 hours of soccer broadcast videos. Additionally, we
propose a first benchmark and baseline for this task, highlighting the
difficulty of temporally anchoring commentaries yet showing the capacity to
generate meaningful commentaries. By providing broadcasters with a tool to
summarize the content of their video with the same level of engagement as a
live game, our method could help satisfy the needs of the numerous fans who
follow their team but cannot necessarily watch the live game. We believe our
method has the potential to enhance the accessibility and understanding of
soccer content for a wider audience, bringing the excitement of the game to
more people.
Related papers
- SoccerNet 2024 Challenges Results [152.8534707514927]
SoccerNet 2024 challenges represent the fourth annual video understanding challenges organized by the SoccerNet team.
The challenges aim to advance research across multiple themes in football, including broadcast video understanding, field understanding, and player understanding.
This year, the challenges encompass four vision-based tasks.
arXiv Detail & Related papers (2024-09-16T14:12:22Z) - Deep Understanding of Soccer Match Videos [20.783415560412003]
Soccer is one of the most popular sport worldwide, with live broadcasts frequently available for major matches.
Our system can detect key objects such as soccer balls, players and referees.
It also tracks the movements of players and the ball, recognizes player numbers, classifies scenes, and identifies highlights such as goal kicks.
arXiv Detail & Related papers (2024-07-11T05:54:13Z) - SoccerNet 2023 Challenges Results [165.5977813812761]
SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team.
For this third edition, the challenges were composed of seven vision-based tasks split into three main themes.
arXiv Detail & Related papers (2023-09-12T07:03:30Z) - GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for
Real-time Soccer Commentary Generation [75.60413443783953]
We present GOAL, a benchmark of over 8.9k soccer video clips, 22k sentences, and 42k knowledge triples for proposing a challenging new task setting as Knowledge-grounded Video Captioning (KGVC)
Our data and code are available at https://github.com/THU-KEG/goal.
arXiv Detail & Related papers (2023-03-26T08:43:36Z) - Going for GOAL: A Resource for Grounded Football Commentaries [66.10040637644697]
We present GrOunded footbAlL commentaries (GOAL), a novel dataset of football (or soccer') highlights videos with transcribed live commentaries in English.
We provide state-of-the-art baselines for the following tasks: frame reordering, moment retrieval, live commentary retrieval and play-by-play live commentary generation.
Results show that SOTA models perform reasonably well in most tasks.
arXiv Detail & Related papers (2022-11-08T20:04:27Z) - GOAL: Towards Benchmarking Few-Shot Sports Game Summarization [0.3683202928838613]
We release GOAL, the first English sports game summarization dataset.
There are 103 commentary-news pairs in GOAL, where the average lengths of commentaries and news are 2724.9 and 476.3 words, respectively.
arXiv Detail & Related papers (2022-07-18T14:29:18Z) - SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of
Broadcast Soccer Videos [71.72665910128975]
SoccerNet-v2 is a novel large-scale corpus of manual annotations for the SoccerNet video dataset.
We release around 300k annotations within SoccerNet's 500 untrimmed broadcast soccer videos.
We extend current tasks in the realm of soccer to include action spotting, camera shot segmentation with boundary detection.
arXiv Detail & Related papers (2020-11-26T16:10:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.