Knowledge Enhanced Sports Game Summarization
- URL: http://arxiv.org/abs/2111.12535v1
- Date: Wed, 24 Nov 2021 15:06:20 GMT
- Title: Knowledge Enhanced Sports Game Summarization
- Authors: Jiaan Wang, Zhixu Li, Tingyi Zhang, Duo Zheng, Jianfeng Qu, An Liu,
Lei Zhao, Zhigang Chen
- Abstract summary: We introduce K-SportsSum, a new dataset with two characteristics.
K-SportsSum collects a large amount of data from massive games.
K-SportsSum further provides a large-scale knowledge corpus that contains the information of 523 sports teams and 14,724 sports players.
- Score: 14.389241106925438
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Sports game summarization aims at generating sports news from live
commentaries. However, existing datasets are all constructed through automated
collection and cleaning processes, resulting in a lot of noise. Besides,
current works neglect the knowledge gap between live commentaries and sports
news, which limits the performance of sports game summarization. In this paper,
we introduce K-SportsSum, a new dataset with two characteristics: (1)
K-SportsSum collects a large amount of data from massive games. It has 7,854
commentary-news pairs. To improve the quality, K-SportsSum employs a manual
cleaning process; (2) Different from existing datasets, to narrow the knowledge
gap, K-SportsSum further provides a large-scale knowledge corpus that contains
the information of 523 sports teams and 14,724 sports players. Additionally, we
also introduce a knowledge-enhanced summarizer that utilizes both live
commentaries and the knowledge to generate sports news. Extensive experiments
on K-SportsSum and SportsSum datasets show that our model achieves new
state-of-the-art performances. Qualitative analysis and human study further
verify that our model generates more informative sports news.
Related papers
- Deep learning for action spotting in association football videos [64.10841325879996]
The SoccerNet initiative organizes yearly challenges, during which participants from all around the world compete to achieve state-of-the-art performances.
This paper traces the history of action spotting in sports, from the creation of the task back in 2018, to the role it plays today in research and the sports industry.
arXiv Detail & Related papers (2024-10-02T07:56:15Z) - Game-MUG: Multimodal Oriented Game Situation Understanding and Commentary Generation Dataset [8.837048597513059]
This paper introduces GAME-MUG, a new multimodal game situation understanding and audience-engaged commentary generation dataset.
Our dataset is collected from 2020-2022 LOL game live streams from YouTube and Twitch, and includes multimodal esports game information, including text, audio, and time-series event logs.
In addition, we also propose a new audience conversation augmented commentary dataset by covering the game situation and audience conversation understanding.
arXiv Detail & Related papers (2024-04-30T00:39:26Z) - Knowledge Guided Entity-aware Video Captioning and A Basketball
Benchmark [49.54265459763042]
We construct a multimodal basketball game knowledge graph (KG_NBA_2022) to provide additional knowledge beyond videos.
Then, a dataset that contains 9 types of fine-grained shooting events and 286 players' knowledge is constructed based on KG_NBA_2022.
We develop a knowledge guided entity-aware video captioning network (KEANet) based on a candidate player list in encoder-decoder form for basketball live text broadcast.
arXiv Detail & Related papers (2024-01-25T02:08:37Z) - Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex
and Professional Sports [90.79212954022218]
We introduce the first dataset, named Sports-QA, specifically designed for the sports VideoQA task.
Sports-QA dataset includes various types of questions, such as descriptions, chronologies, causalities, and counterfactual conditions.
We propose a new Auto-Focus Transformer (AFT) capable of automatically focusing on particular scales of temporal information for question answering.
arXiv Detail & Related papers (2024-01-03T02:22:34Z) - GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for
Real-time Soccer Commentary Generation [75.60413443783953]
We present GOAL, a benchmark of over 8.9k soccer video clips, 22k sentences, and 42k knowledge triples for proposing a challenging new task setting as Knowledge-grounded Video Captioning (KGVC)
Our data and code are available at https://github.com/THU-KEG/goal.
arXiv Detail & Related papers (2023-03-26T08:43:36Z) - Commentary Generation from Data Records of Multiplayer Strategy Esports Game [21.133690853111133]
We build large-scale datasets that pair structured data and commentaries from a popular esports game, League of Legends.
We then evaluate Transformer-based models to generate game commentaries from structured data records.
We will release our dataset to boost potential research in the data-to-text generation community.
arXiv Detail & Related papers (2022-12-21T11:23:31Z) - GOAL: Towards Benchmarking Few-Shot Sports Game Summarization [0.3683202928838613]
We release GOAL, the first English sports game summarization dataset.
There are 103 commentary-news pairs in GOAL, where the average lengths of commentaries and news are 2724.9 and 476.3 words, respectively.
arXiv Detail & Related papers (2022-07-18T14:29:18Z) - A Survey on Video Action Recognition in Sports: Datasets, Methods and
Applications [60.3327085463545]
We present a survey on video action recognition for sports analytics.
We introduce more than ten types of sports, including team sports, such as football, basketball, volleyball, hockey and individual sports, such as figure skating, gymnastics, table tennis, diving and badminton.
We develop a toolbox using PaddlePaddle, which supports football, basketball, table tennis and figure skating action recognition.
arXiv Detail & Related papers (2022-06-02T13:19:36Z) - SportsSum2.0: Generating High-Quality Sports News from Live Text
Commentary [18.52461327269355]
Sports game summarization aims to generate news articles from live text commentaries.
Recent work, SportsSum, not only constructs a large benchmark dataset, but also proposes a two-step framework.
In this paper, we publish a new benchmark dataset SportsSum2.0, together with a modified summarization framework.
arXiv Detail & Related papers (2021-10-12T05:39:48Z) - SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of
Broadcast Soccer Videos [71.72665910128975]
SoccerNet-v2 is a novel large-scale corpus of manual annotations for the SoccerNet video dataset.
We release around 300k annotations within SoccerNet's 500 untrimmed broadcast soccer videos.
We extend current tasks in the realm of soccer to include action spotting, camera shot segmentation with boundary detection.
arXiv Detail & Related papers (2020-11-26T16:10:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.