Multi-Agent System for Comprehensive Soccer Understanding
- URL: http://arxiv.org/abs/2505.03735v1
- Date: Tue, 06 May 2025 17:59:31 GMT
- Title: Multi-Agent System for Comprehensive Soccer Understanding
- Authors: Jiayuan Rao, Zifeng Li, Haoning Wu, Ya Zhang, Yanfeng Wang, Weidi Xie,
- Abstract summary: We construct SoccerWiki, the first large-scale multimodal soccer knowledge base.<n>We present SoccerBench, the largest and most comprehensive soccer-specific benchmark.<n>We introduce SoccerAgent, a novel multi-agent system that decomposes complex soccer questions.
- Score: 56.28536879015841
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent advancements in AI-driven soccer understanding have demonstrated rapid progress, yet existing research predominantly focuses on isolated or narrow tasks. To bridge this gap, we propose a comprehensive framework for holistic soccer understanding. Specifically, we make the following contributions in this paper: (i) we construct SoccerWiki, the first large-scale multimodal soccer knowledge base, integrating rich domain knowledge about players, teams, referees, and venues to enable knowledge-driven reasoning; (ii) we present SoccerBench, the largest and most comprehensive soccer-specific benchmark, featuring around 10K standardized multimodal (text, image, video) multi-choice QA pairs across 13 distinct understanding tasks, curated through automated pipelines and manual verification; (iii) we introduce SoccerAgent, a novel multi-agent system that decomposes complex soccer questions via collaborative reasoning, leveraging domain expertise from SoccerWiki and achieving robust performance; (iv) extensive evaluations and ablations that benchmark state-of-the-art MLLMs on SoccerBench, highlighting the superiority of our proposed agentic system. All data and code are publicly available at: https://jyrao.github.io/SoccerAgent/.
Related papers
- SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding [44.04695944511487]
SoccerChat is a conversational AI framework that integrates visual and textual data for enhanced soccer video comprehension.<n>We benchmark SoccerChat on action classification and referee decision-making tasks, demonstrating its performance in general soccer event comprehension.<n>Our findings highlight the importance of multimodal integration in advancing soccer analytics, paving the way for more interactive and explainable AI-driven sports analysis.
arXiv Detail & Related papers (2025-05-22T13:01:51Z) - Towards Universal Soccer Video Understanding [58.889409980618396]
This paper aims to a comprehensive multi-modal framework for soccer understanding.<n>We introduce SoccerReplay-1988, the largest multi-modal soccer dataset to date, featuring videos and detailed annotations from 1, complete matches.<n>We present an advanced soccer-specific visual, MatchVision, which leveragestemporal information across soccer videos and excels in various downstream tasks.
arXiv Detail & Related papers (2024-12-02T18:58:04Z) - X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model [56.393522913188704]
We introduce the Explainable Video Assistant Referee System, X- VARS, a multi-modal large language model designed for understanding football videos from the point of view of a referee.
X- VARS can perform a multitude of tasks, including video description, question answering, action recognition, and conducting meaningful conversations.
We validate X- VARS on our novel dataset, SoccerNet-XFoul, which consists of more than 22k video-question-answer triplets annotated by over 70 experienced football referees.
arXiv Detail & Related papers (2024-04-07T12:42:02Z) - MindAgent: Emergent Gaming Interaction [103.73707345211892]
Large Language Models (LLMs) have the capacity of performing complex scheduling in a multi-agent system.
We propose MindAgent to evaluate planning and coordination emergent capabilities for gaming interaction.
arXiv Detail & Related papers (2023-09-18T17:52:22Z) - An Empirical Study on Google Research Football Multi-agent Scenarios [30.926070192524193]
We open-source our training framework Light-MALib which extends the MALib by distributed and asynchronized implementation with additional analytical tools for football games.
We provide guidance for building strong football AI with population-based training and release diverse pretrained policies for benchmarking.
arXiv Detail & Related papers (2023-05-16T14:18:53Z) - TiZero: Mastering Multi-Agent Football with Curriculum Learning and
Self-Play [19.98100026335148]
TiZero is a self-evolving, multi-agent system that learns from scratch.
It outperforms previous systems by a large margin on the Google Research Football environment.
arXiv Detail & Related papers (2023-02-15T08:19:18Z) - Is it worth the effort? Understanding and contextualizing physical
metrics in soccer [1.2205797997133396]
This framework gives a deep insight into the link between physical and technical-tactical aspects of soccer.
It allows associating physical performance with value generation thanks to a top-down approach.
arXiv Detail & Related papers (2022-04-05T16:14:40Z) - SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of
Broadcast Soccer Videos [71.72665910128975]
SoccerNet-v2 is a novel large-scale corpus of manual annotations for the SoccerNet video dataset.
We release around 300k annotations within SoccerNet's 500 untrimmed broadcast soccer videos.
We extend current tasks in the realm of soccer to include action spotting, camera shot segmentation with boundary detection.
arXiv Detail & Related papers (2020-11-26T16:10:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.