BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports
- URL: http://arxiv.org/abs/2502.21085v1
- Date: Fri, 28 Feb 2025 14:18:39 GMT
- Title: BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports
- Authors: Jing-Yuan Chang,
- Abstract summary: We introduce a novel video segmentation strategy to extract frames of each player's racket swing in a badminton broadcast match.<n>We propose Badminton Stroke-type Transformer (BST) to classify player stroke-types in singles.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Badminton, known for having the fastest ball speeds among all sports, presents significant challenges to the field of computer vision, including player identification, court line detection, shuttlecock trajectory tracking, and player stroke-type classification. In this paper, we introduce a novel video segmentation strategy to extract frames of each player's racket swing in a badminton broadcast match. These segmented frames are then processed by two existing models: one for Human Pose Estimation to obtain player skeletal joints, and the other for shuttlecock trajectory detection to extract shuttlecock trajectories. Leveraging these joints, trajectories, and player positions as inputs, we propose Badminton Stroke-type Transformer (BST) to classify player stroke-types in singles. To the best of our knowledge, experimental results demonstrate that our method outperforms the previous state-of-the-art on the largest publicly available badminton video dataset, ShuttleSet, which shows that effectively leveraging ball trajectory is likely to be a trend for racket sports action recognition.
Related papers
- Deep learning for action spotting in association football videos [64.10841325879996]
The SoccerNet initiative organizes yearly challenges, during which participants from all around the world compete to achieve state-of-the-art performances.
This paper traces the history of action spotting in sports, from the creation of the task back in 2018, to the role it plays today in research and the sports industry.
arXiv Detail & Related papers (2024-10-02T07:56:15Z) - Sports-Traj: A Unified Trajectory Generation Model for Multi-Agent Movement in Sports [53.637837706712794]
We propose a Unified Trajectory Generation model, UniTraj, that processes arbitrary trajectories as masked inputs.<n>Specifically, we introduce a Ghost Spatial Masking (GSM) module, embedded within a Transformer encoder, for spatial feature extraction.<n>We benchmark three practical sports datasets, Basketball-U, Football-U, and Soccer-U, for evaluation.
arXiv Detail & Related papers (2024-05-27T22:15:23Z) - ShuttleSHAP: A Turn-Based Feature Attribution Approach for Analyzing
Forecasting Models in Badminton [52.21869064818728]
Deep learning approaches for player tactic forecasting in badminton show promising performance partially attributed to effective reasoning about rally-player interactions.
We propose a turn-based feature attribution approach, ShuttleSHAP, for analyzing forecasting models in badminton based on variants of Shapley values.
arXiv Detail & Related papers (2023-12-18T05:37:51Z) - Tracking Players in a Badminton Court by Two Cameras [8.394650183250974]
This study proposes a simple method for multi-object tracking (MOT) of players in a badminton court.
We leverage two off-the-shelf cameras, one on the top of the court and the other on the side of the court.
arXiv Detail & Related papers (2023-08-09T11:10:11Z) - Benchmarking Stroke Forecasting with Stroke-Level Badminton Dataset [13.502952342104644]
We provide a badminton singles dataset, ShuttleSet22, which is collected from high-ranking matches in 2022.
To benchmark existing work with ShuttleSet22, we hold a challenge, Track 2: Forecasting Future Turn-Based Strokes in Badminton Rallies, at CoachAI Badminton Challenge @ IJCAI 2023.
arXiv Detail & Related papers (2023-06-27T17:57:34Z) - A Badminton Recognition and Tracking System Based on Context
Multi-feature Fusion [6.068573093901329]
Two trajectory clip trackers are designed based on different rules to capture the correct trajectory of the ball.
Two rounds of detection from coarse-grained to fine-grained are used to solve the challenges encountered in badminton detection.
arXiv Detail & Related papers (2023-06-26T08:07:56Z) - P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos [64.57435509822416]
This work consists of 2,721 video clips collected from the broadcasting videos of professional table tennis matches in World Table Tennis Championships and Olympiads.
We formulate two sets of action detection problems -- emphaction localization and emphaction recognition.
The results confirm that TheName is still a challenging task and can be used as a special benchmark for dense action detection from videos.
arXiv Detail & Related papers (2022-07-26T08:34:17Z) - A Survey on Video Action Recognition in Sports: Datasets, Methods and
Applications [60.3327085463545]
We present a survey on video action recognition for sports analytics.
We introduce more than ten types of sports, including team sports, such as football, basketball, volleyball, hockey and individual sports, such as figure skating, gymnastics, table tennis, diving and badminton.
We develop a toolbox using PaddlePaddle, which supports football, basketball, table tennis and figure skating action recognition.
arXiv Detail & Related papers (2022-06-02T13:19:36Z) - MonoTrack: Shuttle trajectory reconstruction from monocular badminton
video [6.218613353519723]
We present the first complete end-to-end system for the extraction and segmentation of 3D shuttle trajectories from monocular badminton videos.
Our system integrates badminton domain knowledge such as court dimension, shot placement, physical laws of motion, along with vision-based features such as player poses and shuttle tracking.
arXiv Detail & Related papers (2022-04-04T23:57:57Z) - Exploring the Long Short-Term Dependencies to Infer Shot Influence in
Badminton Matches [9.553207911311926]
We introduce a badminton language to fully describe the process of the shot.
We propose a deep learning model composed of a novel short-term extractor and a long-term encoder.
Our model incorporates an attention mechanism to enable the transparency of the action sequence to the rally result.
arXiv Detail & Related papers (2021-09-14T04:44:40Z) - Unsupervised Visual Representation Learning by Tracking Patches in Video [88.56860674483752]
We propose to use tracking as a proxy task for a computer vision system to learn the visual representations.
Modelled on the Catch game played by the children, we design a Catch-the-Patch (CtP) game for a 3D-CNN model to learn visual representations.
arXiv Detail & Related papers (2021-05-06T09:46:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.