Mitigating Motion Blur for Robust 3D Baseball Player Pose Modeling for
Pitch Analysis
- URL: http://arxiv.org/abs/2309.01010v1
- Date: Sat, 2 Sep 2023 19:42:59 GMT
- Title: Mitigating Motion Blur for Robust 3D Baseball Player Pose Modeling for
Pitch Analysis
- Authors: Jerrin Bright, Yuhao Chen, John Zelek
- Abstract summary: We propose a synthetic data augmentation pipeline to enhance the model's capability to deal with the pitcher's blurry actions.
We observe a notable reduction in the loss by 54.2% and 36.2% on the test dataset for 2D and 3D pose estimation respectively.
- Score: 5.010690651107531
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Using videos to analyze pitchers in baseball can play a vital role in
strategizing and injury prevention. Computer vision-based pose analysis offers
a time-efficient and cost-effective approach. However, the use of accessible
broadcast videos, with a 30fps framerate, often results in partial body motion
blur during fast actions, limiting the performance of existing pose keypoint
estimation models. Previous works have primarily relied on fixed backgrounds,
assuming minimal motion differences between frames, or utilized multiview data
to address this problem. To this end, we propose a synthetic data augmentation
pipeline to enhance the model's capability to deal with the pitcher's blurry
actions. In addition, we leverage in-the-wild videos to make our model robust
under different real-world conditions and camera positions. By carefully
optimizing the augmentation parameters, we observed a notable reduction in the
loss by 54.2% and 36.2% on the test dataset for 2D and 3D pose estimation
respectively. By applying our approach to existing state-of-the-art pose
estimators, we demonstrate an average improvement of 29.2%. The findings
highlight the effectiveness of our method in mitigating the challenges posed by
motion blur, thereby enhancing the overall quality of pose estimation.
Related papers
- Gait Recognition from Highly Compressed Videos [3.1049440318608568]
A common mitigation strategy involves fine-tuning pose estimation models on noisy data to improve robustness.
We propose a processing pipeline that incorporates a task-targeted artifact correction model designed to pre-process and enhance surveillance footage.
Our experiments show a clear enhancement in gait analysis performance, supporting the viability of the proposed method.
arXiv Detail & Related papers (2024-04-18T13:46:16Z) - Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion [25.54868552979793]
We present a method that adapts to camera motion and allows high-quality scene reconstruction with handheld video data.
Our results with both synthetic and real data demonstrate superior performance in mitigating camera motion over existing methods.
arXiv Detail & Related papers (2024-03-20T06:19:41Z) - Learning Robust Multi-Scale Representation for Neural Radiance Fields
from Unposed Images [65.41966114373373]
We present an improved solution to the neural image-based rendering problem in computer vision.
The proposed approach could synthesize a realistic image of the scene from a novel viewpoint at test time.
arXiv Detail & Related papers (2023-11-08T08:18:23Z) - Sample Less, Learn More: Efficient Action Recognition via Frame Feature
Restoration [59.6021678234829]
We propose a novel method to restore the intermediate features for two sparsely sampled and adjacent video frames.
With the integration of our method, the efficiency of three commonly used baselines has been improved by over 50%, with a mere 0.5% reduction in recognition accuracy.
arXiv Detail & Related papers (2023-07-27T13:52:42Z) - Kinematic-aware Hierarchical Attention Network for Human Pose Estimation
in Videos [17.831839654593452]
Previous-based human pose estimation methods have shown promising results by leveraging features of consecutive frames.
Most approaches compromise accuracy to jitter and do not comprehend the temporal aspects of human motion.
We design an architecture that exploits kinematic keypoint features.
arXiv Detail & Related papers (2022-11-29T01:46:11Z) - Motion-from-Blur: 3D Shape and Motion Estimation of Motion-blurred
Objects in Videos [115.71874459429381]
We propose a method for jointly estimating the 3D motion, 3D shape, and appearance of highly motion-blurred objects from a video.
Experiments on benchmark datasets demonstrate that our method outperforms previous methods for fast moving object deblurring and 3D reconstruction.
arXiv Detail & Related papers (2021-11-29T11:25:14Z) - Learning to Segment Rigid Motions from Two Frames [72.14906744113125]
We propose a modular network, motivated by a geometric analysis of what independent object motions can be recovered from an egomotion field.
It takes two consecutive frames as input and predicts segmentation masks for the background and multiple rigidly moving objects, which are then parameterized by 3D rigid transformations.
Our method achieves state-of-the-art performance for rigid motion segmentation on KITTI and Sintel.
arXiv Detail & Related papers (2021-01-11T04:20:30Z) - Towards Accurate Human Pose Estimation in Videos of Crowded Scenes [134.60638597115872]
We focus on improving human pose estimation in videos of crowded scenes from the perspectives of exploiting temporal context and collecting new data.
For one frame, we forward the historical poses from the previous frames and backward the future poses from the subsequent frames to current frame, leading to stable and accurate human pose estimation in videos.
In this way, our model achieves best performance on 7 out of 13 videos and 56.33 average w_AP on test dataset of HIE challenge.
arXiv Detail & Related papers (2020-10-16T13:19:11Z) - Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image
Synthesis [72.34794624243281]
We propose a self-supervised learning framework to disentangle variations from unlabeled video frames.
Our differentiable formalization, bridging the representation gap between the 3D pose and spatial part maps, allows us to operate on videos with diverse camera movements.
arXiv Detail & Related papers (2020-04-09T07:55:01Z) - Prior-enlightened and Motion-robust Video Deblurring [29.158836861982742]
PRiOr-enlightened and MOTION-robust deblurring model (PROMOTION) suitable for challenging blurs.
We use 3D group convolution to efficiently encode heterogeneous prior information.
We also design the priors representing blur distribution, to better handle non-uniform blur-temporal domain.
arXiv Detail & Related papers (2020-03-25T04:16:56Z) - Synergetic Reconstruction from 2D Pose and 3D Motion for Wide-Space
Multi-Person Video Motion Capture in the Wild [3.0015034534260665]
We propose a markerless motion capture method with accuracy and smoothness from multiple cameras.
The proposed method predicts each persons 3D pose and determines bounding box of multi-camera images.
We evaluated the proposed method using various datasets and a real sports field.
arXiv Detail & Related papers (2020-01-16T02:14:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.