Interpretable Pre-Release Baseball Pitch Type Anticipation from Broadcast 3D Kinematics
- URL: http://arxiv.org/abs/2603.04874v1
- Date: Thu, 05 Mar 2026 07:04:35 GMT
- Title: Interpretable Pre-Release Baseball Pitch Type Anticipation from Broadcast 3D Kinematics
- Authors: Jerrin Bright, Michelle Lu, John Zelek,
- Abstract summary: We classify eight pitch types from monocular 3D pose sequences without access to ball-flight data.<n>Our pipeline chains a diffusion-based 3D pose backbone with automatic pitching-event detection.<n>We achieve 80.4% accuracy using body kinematics alone on 119,561 professional pitches.
- Score: 3.1224081969539714
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: How much can a pitcher's body reveal about the upcoming pitch? We study this question at scale by classifying eight pitch types from monocular 3D pose sequences, without access to ball-flight data. Our pipeline chains a diffusion-based 3D pose backbone with automatic pitching-event detection, groundtruth-validated biomechanical feature extraction, and gradient-boosted classification over 229 kinematic features. Evaluated on 119,561 professional pitches, the largest such benchmark to date, we achieve 80.4\% accuracy using body kinematics alone. A systematic importance analysis reveals that upper-body mechanics contribute 64.9\% of the predictive signal versus 35.1\% for the lower body, with wrist position (14.8\%) and trunk lateral tilt emerging as the most informative joint group and biomechanical feature, respectively. We further show that grip-defined variants (four-seam vs.\ two-seam fastball) are not separable from pose, establishing an empirical ceiling near 80\% and delineating where kinematic information ends and ball-flight information begins.
Related papers
- Scalable Injury-Risk Screening in Baseball Pitching From Broadcast Video [2.994962964425238]
We present a monocular video pipeline that recovers 18 clinically relevant biomechanics metrics from broadcast footage.<n>Built on DreamPose3D, our approach introduces a drift-controlled global lifting module that recovers pelvis trajectory.<n>Using these metrics for injury prediction, an automated screening model achieves AUC 0.811 for Tommy John surgery and 0.825 for significant arm injuries on 7,348 pitchers.
arXiv Detail & Related papers (2026-03-05T06:38:32Z) - Trace Anything: Representing Any Video in 4D via Trajectory Fields [98.85848134960172]
Trajectory Field is a dense mapping that assigns a continuous 3D trajectory function of time to each pixel in every frame.<n>We introduce Trace Anything, a neural network that predicts the entire trajectory field in a single feed-forward pass.<n>We trained the Trace Anything model on large-scale 4D data, including data from our new platform.
arXiv Detail & Related papers (2025-10-15T17:59:04Z) - SpikeGrasp: A Benchmark for 6-DoF Grasp Pose Detection from Stereo Spike Streams [57.84331423686738]
Most robotic grasping systems rely on converting sensor data into explicit 3D point clouds, which is a computational step not found in biological intelligence.<n>We introduce SpikeGrasp, a framework that mimics the biological visuomotor pathway, processing raw, asynchronous events from stereo spike cameras, similarly to retinas, to directly infer grasp poses.<n>Our model fuses these stereo spike streams and uses a recurrent spiking neural network, analogous to high-level visual processing, to iteratively refine grasp hypotheses without ever reconstructing a point cloud.
arXiv Detail & Related papers (2025-10-12T13:36:40Z) - AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability [4.991985467382602]
We introduce the AthleticsPose dataset, featuring real'' motions captured from 23 athletes performing various athletics events on an athletic field.<n>Our results show that the model trained on AthleticsPose significantly outperforms a baseline model trained on an imitated sports motion dataset.<n>In case studies of kinematic indicators, the model demonstrated the potential to capture individual differences in knee angles but struggled with higher-speed metrics.
arXiv Detail & Related papers (2025-07-17T08:43:23Z) - AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements [4.653030985708889]
AthletePose3D is a novel dataset designed to capture high-speed, high-acceleration athletic movements.<n>We evaluate state-of-the-art (SOTA) monocular 2D and 3D pose estimation models on the dataset.
arXiv Detail & Related papers (2025-03-10T16:16:02Z) - Estimating Body and Hand Motion in an Ego-sensed World [62.61989004520802]
We present EgoAllo, a system for human motion estimation from a head-mounted device.<n>Using only egocentric SLAM poses and images, EgoAllo guides sampling from a conditional diffusion model to estimate 3D body pose, height, and hand parameters.
arXiv Detail & Related papers (2024-10-04T17:59:57Z) - PitcherNet: Powering the Moneyball Evolution in Baseball Video Analytics [13.928557561312026]
PitcherNet is an end-to-end automated system that analyzes pitcher kinematics directly from live broadcast video.
It achieves robust analysis results with 96.82% accuracy in pitcher tracklet identification.
PitcherNet paves the way for the future of baseball analytics by optimizing pitching strategies, preventing injuries, and unlocking a deeper understanding of pitcher mechanics.
arXiv Detail & Related papers (2024-05-13T01:03:06Z) - Towards Single Camera Human 3D-Kinematics [15.559206592078425]
We propose a novel approach for direct 3D human kinematic estimation D3KE from videos using deep neural networks.
Our experiments demonstrate that the proposed end-to-end training is robust and outperforms 2D and 3D markerless motion capture based kinematic estimation pipelines.
arXiv Detail & Related papers (2023-01-13T08:44:09Z) - Tactile Grasp Refinement using Deep Reinforcement Learning and Analytic
Grasp Stability Metrics [70.65363356763598]
We show that analytic grasp stability metrics constitute powerful optimization objectives for reinforcement learning algorithms.
We show that a combination of geometric and force-agnostic grasp stability metrics yields the highest average success rates of 95.4% for cuboids.
In a second experiment, we show that grasp refinement algorithms trained with contact feedback perform up to 6.6% better than a baseline that receives no tactile information.
arXiv Detail & Related papers (2021-09-23T09:20:19Z) - Neural Monocular 3D Human Motion Capture with Physical Awareness [76.55971509794598]
We present a new trainable system for physically plausible markerless 3D human motion capture.
Unlike most neural methods for human motion capture, our approach is aware of physical and environmental constraints.
It produces smooth and physically principled 3D motions in an interactive frame rate in a wide variety of challenging scenes.
arXiv Detail & Related papers (2021-05-03T17:57:07Z) - Contact and Human Dynamics from Monocular Video [73.47466545178396]
Existing deep models predict 2D and 3D kinematic poses from video that are approximately accurate, but contain visible errors.
We present a physics-based method for inferring 3D human motion from video sequences that takes initial 2D and 3D pose estimates as input.
arXiv Detail & Related papers (2020-07-22T21:09:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.