SmartMocap: Joint Estimation of Human and Camera Motion using
Uncalibrated RGB Cameras
- URL: http://arxiv.org/abs/2209.13906v2
- Date: Sat, 1 Apr 2023 21:52:18 GMT
- Title: SmartMocap: Joint Estimation of Human and Camera Motion using
Uncalibrated RGB Cameras
- Authors: Nitin Saini, Chun-hao P. Huang, Michael J. Black, Aamir Ahmad
- Abstract summary: Markerless human motion capture (mocap) from multiple RGB cameras is a widely studied problem.
Existing methods either need calibrated cameras or calibrate them relative to a static camera, which acts as the reference frame for the mocap system.
We propose a mocap method which uses multiple static and moving extrinsically uncalibrated RGB cameras.
- Score: 49.110201064166915
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Markerless human motion capture (mocap) from multiple RGB cameras is a widely
studied problem. Existing methods either need calibrated cameras or calibrate
them relative to a static camera, which acts as the reference frame for the
mocap system. The calibration step has to be done a priori for every capture
session, which is a tedious process, and re-calibration is required whenever
cameras are intentionally or accidentally moved. In this paper, we propose a
mocap method which uses multiple static and moving extrinsically uncalibrated
RGB cameras. The key components of our method are as follows. First, since the
cameras and the subject can move freely, we select the ground plane as a common
reference to represent both the body and the camera motions unlike existing
methods which represent bodies in the camera coordinate. Second, we learn a
probability distribution of short human motion sequences ($\sim$1sec) relative
to the ground plane and leverage it to disambiguate between the camera and
human motion. Third, we use this distribution as a motion prior in a novel
multi-stage optimization approach to fit the SMPL human body model and the
camera poses to the human body keypoints on the images. Finally, we show that
our method can work on a variety of datasets ranging from aerial cameras to
smartphones. It also gives more accurate results compared to the
state-of-the-art on the task of monocular human mocap with a static camera. Our
code is available for research purposes on
https://github.com/robot-perception-group/SmartMocap.
Related papers
- COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation [98.05046790227561]
COIN is a control-inpainting motion diffusion prior that enables fine-grained control to disentangle human and camera motions.
COIN outperforms the state-of-the-art methods in terms of global human motion estimation and camera motion estimation.
arXiv Detail & Related papers (2024-08-29T10:36:29Z) - CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras [18.51320244029833]
It is now possible to estimate 3D human pose from monocular images with off-the-shelf 3D pose estimators.
Many practical applications require fine-grained absolute pose information for which multi-view cues and camera calibration are necessary.
Our goal is full automation, which includes temporal synchronization, as well as intrinsic and extrinsic camera calibration.
arXiv Detail & Related papers (2024-05-10T23:02:23Z) - WHAC: World-grounded Humans and Cameras [37.877565981937586]
We aim to recover expressive parametric human models (i.e., SMPL-X) and corresponding camera poses jointly.
We introduce a novel framework, referred to as WHAC, to facilitate world-grounded expressive human pose and shape estimation.
We present a new synthetic dataset, WHAC-A-Mole, which includes accurately annotated humans and cameras.
arXiv Detail & Related papers (2024-03-19T17:58:02Z) - Decoupling Human and Camera Motion from Videos in the Wild [67.39432972193929]
We propose a method to reconstruct global human trajectories from videos in the wild.
Our method decouples the camera and human motion, which allows us to place people in the same world coordinate frame.
arXiv Detail & Related papers (2023-02-24T18:59:15Z) - Scene-Aware 3D Multi-Human Motion Capture from a Single Camera [83.06768487435818]
We consider the problem of estimating the 3D position of multiple humans in a scene as well as their body shape and articulation from a single RGB video recorded with a static camera.
We leverage recent advances in computer vision using large-scale pre-trained models for a variety of modalities, including 2D body joints, joint angles, normalized disparity maps, and human segmentation masks.
In particular, we estimate the scene depth and unique person scale from normalized disparity predictions using the 2D body joints and joint angles.
arXiv Detail & Related papers (2023-01-12T18:01:28Z) - AirPose: Multi-View Fusion Network for Aerial 3D Human Pose and Shape
Estimation [51.17610485589701]
We present a novel markerless 3D human motion capture (MoCap) system for unstructured, outdoor environments.
AirPose estimates human pose and shape using images captured by multiple uncalibrated flying cameras.
AirPose itself calibrates the cameras relative to the person instead of relying on any pre-calibration.
arXiv Detail & Related papers (2022-01-20T09:46:20Z) - Dynamic Multi-Person Mesh Recovery From Uncalibrated Multi-View Cameras [11.225376081130849]
We introduce a physics-geometry consistency to reduce the low and high frequency noises of the detected human semantics.
Then a novel latent motion prior is proposed to simultaneously optimize extrinsic camera parameters and coherent human motions from slightly noisy inputs.
Experimental results show that accurate camera parameters and human motions can be obtained through one-stage optimization.
arXiv Detail & Related papers (2021-10-20T03:19:20Z) - FLEX: Parameter-free Multi-view 3D Human Motion Reconstruction [70.09086274139504]
Multi-view algorithms strongly depend on camera parameters, in particular, the relative positions among the cameras.
We introduce FLEX, an end-to-end parameter-free multi-view model.
We demonstrate results on the Human3.6M and KTH Multi-view Football II datasets.
arXiv Detail & Related papers (2021-05-05T09:08:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.