Recent Advances in Monocular 2D and 3D Human Pose Estimation: A Deep
Learning Perspective
- URL: http://arxiv.org/abs/2104.11536v1
- Date: Fri, 23 Apr 2021 11:07:07 GMT
- Title: Recent Advances in Monocular 2D and 3D Human Pose Estimation: A Deep
Learning Perspective
- Authors: Wu Liu, Qian Bao, Yu Sun, Tao Mei
- Abstract summary: We provide a comprehensive and holistic 2D-to-3D perspective to tackle this problem.
We categorize the mainstream and milestone approaches since the year 2014 under unified frameworks.
We also summarize the pose representation styles, benchmarks, evaluation metrics, and the quantitative performance of popular approaches.
- Score: 69.44384540002358
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Estimation of the human pose from a monocular camera has been an emerging
research topic in the computer vision community with many applications.
Recently, benefited from the deep learning technologies, a significant amount
of research efforts have greatly advanced the monocular human pose estimation
both in 2D and 3D areas. Although there have been some works to summarize the
different approaches, it still remains challenging for researchers to have an
in-depth view of how these approaches work. In this paper, we provide a
comprehensive and holistic 2D-to-3D perspective to tackle this problem. We
categorize the mainstream and milestone approaches since the year 2014 under
unified frameworks. By systematically summarizing the differences and
connections between these approaches, we further analyze the solutions for
challenging cases, such as the lack of data, the inherent ambiguity between 2D
and 3D, and the complex multi-person scenarios. We also summarize the pose
representation styles, benchmarks, evaluation metrics, and the quantitative
performance of popular approaches. Finally, we discuss the challenges and give
deep thinking of promising directions for future research. We believe this
survey will provide the readers with a deep and insightful understanding of
monocular human pose estimation.
Related papers
- Markerless Multi-view 3D Human Pose Estimation: a survey [0.49157446832511503]
3D human pose estimation aims to reconstruct the human skeleton of all the individuals in a scene by detecting several body joints.
No method is yet capable of solving all the challenges associated with the reconstruction of the 3D pose.
Further research is still required to develop an approach capable of quickly inferring a highly accurate 3D pose with bearable computation cost.
arXiv Detail & Related papers (2024-07-04T10:44:35Z) - Deep Learning-Based Object Pose Estimation: A Comprehensive Survey [73.74933379151419]
We discuss the recent advances in deep learning-based object pose estimation.
Our survey also covers multiple input data modalities, degrees-of-freedom of output poses, object properties, and downstream tasks.
arXiv Detail & Related papers (2024-05-13T14:44:22Z) - Deep learning for 3D human pose estimation and mesh recovery: A survey [6.535833206786788]
We present a review of recent progress over the past five years in deep learning methods for 3D human pose estimation.
To the best of our knowledge, this survey is arguably the first to comprehensively cover deep learning methods for 3D human pose estimation.
arXiv Detail & Related papers (2024-02-29T04:30:39Z) - Recovering 3D Human Mesh from Monocular Images: A Survey [49.00136388529404]
Estimating human pose and shape from monocular images is a long-standing problem in computer vision.
This survey focuses on the task of monocular 3D human mesh recovery.
arXiv Detail & Related papers (2022-03-03T18:56:08Z) - 3D Object Detection from Images for Autonomous Driving: A Survey [68.33502122185813]
3D object detection from images is one of the fundamental and challenging problems in autonomous driving.
More than 200 works have studied this problem from 2015 to 2021, encompassing a broad spectrum of theories, algorithms, and applications.
We provide the first comprehensive survey of this novel and continuously growing research field, summarizing the most commonly used pipelines for image-based 3D detection.
arXiv Detail & Related papers (2022-02-07T07:12:24Z) - A survey of top-down approaches for human pose estimation [0.0]
State-of-the-art methods implemented with Deep Learning have brought remarkable results in the field of human pose estimation.
This paper aims to provide newcomers with an extensive review of deep learning methods-based 2D images for recognizing the pose of people.
arXiv Detail & Related papers (2022-02-05T23:27:46Z) - Deep Learning-Based Human Pose Estimation: A Survey [66.01917727294163]
Human pose estimation has drawn increasing attention during the past decade.
It has been utilized in a wide range of applications including human-computer interaction, motion analysis, augmented reality, and virtual reality.
Recent deep learning-based solutions have achieved high performance in human pose estimation.
arXiv Detail & Related papers (2020-12-24T18:49:06Z) - Monocular Human Pose Estimation: A Survey of Deep Learning-based Methods [25.3614052943568]
Vision-based monocular human pose estimation is one of the most fundamental and challenging problems in computer vision.
The recent developments of deep learning techniques have been brought significant progress and remarkable breakthroughs in the field of human pose estimation.
arXiv Detail & Related papers (2020-06-02T07:07:45Z) - Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the
Wild [101.70320427145388]
We propose a weakly-supervised approach that does not require 3D annotations and learns to estimate 3D poses from unlabeled multi-view data.
We evaluate our proposed approach on two large scale datasets.
arXiv Detail & Related papers (2020-03-17T08:47:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.