2D Human Pose Estimation: A Survey
- URL: http://arxiv.org/abs/2204.07370v1
- Date: Fri, 15 Apr 2022 08:09:43 GMT
- Title: 2D Human Pose Estimation: A Survey
- Authors: Haoming Chen, Runyang Feng, Sifan Wu, Hao Xu, Fengcheng Zhou,
Zhenguang Liu
- Abstract summary: Human pose estimation aims at localizing human anatomical keypoints or body parts in the input data.
Deep learning techniques allow learning feature representations directly from the data.
In this paper, we reap the recent achievements of 2D human pose estimation methods and present a comprehensive survey.
- Score: 16.56050212383859
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Human pose estimation aims at localizing human anatomical keypoints or body
parts in the input data (e.g., images, videos, or signals). It forms a crucial
component in enabling machines to have an insightful understanding of the
behaviors of humans, and has become a salient problem in computer vision and
related fields. Deep learning techniques allow learning feature representations
directly from the data, significantly pushing the performance boundary of human
pose estimation. In this paper, we reap the recent achievements of 2D human
pose estimation methods and present a comprehensive survey. Briefly, existing
approaches put their efforts in three directions, namely network architecture
design, network training refinement, and post processing. Network architecture
design looks at the architecture of human pose estimation models, extracting
more robust features for keypoint recognition and localization. Network
training refinement tap into the training of neural networks and aims to
improve the representational ability of models. Post processing further
incorporates model-agnostic polishing strategies to improve the performance of
keypoint detection. More than 200 research contributions are involved in this
survey, covering methodological frameworks, common benchmark datasets,
evaluation metrics, and performance comparisons. We seek to provide researchers
with a more comprehensive and systematic review on human pose estimation,
allowing them to acquire a grand panorama and better identify future
directions.
Related papers
- 3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information [2.457872341625575]
A novel Semantic Graph Attention Network can benefit from the ability of self-attention to capture global context.
A Body Part Decoder assists in extracting and refining the information related to specific segments of the body.
A Geometry Loss makes a critical constraint on the structural skeleton of the body, ensuring that the model's predictions adhere to the natural limits of human posture.
arXiv Detail & Related papers (2024-06-03T10:59:00Z) - Data Augmentation in Human-Centric Vision [54.97327269866757]
This survey presents a comprehensive analysis of data augmentation techniques in human-centric vision tasks.
It delves into a wide range of research areas including person ReID, human parsing, human pose estimation, and pedestrian detection.
Our work categorizes data augmentation methods into two main types: data generation and data perturbation.
arXiv Detail & Related papers (2024-03-13T16:05:18Z) - Human as Points: Explicit Point-based 3D Human Reconstruction from
Single-view RGB Images [78.56114271538061]
We introduce an explicit point-based human reconstruction framework called HaP.
Our approach is featured by fully-explicit point cloud estimation, manipulation, generation, and refinement in the 3D geometric space.
Our results may indicate a paradigm rollback to the fully-explicit and geometry-centric algorithm design.
arXiv Detail & Related papers (2023-11-06T05:52:29Z) - Computation-efficient Deep Learning for Computer Vision: A Survey [121.84121397440337]
Deep learning models have reached or even exceeded human-level performance in a range of visual perception tasks.
Deep learning models usually demand significant computational resources, leading to impractical power consumption, latency, or carbon emissions in real-world scenarios.
New research focus is computationally efficient deep learning, which strives to achieve satisfactory performance while minimizing the computational cost during inference.
arXiv Detail & Related papers (2023-08-27T03:55:28Z) - Deep Learning for Human Parsing: A Survey [54.812353922568995]
We provide an analysis of state-of-the-art human parsing methods, covering a broad spectrum of pioneering works for semantic human parsing.
We introduce five insightful categories: (1) structure-driven architectures exploit the relationship of different human parts and the inherent hierarchical structure of a human body, (2) graph-based networks capture the global information to achieve an efficient and complete human body analysis, (3) context-aware networks explore useful contexts across all pixel to characterize a pixel of the corresponding class, and (4) LSTM-based methods can combine short-distance and long-distance spatial dependencies to better exploit abundant local and global contexts.
arXiv Detail & Related papers (2023-01-29T10:54:56Z) - A survey of top-down approaches for human pose estimation [0.0]
State-of-the-art methods implemented with Deep Learning have brought remarkable results in the field of human pose estimation.
This paper aims to provide newcomers with an extensive review of deep learning methods-based 2D images for recognizing the pose of people.
arXiv Detail & Related papers (2022-02-05T23:27:46Z) - Single Person Pose Estimation: A Survey [45.144269986277365]
Human pose estimation in unconstrained images and videos is a fundamental computer vision task.
We summarize representative human pose methods in a structured taxonomy, with a particular focus on deep learning models and single-person image setting.
We examine and survey all the components of a typical human pose estimation pipeline, including data augmentation, model architecture and backbone.
arXiv Detail & Related papers (2021-09-21T09:53:15Z) - Recent Advances in Monocular 2D and 3D Human Pose Estimation: A Deep
Learning Perspective [69.44384540002358]
We provide a comprehensive and holistic 2D-to-3D perspective to tackle this problem.
We categorize the mainstream and milestone approaches since the year 2014 under unified frameworks.
We also summarize the pose representation styles, benchmarks, evaluation metrics, and the quantitative performance of popular approaches.
arXiv Detail & Related papers (2021-04-23T11:07:07Z) - Deep Learning-Based Human Pose Estimation: A Survey [66.01917727294163]
Human pose estimation has drawn increasing attention during the past decade.
It has been utilized in a wide range of applications including human-computer interaction, motion analysis, augmented reality, and virtual reality.
Recent deep learning-based solutions have achieved high performance in human pose estimation.
arXiv Detail & Related papers (2020-12-24T18:49:06Z) - Monocular Human Pose Estimation: A Survey of Deep Learning-based Methods [25.3614052943568]
Vision-based monocular human pose estimation is one of the most fundamental and challenging problems in computer vision.
The recent developments of deep learning techniques have been brought significant progress and remarkable breakthroughs in the field of human pose estimation.
arXiv Detail & Related papers (2020-06-02T07:07:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.