Differentiable Hierarchical Graph Grouping for Multi-Person Pose
Estimation
- URL: http://arxiv.org/abs/2007.11864v1
- Date: Thu, 23 Jul 2020 08:46:22 GMT
- Title: Differentiable Hierarchical Graph Grouping for Multi-Person Pose
Estimation
- Authors: Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian, Wanli Ouyang,
Ping Luo
- Abstract summary: Multi-person pose estimation is challenging because it localizes body keypoints for multiple persons simultaneously.
We propose a novel differentiable Hierarchical Graph Grouping (HGG) method to learn the graph grouping in bottom-up multi-person pose estimation task.
- Score: 95.72606536493548
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-person pose estimation is challenging because it localizes body
keypoints for multiple persons simultaneously. Previous methods can be divided
into two streams, i.e. top-down and bottom-up methods. The top-down methods
localize keypoints after human detection, while the bottom-up methods localize
keypoints directly and then cluster/group them for different persons, which are
generally more efficient than top-down methods. However, in existing bottom-up
methods, the keypoint grouping is usually solved independently from keypoint
detection, making them not end-to-end trainable and have sub-optimal
performance. In this paper, we investigate a new perspective of human part
grouping and reformulate it as a graph clustering task. Especially, we propose
a novel differentiable Hierarchical Graph Grouping (HGG) method to learn the
graph grouping in bottom-up multi-person pose estimation task. Moreover, HGG is
easily embedded into main-stream bottom-up methods. It takes human keypoint
candidates as graph nodes and clusters keypoints in a multi-layer graph neural
network model. The modules of HGG can be trained end-to-end with the keypoint
detection network and is able to supervise the grouping process in a
hierarchical manner. To improve the discrimination of the clustering, we add a
set of edge discriminators and macro-node discriminators. Extensive experiments
on both COCO and OCHuman datasets demonstrate that the proposed method improves
the performance of bottom-up pose estimation methods.
Related papers
- Reinforcement Graph Clustering with Unknown Cluster Number [91.4861135742095]
We propose a new deep graph clustering method termed Reinforcement Graph Clustering.
In our proposed method, cluster number determination and unsupervised representation learning are unified into a uniform framework.
In order to conduct feedback actions, the clustering-oriented reward function is proposed to enhance the cohesion of the same clusters and separate the different clusters.
arXiv Detail & Related papers (2023-08-13T18:12:28Z) - DeepCut: Unsupervised Segmentation using Graph Neural Networks
Clustering [6.447863458841379]
This study introduces a lightweight Graph Neural Network (GNN) to replace classical clustering methods.
Unlike existing methods, our GNN takes both the pair-wise affinities between local image features and the raw features as input.
We demonstrate how classical clustering objectives can be formulated as self-supervised loss functions for training an image segmentation GNN.
arXiv Detail & Related papers (2022-12-12T12:31:46Z) - Learning Heuristics for the Maximum Clique Enumeration Problem Using Low
Dimensional Representations [0.0]
We use a learning framework for a pruning process of the input graph towards reducing the clique of the maximum enumeration problem.
We study the role of using different vertex representations on the performance of this runtime method.
We observe that using local graph features in the classification process produce more accurate results when combined with a feature elimination process.
arXiv Detail & Related papers (2022-10-30T22:04:32Z) - Learning Spatial Context with Graph Neural Network for Multi-Person Pose
Grouping [71.59494156155309]
Bottom-up approaches for image-based multi-person pose estimation consist of two stages: keypoint detection and grouping.
In this work, we formulate the grouping task as a graph partitioning problem, where we learn the affinity matrix with a Graph Neural Network (GNN)
The learned geometry-based affinity is further fused with appearance-based affinity to achieve robust keypoint association.
arXiv Detail & Related papers (2021-04-06T09:21:14Z) - Differentiable Multi-Granularity Human Representation Learning for
Instance-Aware Human Semantic Parsing [131.97475877877608]
A new bottom-up regime is proposed to learn category-level human semantic segmentation and multi-person pose estimation in a joint and end-to-end manner.
It is a compact, efficient and powerful framework that exploits structural information over different human granularities.
Experiments on three instance-aware human datasets show that our model outperforms other bottom-up alternatives with much more efficient inference.
arXiv Detail & Related papers (2021-03-08T06:55:00Z) - Structured Graph Learning for Scalable Subspace Clustering: From
Single-view to Multi-view [28.779909990410978]
Graph-based subspace clustering methods have exhibited promising performance.
They still suffer some of these drawbacks: encounter the expensive time overhead, fail in exploring the explicit clusters, and cannot generalize to unseen data points.
We propose a scalable graph learning framework, seeking to address the above three challenges simultaneously.
arXiv Detail & Related papers (2021-02-16T03:46:11Z) - Structured Graph Learning for Clustering and Semi-supervised
Classification [74.35376212789132]
We propose a graph learning framework to preserve both the local and global structure of data.
Our method uses the self-expressiveness of samples to capture the global structure and adaptive neighbor approach to respect the local structure.
Our model is equivalent to a combination of kernel k-means and k-means methods under certain condition.
arXiv Detail & Related papers (2020-08-31T08:41:20Z) - Learning to Cluster Faces via Confidence and Connectivity Estimation [136.5291151775236]
We propose a fully learnable clustering framework without requiring a large number of overlapped subgraphs.
Our method significantly improves clustering accuracy and thus performance of the recognition models trained on top, yet it is an order of magnitude more efficient than existing supervised methods.
arXiv Detail & Related papers (2020-04-01T13:39:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.