Correlating Edge, Pose with Parsing
- URL: http://arxiv.org/abs/2005.01431v1
- Date: Mon, 4 May 2020 12:39:13 GMT
- Title: Correlating Edge, Pose with Parsing
- Authors: Ziwei Zhang, Chi Su, Liang Zheng, Xiaodong Xie
- Abstract summary: This paper studies how human semantic boundaries and keypoint locations can jointly improve human parsing.
We propose a Correlation Parsing Machine (CorrPM) employing a heterogeneous non-local block to discover the spatial affinity among feature maps from the edge, pose and parsing.
- Score: 35.27973063976257
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: According to existing studies, human body edge and pose are two beneficial
factors to human parsing. The effectiveness of each of the high-level features
(edge and pose) is confirmed through the concatenation of their features with
the parsing features. Driven by the insights, this paper studies how human
semantic boundaries and keypoint locations can jointly improve human parsing.
Compared with the existing practice of feature concatenation, we find that
uncovering the correlation among the three factors is a superior way of
leveraging the pivotal contextual cues provided by edges and poses. To capture
such correlations, we propose a Correlation Parsing Machine (CorrPM) employing
a heterogeneous non-local block to discover the spatial affinity among feature
maps from the edge, pose and parsing. The proposed CorrPM allows us to report
new state-of-the-art accuracy on three human parsing datasets. Importantly,
comparative studies confirm the advantages of feature correlation over the
concatenation.
Related papers
- Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents [16.78371134590167]
Key-value relations are prevalent in Visually-Rich Documents (VRDs)
These non-textual cues serve as important indicators that greatly enhance human comprehension and acquisition of such relation triplets.
Our research focuses on few-shot relational learning, specifically targeting the extraction of key-value relation triplets in VRDs.
arXiv Detail & Related papers (2024-03-23T08:40:35Z) - Exploring Self- and Cross-Triplet Correlations for Human-Object
Interaction Detection [38.86053346974547]
We propose to explore Self- and Cross-Triplet Correlations for HOI detection.
Specifically, we regard each triplet proposal as a graph where Human, Object represent nodes and Action indicates edge.
Also, we try to explore cross-triplet dependencies by jointly considering instance-level, semantic-level, and layout-level relations.
arXiv Detail & Related papers (2024-01-11T05:38:24Z) - Learning Complete Topology-Aware Correlations Between Relations for Inductive Link Prediction [121.65152276851619]
We show that semantic correlations between relations are inherently edge-level and entity-independent.
We propose a novel subgraph-based method, namely TACO, to model Topology-Aware COrrelations between relations.
To further exploit the potential of RCN, we propose Complete Common Neighbor induced subgraph.
arXiv Detail & Related papers (2023-09-20T08:11:58Z) - CIParsing: Unifying Causality Properties into Multiple Human Parsing [82.32620538918812]
Existing methods of multiple human parsing (MHP) apply statistical models to acquire underlying associations between images and labeled body parts.
We present a causality inspired parsing paradigm termed CIParsing, which follows fundamental causal principles involving two causal properties for human parsing.
The CIParsing is designed in a plug-and-play fashion and can be integrated into any existing MHP models.
arXiv Detail & Related papers (2023-08-23T15:56:26Z) - Best Practices for 2-Body Pose Forecasting [58.661899246497896]
We review the progress in human pose forecasting and provide an in-depth assessment of the single-person practices that perform best.
Other single-person practices do not transfer to 2-body, so the proposed best ones do not include hierarchical body modeling or attention-based interaction encoding.
Our proposed 2-body pose forecasting best practices yield a performance improvement of 21.9% over the state-of-the-art on the most recent ExPI dataset.
arXiv Detail & Related papers (2023-04-12T10:46:23Z) - Extracting all Aspect-polarity Pairs Jointly in a Text with Relation
Extraction Approach [6.844982778392037]
We propose to generate aspect-polarity pairs directly from a text with relation extraction technology.
We present a position- and aspect-aware sequence2sequence model for joint extraction of aspect-polarity pairs.
arXiv Detail & Related papers (2021-09-01T09:00:39Z) - DRG: Dual Relation Graph for Human-Object Interaction Detection [65.50707710054141]
We tackle the challenging problem of human-object interaction (HOI) detection.
Existing methods either recognize the interaction of each human-object pair in isolation or perform joint inference based on complex appearance-based features.
In this paper, we leverage an abstract spatial-semantic representation to describe each human-object pair and aggregate the contextual information of the scene via a dual relation graph.
arXiv Detail & Related papers (2020-08-26T17:59:40Z) - Hierarchical Human Parsing with Typed Part-Relation Reasoning [179.64978033077222]
How to model human structures is the central theme in this task.
We seek to simultaneously exploit the representational capacity of deep graph networks and the hierarchical human structures.
arXiv Detail & Related papers (2020-03-10T16:45:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.