HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing
Capsule Network
- URL: http://arxiv.org/abs/2203.10699v1
- Date: Mon, 21 Mar 2022 01:39:41 GMT
- Title: HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing
Capsule Network
- Authors: Chang Yu, Xiangyu Zhu, Xiaomei Zhang, Zidu Wang, Zhaoxiang Zhang, Zhen
Lei
- Abstract summary: We propose a Hierarchical Parsing Capsule Network (HP-Capsule) for unsupervised face subpart-part discovery.
HP-Capsule extends the application of capsule networks from digits to human faces and takes a step forward to show how the neural networks understand objects without human intervention.
- Score: 76.92310948325847
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Capsule networks are designed to present the objects by a set of parts and
their relationships, which provide an insight into the procedure of visual
perception. Although recent works have shown the success of capsule networks on
simple objects like digits, the human faces with homologous structures, which
are suitable for capsules to describe, have not been explored. In this paper,
we propose a Hierarchical Parsing Capsule Network (HP-Capsule) for unsupervised
face subpart-part discovery. When browsing large-scale face images without
labels, the network first encodes the frequently observed patterns with a set
of explainable subpart capsules. Then, the subpart capsules are assembled into
part-level capsules through a Transformer-based Parsing Module (TPM) to learn
the compositional relations between them. During training, as the face
hierarchy is progressively built and refined, the part capsules adaptively
encode the face parts with semantic consistency. HP-Capsule extends the
application of capsule networks from digits to human faces and takes a step
forward to show how the neural networks understand homologous objects without
human intervention. Besides, HP-Capsule gives unsupervised face segmentation
results by the covered regions of part capsules, enabling qualitative and
quantitative evaluation. Experiments on BP4D and Multi-PIE datasets show the
effectiveness of our method.
Related papers
- ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis [6.273401483558281]
This paper introduces a novel capsule network, ParseCaps, which utilizes the sparse axial attention routing and parse convolutional capsule layer to form a parse-tree-like structure.
Experimental results on CE-MRI, PH$2$, and Derm7pt datasets show that ParseCaps not only outperforms other capsule network variants in classification accuracy, redundancy reduction and robustness, but also provides interpretable explanations.
arXiv Detail & Related papers (2024-11-03T13:34:31Z) - Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection [98.6460229237143]
We propose a novel mamba capsule routing at the type level.
These type-level mamba capsules are fed into the EM routing algorithm to get the high-layer mamba capsules.
On top of that, to retrieve the pixel-level capsule features for further camouflaged prediction, we achieve this on the basis of the low-layer pixel-level capsules.
arXiv Detail & Related papers (2024-10-05T00:20:22Z) - Hierarchical Object-Centric Learning with Capsule Networks [0.0]
Capsule networks (CapsNets) were introduced to address convolutional neural networks limitations.
This thesis investigates the intriguing aspects of CapsNets and focuses on three key questions to unlock their full potential.
arXiv Detail & Related papers (2024-05-30T09:10:33Z) - Deep multi-prototype capsule networks [0.3823356975862005]
Capsule networks are a type of neural network that identify image parts and form the instantiation parameters of a whole hierarchically.
This paper presents a multi-prototype architecture for guiding capsule networks to represent the variations in the image parts.
The experimental results on MNIST, SVHN, C-Cube, CEDAR, MCYT, and UTSig datasets reveal that the proposed model outperforms others regarding image classification accuracy.
arXiv Detail & Related papers (2024-04-23T18:37:37Z) - Graphics Capsule: Learning Hierarchical 3D Face Representations from 2D
Images [82.5266467869448]
We propose an Inverse Graphics Capsule Network (IGC-Net) to learn the hierarchical 3D face representations from large-scale unlabeled images.
IGC-Net first decomposes the objects into a set of semantic-consistent part-level descriptions and then assembles them into object-level descriptions to build the hierarchy.
arXiv Detail & Related papers (2023-03-20T06:32:55Z) - Learning with Capsules: A Survey [73.31150426300198]
Capsule networks were proposed as an alternative approach to Convolutional Neural Networks (CNNs) for learning object-centric representations.
Unlike CNNs, capsule networks are designed to explicitly model part-whole hierarchical relationships.
arXiv Detail & Related papers (2022-06-06T15:05:36Z) - ASPCNet: A Deep Adaptive Spatial Pattern Capsule Network for
Hyperspectral Image Classification [47.541691093680406]
This paper proposes an adaptive spatial pattern capsule network (ASPCNet) architecture.
It can rotate the sampling location of convolutional kernels on the basis of an enlarged receptive field.
Experiments on three public datasets demonstrate that ASPCNet can yield competitive performance with higher accuracies than state-of-the-art methods.
arXiv Detail & Related papers (2021-04-25T07:10:55Z) - Unsupervised part representation by Flow Capsules [43.2934619818695]
We propose a way to learn primary capsule encoders that detect atomic parts from a single image.
We exploit motion as a powerful perceptual cue for part definition, with an expressive decoder for part generation within a layered image model.
arXiv Detail & Related papers (2020-11-27T18:59:42Z) - Subspace Capsule Network [85.69796543499021]
SubSpace Capsule Network (SCN) exploits the idea of capsule networks to model possible variations in the appearance or implicitly defined properties of an entity.
SCN can be applied to both discriminative and generative models without incurring computational overhead compared to CNN during test time.
arXiv Detail & Related papers (2020-02-07T17:51:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.