Graph Jigsaw Learning for Cartoon Face Recognition
- URL: http://arxiv.org/abs/2107.06532v1
- Date: Wed, 14 Jul 2021 08:01:06 GMT
- Title: Graph Jigsaw Learning for Cartoon Face Recognition
- Authors: Yong Li, Lingjie Lao, Zhen Cui, Shiguang Shan, Jian Yang
- Abstract summary: It is difficult to learn a shape-oriented representation for cartoon face recognition with convolutional neural networks (CNNs)
We propose the GraphJigsaw that constructs jigsaw puzzles at various stages in the classification network and solves the puzzles with the graph convolutional network (GCN) in a progressive manner.
Our proposed GraphJigsaw consistently outperforms other face recognition or jigsaw-based methods on two popular cartoon face datasets.
- Score: 79.29656077338828
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Cartoon face recognition is challenging as they typically have smooth color
regions and emphasized edges, the key to recognize cartoon faces is to
precisely perceive their sparse and critical shape patterns. However, it is
quite difficult to learn a shape-oriented representation for cartoon face
recognition with convolutional neural networks (CNNs). To mitigate this issue,
we propose the GraphJigsaw that constructs jigsaw puzzles at various stages in
the classification network and solves the puzzles with the graph convolutional
network (GCN) in a progressive manner. Solving the puzzles requires the model
to spot the shape patterns of the cartoon faces as the texture information is
quite limited. The key idea of GraphJigsaw is constructing a jigsaw puzzle by
randomly shuffling the intermediate convolutional feature maps in the spatial
dimension and exploiting the GCN to reason and recover the correct layout of
the jigsaw fragments in a self-supervised manner. The proposed GraphJigsaw
avoids training the classification model with the deconstructed images that
would introduce noisy patterns and are harmful for the final classification.
Specially, GraphJigsaw can be incorporated at various stages in a top-down
manner within the classification model, which facilitates propagating the
learned shape patterns gradually. GraphJigsaw does not rely on any extra manual
annotation during the training process and incorporates no extra computation
burden at inference time. Both quantitative and qualitative experimental
results have verified the feasibility of our proposed GraphJigsaw, which
consistently outperforms other face recognition or jigsaw-based methods on two
popular cartoon face datasets with considerable improvements.
Related papers
- Patch-Based Deep Unsupervised Image Segmentation using Graph Cuts [0.0]
We propose a patch-based unsupervised image segmentation strategy that bridges advances in unsupervised feature extraction with the algorithmic help of classical graph-based methods.
We show that a simple convolutional neural network, trained to classify image patches, naturally leads to a state-of-the-art fully-convolutional unsupervised pixel-level segmenter.
arXiv Detail & Related papers (2023-11-01T19:59:25Z) - G-MSM: Unsupervised Multi-Shape Matching with Graph-based Affinity
Priors [52.646396621449]
G-MSM is a novel unsupervised learning approach for non-rigid shape correspondence.
We construct an affinity graph on a given set of training shapes in a self-supervised manner.
We demonstrate state-of-the-art performance on several recent shape correspondence benchmarks.
arXiv Detail & Related papers (2022-12-06T12:09:24Z) - Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw
Puzzles [67.39567701983357]
Video Anomaly Detection (VAD) is an important topic in computer vision.
Motivated by the recent advances in self-supervised learning, this paper addresses VAD by solving an intuitive yet challenging pretext task.
Our method outperforms state-of-the-art counterparts on three public benchmarks.
arXiv Detail & Related papers (2022-07-20T19:49:32Z) - SSR-GNNs: Stroke-based Sketch Representation with Graph Neural Networks [34.759306840182205]
This paper investigates a graph representation for sketches, where the information of strokes, i.e., parts of a sketch, are encoded on vertices and information of inter-stroke on edges.
The resultant graph representation facilitates the training of a Graph Neural Networks for classification tasks.
The proposed representation enables generation of novel sketches that are structurally similar to while separable from the existing dataset.
arXiv Detail & Related papers (2022-04-27T19:18:01Z) - Graph Neural Networks for Image Classification and Reinforcement
Learning using Graph representations [15.256931959393803]
We will evaluate the performance of graph neural networks in two distinct domains: computer vision and reinforcement learning.
In the computer vision section, we seek to learn whether a novel non-redundant representation for images as graphs can improve performance over trivial pixel to node mapping on a graph-level prediction graph, specifically image classification.
For the reinforcement learning section, we seek to learn if explicitly modeling solving a Rubik's cube as a graph problem can improve performance over a standard model-free technique with no inductive bias.
arXiv Detail & Related papers (2022-03-07T15:16:31Z) - Learning Skeletal Articulations with Neural Blend Shapes [57.879030623284216]
We develop a neural technique for articulating 3D characters using enveloping with a pre-defined skeletal structure.
Our framework learns to rig and skin characters with the same articulation structure.
We propose neural blend shapes which improve the deformation quality in the joint regions.
arXiv Detail & Related papers (2021-05-06T05:58:13Z) - Joint Deep Multi-Graph Matching and 3D Geometry Learning from
Inhomogeneous 2D Image Collections [57.60094385551773]
We propose a trainable framework for learning a deformable 3D geometry model from inhomogeneous image collections.
We in addition obtain the underlying 3D geometry of the objects depicted in the 2D images.
arXiv Detail & Related papers (2021-03-31T17:25:36Z) - Learning to Caricature via Semantic Shape Transform [95.25116681761142]
We propose an algorithm based on a semantic shape transform to produce shape exaggerations.
We show that the proposed framework is able to render visually pleasing shape exaggerations while maintaining their facial structures.
arXiv Detail & Related papers (2020-08-12T03:41:49Z) - Deep Generation of Face Images from Sketches [36.146494762987146]
Deep image-to-image translation techniques allow fast generation of face images from freehand sketches.
Existing solutions tend to overfit to sketches, thus requiring professional sketches or even edge maps as input.
We propose to implicitly model the shape space of plausible face images and synthesize a face image in this space to approximate an input sketch.
Our method essentially uses input sketches as soft constraints and is thus able to produce high-quality face images even from rough and/or incomplete sketches.
arXiv Detail & Related papers (2020-06-01T16:20:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.