Learning to Cluster Faces via Transformer
- URL: http://arxiv.org/abs/2104.11502v1
- Date: Fri, 23 Apr 2021 09:43:36 GMT
- Title: Learning to Cluster Faces via Transformer
- Authors: Jinxing Ye, Xioajiang Peng, Baigui Sun, Kai Wang, Xiuyu Sun, Hao Li,
Hanqing Wu
- Abstract summary: Face clustering is a useful tool for applications like automatic face annotation and retrieval.
Traditional clustering methods ignore the relationship between individual images and their neighbors.
We introduce a Face Transformer for supervised face clustering.
- Score: 8.285052859942443
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Face clustering is a useful tool for applications like automatic face
annotation and retrieval. The main challenge is that it is difficult to cluster
images from the same identity with different face poses, occlusions, and image
quality. Traditional clustering methods usually ignore the relationship between
individual images and their neighbors which may contain useful context
information. In this paper, we repurpose the well-known Transformer and
introduce a Face Transformer for supervised face clustering. In Face
Transformer, we decompose the face clustering into two steps: relation encoding
and linkage predicting. Specifically, given a face image, a \textbf{relation
encoder} module aggregates local context information from its neighbors and a
\textbf{linkage predictor} module judges whether a pair of images belong to the
same cluster or not. In the local linkage graph view, Face Transformer can
generate more robust node and edge representations compared to existing
methods. Experiments on both MS-Celeb-1M and DeepFashion show that our method
achieves state-of-the-art performance, e.g., 91.12\% in pairwise F-score on
MS-Celeb-1M.
Related papers
- Optimal-Landmark-Guided Image Blending for Face Morphing Attacks [8.024953195407502]
We propose a novel approach for conducting face morphing attacks, which utilizes optimal-landmark-guided image blending.
Our proposed method overcomes the limitations of previous approaches by optimizing the morphing landmarks and using Graph Convolutional Networks (GCNs) to combine landmark and appearance features.
arXiv Detail & Related papers (2024-01-30T03:45:06Z) - MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner
for Open-World Semantic Segmentation [110.09800389100599]
We propose MixReorg, a novel and straightforward pre-training paradigm for semantic segmentation.
Our approach involves generating fine-grained patch-text pairs data by mixing image patches while preserving the correspondence between patches and text.
With MixReorg as a mask learner, conventional text-supervised semantic segmentation models can achieve highly generalizable pixel-semantic alignment ability.
arXiv Detail & Related papers (2023-08-09T09:35:16Z) - Improving Masked Autoencoders by Learning Where to Mask [65.89510231743692]
Masked image modeling is a promising self-supervised learning method for visual data.
We present AutoMAE, a framework that uses Gumbel-Softmax to interlink an adversarially-trained mask generator and a mask-guided image modeling process.
In our experiments, AutoMAE is shown to provide effective pretraining models on standard self-supervised benchmarks and downstream tasks.
arXiv Detail & Related papers (2023-03-12T05:28:55Z) - MorphGANFormer: Transformer-based Face Morphing and De-Morphing [55.211984079735196]
StyleGAN-based approaches to face morphing are among the leading techniques.
We propose a transformer-based alternative to face morphing and demonstrate its superiority to StyleGAN-based methods.
arXiv Detail & Related papers (2023-02-18T19:09:11Z) - Self-supervised Video-centralised Transformer for Video Face Clustering [58.12996668434134]
This paper presents a novel method for face clustering in videos using a video-centralised transformer.
We release the first large-scale egocentric video face clustering dataset named EasyCom-Clustering.
arXiv Detail & Related papers (2022-03-24T16:38:54Z) - Learning Disentangled Representation for One-shot Progressive Face
Swapping [65.98684203654908]
We present a simple yet efficient method named FaceSwapper, for one-shot face swapping based on Generative Adversarial Networks.
Our method consists of a disentangled representation module and a semantic-guided fusion module.
Our results show that our method achieves state-of-the-art results on benchmark with fewer training samples.
arXiv Detail & Related papers (2022-03-24T11:19:04Z) - FaceMap: Towards Unsupervised Face Clustering via Map Equation [45.72096482531776]
Face clustering is an essential task in computer vision due to the explosion of related applications such as augmented reality or photo album management.
We develop an effective unsupervised method, named as FaceMap, by formulating face clustering as a process of non-overlapping community detection.
Experiments with ablation studies demonstrate that FaceMap significantly outperforms existing methods.
arXiv Detail & Related papers (2022-03-21T03:23:09Z) - GANSeg: Learning to Segment by Unsupervised Hierarchical Image
Generation [16.900404701997502]
We propose a GAN-based approach that generates images conditioned on latent masks.
We show that such mask-conditioned image generation can be learned faithfully when conditioning the masks in a hierarchical manner.
It also lets us generate image-mask pairs for training a segmentation network, which outperforms the state-of-the-art unsupervised segmentation methods on established benchmarks.
arXiv Detail & Related papers (2021-12-02T07:57:56Z) - Fine-grained Image-to-Image Transformation towards Visual Recognition [102.51124181873101]
We aim at transforming an image with a fine-grained category to synthesize new images that preserve the identity of the input image.
We adopt a model based on generative adversarial networks to disentangle the identity related and unrelated factors of an image.
Experiments on the CompCars and Multi-PIE datasets demonstrate that our model preserves the identity of the generated images much better than the state-of-the-art image-to-image transformation models.
arXiv Detail & Related papers (2020-01-12T05:26:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.