LVFace: Progressive Cluster Optimization for Large Vision Models in Face Recognition
- URL: http://arxiv.org/abs/2501.13420v2
- Date: Tue, 25 Mar 2025 03:43:57 GMT
- Title: LVFace: Progressive Cluster Optimization for Large Vision Models in Face Recognition
- Authors: Jinghan You, Shanglin Li, Yuanrui Sun, Jiangchuan Wei, Mingyu Guo, Chao Feng, Jiao Ran,
- Abstract summary: Vision Transformers (ViTs) have revolutionized large-scale visual modeling, yet remain underexplored in face recognition (FR) where CNNs still dominate.<n>We propose LVFace, a ViT-based FR model that integrates Progressive Cluster Optimization (PCO) to achieve superior results.
- Score: 9.067342817048253
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Vision Transformers (ViTs) have revolutionized large-scale visual modeling, yet remain underexplored in face recognition (FR) where CNNs still dominate. We identify a critical bottleneck: CNN-inspired training paradigms fail to unlock ViT's potential, leading to suboptimal performance and convergence instability.To address this challenge, we propose LVFace, a ViT-based FR model that integrates Progressive Cluster Optimization (PCO) to achieve superior results. Specifically, PCO sequentially applies negative class sub-sampling (NCS) for robust and fast feature alignment from random initialization, feature expectation penalties for centroid stabilization, performing cluster boundary refinement through full-batch training without NCS constraints. LVFace establishes a new state-of-the-art face recognition baseline, surpassing leading approaches such as UniFace and TopoFR across multiple benchmarks. Extensive experiments demonstrate that LVFace delivers consistent performance gains, while exhibiting scalability to large-scale datasets and compatibility with mainstream VLMs and LLMs. Notably, LVFace secured 1st place in the ICCV 2021 Masked Face Recognition (MFR)-Ongoing Challenge (March 2025), proving its efficacy in real-world scenarios.
Related papers
- xEdgeFace: Efficient Cross-Spectral Face Recognition for Edge Devices [4.910937238451485]
Heterogeneous Face Recognition (HFR) addresses the challenge of matching face images across different sensing modalities.
We present a lightweight yet effective HFR framework by adapting a hybrid CNN-Transformer architecture.
Our approach enables efficient end-to-end training with minimal paired heterogeneous data while preserving strong performance on standard RGB face recognition tasks.
arXiv Detail & Related papers (2025-04-28T10:03:11Z) - UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition [42.86969216015855]
Face recognition (FR) stands as one of the most crucial applications in computer vision.
We propose a framework to enhance intra-class diversity for synthetic face recognition, shortened as UIFace.
Experiments show that our method significantly surpasses previous approaches with even less training data and half the size of synthetic dataset.
arXiv Detail & Related papers (2025-02-27T06:22:18Z) - Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution [48.34173818491552]
Diffusion Prior Interpolation (DPI) can balance consistency and diversity and can be seamlessly integrated into pre-trained models.
In extensive experiments conducted on synthetic and real datasets, DPI demonstrates superiority over SOTA FSR methods.
arXiv Detail & Related papers (2024-12-21T09:28:44Z) - OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations [21.00631818827379]
Existing open-source models and commercial algorithms lack robustness in certain real-world Out-of-Distribution (OOD) scenarios.<n>We introduce OODFace, which explores the OOD challenges faced by facial recognition models from two perspectives.<n>We conduct extensive experiments on 19 different facial recognition models and 3 commercial APIs, along with extended experiments on face masks.
arXiv Detail & Related papers (2024-12-03T14:42:31Z) - Unleashing Network Potentials for Semantic Scene Completion [50.95486458217653]
This paper proposes a novel SSC framework - Adrial Modality Modulation Network (AMMNet)
AMMNet introduces two core modules: a cross-modal modulation enabling the interdependence of gradient flows between modalities, and a customized adversarial training scheme leveraging dynamic gradient competition.
Extensive experimental results demonstrate that AMMNet outperforms state-of-the-art SSC methods by a large margin.
arXiv Detail & Related papers (2024-03-12T11:48:49Z) - Generalized Face Liveness Detection via De-fake Face Generator [52.23271636362843]
Previous Face Anti-spoofing (FAS) methods face the challenge of generalizing to unseen domains.<n>We propose an Anomalous cue Guided FAS (AG-FAS) method, which can effectively leverage large-scale additional real faces.<n>Our method achieves state-of-the-art results under cross-domain evaluations with unseen scenarios and unknown presentation attacks.
arXiv Detail & Related papers (2024-01-17T06:59:32Z) - Towards Large-scale Masked Face Recognition [44.380235958577785]
During the COVID-19 coronavirus epidemic, almost everyone is wearing masks, which poses a huge challenge for deep learning-based face recognition algorithms.
In this paper, we will present our textbfchampionship solutions in ICCV MFR WebFace260M and InsightFace unconstrained tracks.
arXiv Detail & Related papers (2023-10-25T05:04:47Z) - Recognizability Embedding Enhancement for Very Low-Resolution Face
Recognition and Quality Estimation [21.423956631978186]
We study principled approaches to elevate the recognizability of a face in the embedding space instead of the visual quality.
We first formulate a robust learning-based face recognizability measure, namely recognizability index (RI)
We then devise an index diversion loss to push the hard-to-recognize face embedding with low RI away from unrecognizable faces cluster to boost the RI, which reflects better recognizability.
arXiv Detail & Related papers (2023-04-20T03:18:03Z) - A Survey of Deep Face Restoration: Denoise, Super-Resolution, Deblur,
Artifact Removal [177.21001709272144]
Face Restoration (FR) aims to restore High-Quality (HQ) faces from Low-Quality (LQ) input images.
This paper comprehensively surveys recent advances in deep learning techniques for face restoration.
arXiv Detail & Related papers (2022-11-05T07:08:15Z) - Modeling biological face recognition with deep convolutional neural
networks [0.0]
Deep convolutional neural networks (DCNNs) have become the state-of-the-art computational models of biological object recognition.
Recent efforts have started to transfer this achievement to research on biological face recognition.
In this review, we summarize the first studies that use DCNNs to model biological face recognition.
arXiv Detail & Related papers (2022-08-13T16:45:30Z) - Blind Face Restoration: Benchmark Datasets and a Baseline Model [63.053331687284064]
Blind Face Restoration (BFR) aims to construct a high-quality (HQ) face image from its corresponding low-quality (LQ) input.
We first synthesize two blind face restoration benchmark datasets called EDFace-Celeb-1M (BFR128) and EDFace-Celeb-150K (BFR512)
State-of-the-art methods are benchmarked on them under five settings including blur, noise, low resolution, JPEG compression artifacts, and the combination of them (full degradation)
arXiv Detail & Related papers (2022-06-08T06:34:24Z) - SFace: Sigmoid-Constrained Hypersphere Loss for Robust Face Recognition [74.13631562652836]
We propose a novel loss function, named sigmoid-constrained hypersphere loss (SFace)
SFace imposes intra-class and inter-class constraints on a hypersphere manifold, which are controlled by two sigmoid gradient re-scale functions respectively.
It can make a better balance between decreasing the intra-class distances and preventing overfitting to the label noise, and contributes more robust deep face recognition models.
arXiv Detail & Related papers (2022-05-24T11:54:15Z) - On the (Limited) Generalization of MasterFace Attacks and Its Relation
to the Capacity of Face Representations [11.924504853735645]
We study the generalizability of MasterFace attacks in empirical and theoretical investigations.
We estimate the face capacity and the maximum MasterFace coverage under the assumption that identities in the face space are well separated.
We conclude that MasterFaces should not be seen as a threat to face recognition systems but as a tool to enhance the robustness of face recognition models.
arXiv Detail & Related papers (2022-03-23T13:02:41Z) - SphereFace Revived: Unifying Hyperspherical Face Recognition [57.07058009281208]
We introduce a unified framework to understand large angular margin in hyperspherical face recognition.
Under this framework, we propose an improved variant with substantially better training stability -- SphereFace-R.
We show that SphereFace-R is consistently better than or competitive with state-of-the-art methods.
arXiv Detail & Related papers (2021-09-12T17:07:54Z) - SynFace: Face Recognition with Synthetic Data [83.15838126703719]
We devise the SynFace with identity mixup (IM) and domain mixup (DM) to mitigate the performance gap.
We also perform a systematically empirical analysis on synthetic face images to provide some insights on how to effectively utilize synthetic data for face recognition.
arXiv Detail & Related papers (2021-08-18T03:41:54Z) - Joint Face Image Restoration and Frontalization for Recognition [79.78729632975744]
In real-world scenarios, many factors may harm face recognition performance, e.g., large pose, bad illumination,low resolution, blur and noise.
Previous efforts usually first restore the low-quality faces to high-quality ones and then perform face recognition.
We propose an Multi-Degradation Face Restoration model to restore frontalized high-quality faces from the given low-quality ones.
arXiv Detail & Related papers (2021-05-12T03:52:41Z) - FaceX-Zoo: A PyTorch Toolbox for Face Recognition [62.038018324643325]
We introduce a novel open-source framework, named FaceX-Zoo, which is oriented to the research-development community of face recognition.
FaceX-Zoo provides a training module with various supervisory heads and backbones towards state-of-the-art face recognition.
A simple yet fully functional face SDK is provided for the validation and primary application of the trained models.
arXiv Detail & Related papers (2021-01-12T11:06:50Z) - SuperFront: From Low-resolution to High-resolution Frontal Face
Synthesis [65.35922024067551]
We propose a generative adversarial network (GAN) -based model to generate high-quality, identity preserving frontal faces.
Specifically, we propose SuperFront-GAN to synthesize a high-resolution (HR), frontal face from one-to-many LR faces with various poses.
We integrate a super-resolution side-view module into SF-GAN to preserve identity information and fine details of the side-views in HR space.
arXiv Detail & Related papers (2020-12-07T23:30:28Z) - Facial Expressions as a Vulnerability in Face Recognition [73.85525896663371]
This work explores facial expression bias as a security vulnerability of face recognition systems.
We present a comprehensive analysis of how facial expression bias impacts the performance of face recognition technologies.
arXiv Detail & Related papers (2020-11-17T18:12:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.