Teacher-Student Asynchronous Learning with Multi-Source Consistency for
Facial Landmark Detection
- URL: http://arxiv.org/abs/2012.06711v1
- Date: Sat, 12 Dec 2020 03:23:30 GMT
- Title: Teacher-Student Asynchronous Learning with Multi-Source Consistency for
Facial Landmark Detection
- Authors: Rongye Meng, Sanping Zhou, Xingyu Wan, Mengliu Li, Jinjun Wang
- Abstract summary: We propose a teacher-student asynchronous learning(TSAL) framework based on the multi-source supervision signal consistency criterion.
Experiments on 300W, AFLW, and 300VW benchmarks show that TSAL framework achieves state-of-the-art performance.
- Score: 15.796415030063802
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Due to the high annotation cost of large-scale facial landmark detection
tasks in videos, a semi-supervised paradigm that uses self-training for mining
high-quality pseudo-labels to participate in training has been proposed by
researchers. However, self-training based methods often train with a gradually
increasing number of samples, whose performances vary a lot depending on the
number of pseudo-labeled samples added.
In this paper, we propose a teacher-student asynchronous learning~(TSAL)
framework based on the multi-source supervision signal consistency criterion,
which implicitly mines pseudo-labels through consistency constraints.
Specifically, the TSAL framework contains two models with exactly the same
structure. The radical student uses multi-source supervision signals from the
same task to update parameters, while the calm teacher uses a single-source
supervision signal to update parameters. In order to reasonably absorb
student's suggestions, teacher's parameters are updated again through recursive
average filtering. The experimental results prove that asynchronous-learning
framework can effectively filter noise in multi-source supervision signals,
thereby mining the pseudo-labels which are more significant for network
parameter updating. And extensive experiments on 300W, AFLW, and 300VW
benchmarks show that the TSAL framework achieves state-of-the-art performance.
Related papers
- Collaborative Static-Dynamic Teaching: A Semi-Supervised Framework for Stripe-Like Space Target Detection [2.9133687889451023]
Stripe-like space target detection is crucial for space situational awareness.
Traditional unsupervised methods often fail in low signal-to-noise ratio and variable stripe-like space targets scenarios.
We introduce an innovative Collaborative Static-Dynamic Teacher (CSDT) SSL framework, which includes static and dynamic teacher models as well as a student model.
We also present MSSA-Net, a novel SSTD network featuring a multi-scale dual-path convolution (MDPC) block and a feature map weighted attention (FMWA) block.
arXiv Detail & Related papers (2024-08-09T12:33:27Z) - ACTRESS: Active Retraining for Semi-supervised Visual Grounding [52.08834188447851]
A previous study, RefTeacher, makes the first attempt to tackle this task by adopting the teacher-student framework to provide pseudo confidence supervision and attention-based supervision.
This approach is incompatible with current state-of-the-art visual grounding models, which follow the Transformer-based pipeline.
Our paper proposes the ACTive REtraining approach for Semi-Supervised Visual Grounding, abbreviated as ACTRESS.
arXiv Detail & Related papers (2024-07-03T16:33:31Z) - Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch [35.42630035488178]
We introduce a novel framework named Diverse Teacher-Students (textbfDTS)
By training both teacher-student models with all unlabeled samples, DTS can enhance the classification of seen classes while simultaneously improving the detection of unseen classes.
arXiv Detail & Related papers (2024-05-25T06:54:43Z) - Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection [134.05510658882278]
Cyclic-Bootstrap Labeling (CBL) is a novel weakly supervised object detection pipeline.
Uses a weighted exponential moving average strategy to take advantage of various refinement modules.
A novel class-specific ranking distillation algorithm is proposed to leverage the output of weighted ensembled teacher network.
arXiv Detail & Related papers (2023-08-11T07:57:17Z) - MSVQ: Self-Supervised Learning with Multiple Sample Views and Queues [10.327408694770709]
We propose a new simple framework, namely Multiple Sample Views and Queues (MSVQ)
We jointly construct three soft labels on-the-fly by utilizing two complementary and symmetric approaches.
Let the student network mimic the similarity relationships between the samples, thus giving the student network a more flexible ability to identify false negative samples in the dataset.
arXiv Detail & Related papers (2023-05-09T12:05:14Z) - Hierarchical Supervision and Shuffle Data Augmentation for 3D
Semi-Supervised Object Detection [90.32180043449263]
State-of-the-art 3D object detectors are usually trained on large-scale datasets with high-quality 3D annotations.
A natural remedy is to adopt semi-supervised learning (SSL) by leveraging a limited amount of labeled samples and abundant unlabeled samples.
This paper introduces a novel approach of Hierarchical Supervision and Shuffle Data Augmentation (HSSDA), which is a simple yet effective teacher-student framework.
arXiv Detail & Related papers (2023-04-04T02:09:32Z) - A Weakly Supervised Learning Framework for Salient Object Detection via
Hybrid Labels [96.56299163691979]
This paper focuses on a new weakly-supervised salient object detection (SOD) task under hybrid labels.
To address the issues of label noise and quantity imbalance in this task, we design a new pipeline framework with three sophisticated training strategies.
Experiments on five SOD benchmarks show that our method achieves competitive performance against weakly-supervised/unsupervised methods.
arXiv Detail & Related papers (2022-09-07T06:45:39Z) - Active Learning for Deep Visual Tracking [51.5063680734122]
Convolutional neural networks (CNNs) have been successfully applied to the single target tracking task in recent years.
In this paper, we propose an active learning method for deep visual tracking, which selects and annotates the unlabeled samples to train the deep CNNs model.
Under the guidance of active learning, the tracker based on the trained deep CNNs model can achieve competitive tracking performance while reducing the labeling cost.
arXiv Detail & Related papers (2021-10-17T11:47:56Z) - Graph Consistency based Mean-Teaching for Unsupervised Domain Adaptive
Person Re-Identification [54.58165777717885]
This paper proposes a Graph Consistency based Mean-Teaching (GCMT) method with constructing the Graph Consistency Constraint (GCC) between teacher and student networks.
Experiments on three datasets, i.e., Market-1501, DukeMTMCreID, and MSMT17, show that proposed GCMT outperforms state-of-the-art methods by clear margin.
arXiv Detail & Related papers (2021-05-11T04:09:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.