Towards Speaker Age Estimation with Label Distribution Learning
- URL: http://arxiv.org/abs/2202.11424v1
- Date: Wed, 23 Feb 2022 11:11:58 GMT
- Title: Towards Speaker Age Estimation with Label Distribution Learning
- Authors: Shijing Si, Jianzong Wang, Junqing Peng, Jing Xiao
- Abstract summary: We utilize the ambiguous information among the age labels, convert each age label into a discrete label distribution and leverage the label distribution learning (LDL) method to fit the data.
Our method naturally combines the age classification and regression approaches, which enhances the robustness of our method.
We conduct experiments on the public NIST SRE08-10 dataset and a real-world dataset, which exhibit that our method outperforms baseline methods by a relatively large margin.
- Score: 26.12240876065871
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Existing methods for speaker age estimation usually treat it as a multi-class
classification or a regression problem. However, precise age identification
remains a challenge due to label ambiguity, \emph{i.e.}, utterances from
adjacent age of the same person are often indistinguishable. To address this,
we utilize the ambiguous information among the age labels, convert each age
label into a discrete label distribution and leverage the label distribution
learning (LDL) method to fit the data. For each audio data sample, our method
produces a age distribution of its speaker, and on top of the distribution we
also perform two other tasks: age prediction and age uncertainty minimization.
Therefore, our method naturally combines the age classification and regression
approaches, which enhances the robustness of our method. We conduct experiments
on the public NIST SRE08-10 dataset and a real-world dataset, which exhibit
that our method outperforms baseline methods by a relatively large margin,
yielding a 10\% reduction in terms of mean absolute error (MAE) on a real-world
dataset.
Related papers
- Label Distribution Learning with Biased Annotations by Learning Multi-Label Representation [120.97262070068224]
Multi-label learning (MLL) has gained attention for its ability to represent real-world data.
Label Distribution Learning (LDL) faces challenges in collecting accurate label distributions.
arXiv Detail & Related papers (2025-02-03T09:04:03Z) - Improving realistic semi-supervised learning with doubly robust estimation [8.828699635463265]
A major challenge in Semi-Supervised Learning (SSL) is the limited information available about the class distribution in the unlabeled data.
We propose to explicitly estimate the unlabeled class distribution, which is a finite-dimensional parameter, emphas an initial step, using a doubly robust estimator with a strong theoretical guarantee.
This estimate can then be integrated into existing methods to pseudo-label the unlabeled data during training more accurately.
arXiv Detail & Related papers (2025-02-01T02:34:12Z) - From Age Estimation to Age-Invariant Face Recognition: Generalized Age Feature Extraction Using Order-Enhanced Contrastive Learning [23.817867981093382]
Generalized age feature extraction is crucial for age-related facial analysis tasks.
We propose Order-Enhanced Contrastive Learning (OrdCon) to minimize the domain gap across different datasets and scenarios.
We demonstrate that our proposed method achieves comparable results to state-of-the-art methods on various benchmark datasets.
arXiv Detail & Related papers (2025-01-03T11:23:52Z) - Continuous Contrastive Learning for Long-Tailed Semi-Supervised Recognition [50.61991746981703]
Current state-of-the-art LTSSL approaches rely on high-quality pseudo-labels for large-scale unlabeled data.
This paper introduces a novel probabilistic framework that unifies various recent proposals in long-tail learning.
We introduce a continuous contrastive learning method, CCL, extending our framework to unlabeled data using reliable and smoothed pseudo-labels.
arXiv Detail & Related papers (2024-10-08T15:06:10Z) - Partial-Label Regression [54.74984751371617]
Partial-label learning is a weakly supervised learning setting that allows each training example to be annotated with a set of candidate labels.
Previous studies on partial-label learning only focused on the classification setting where candidate labels are all discrete.
In this paper, we provide the first attempt to investigate partial-label regression, where each training example is annotated with a set of real-valued candidate labels.
arXiv Detail & Related papers (2023-06-15T09:02:24Z) - SVLDL: Improved Speaker Age Estimation Using Selective Variance Label
Distribution Learning [24.57668015470307]
We propose selective variance label distribution learning (SVLDL) method to adapt the variance of different age distributions.
Model uses WavLM as the speech feature extractor and adds the auxiliary task of gender recognition to further improve the performance.
Experiments show that the model achieves state-of-the-art performance on all aspects of the NIST SRE08-10 and a real-world datasets.
arXiv Detail & Related papers (2022-10-18T01:34:31Z) - Re-distributing Biased Pseudo Labels for Semi-supervised Semantic
Segmentation: A Baseline Investigation [30.688753736660725]
We present a simple and yet effective Distribution Alignment and Random Sampling (DARS) method to produce unbiased pseudo labels.
Our method performs favorably in comparison with state-of-the-art approaches.
arXiv Detail & Related papers (2021-07-23T14:45:14Z) - Disentangling Sampling and Labeling Bias for Learning in Large-Output
Spaces [64.23172847182109]
We show that different negative sampling schemes implicitly trade-off performance on dominant versus rare labels.
We provide a unified means to explicitly tackle both sampling bias, arising from working with a subset of all labels, and labeling bias, which is inherent to the data due to label imbalance.
arXiv Detail & Related papers (2021-05-12T15:40:13Z) - Semi-supervised Long-tailed Recognition using Alternate Sampling [95.93760490301395]
Main challenges in long-tailed recognition come from the imbalanced data distribution and sample scarcity in its tail classes.
We propose a new recognition setting, namely semi-supervised long-tailed recognition.
We demonstrate significant accuracy improvements over other competitive methods on two datasets.
arXiv Detail & Related papers (2021-05-01T00:43:38Z) - Enhancing Facial Data Diversity with Style-based Face Aging [59.984134070735934]
In particular, face datasets are typically biased in terms of attributes such as gender, age, and race.
We propose a novel, generative style-based architecture for data augmentation that captures fine-grained aging patterns.
We show that the proposed method outperforms state-of-the-art algorithms for age transfer.
arXiv Detail & Related papers (2020-06-06T21:53:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.