Imbalance Robust Softmax for Deep Embeeding Learning
- URL: http://arxiv.org/abs/2011.11155v1
- Date: Mon, 23 Nov 2020 00:43:07 GMT
- Title: Imbalance Robust Softmax for Deep Embeeding Learning
- Authors: Hao Zhu, Yang Yuan, Guosheng Hu, Xiang Wu, Neil Robertson
- Abstract summary: In recent years, one research focus is to solve the open-set problem by discriminative deep embedding learning in the field of face recognition (FR) and person re-identification (re-ID)
We find that imbalanced training data is another main factor causing the performance of FR and re-ID with softmax or its variants.
We propose a unified framework, Imbalance-Robust Softmax (IR-Softmax), which can simultaneously solve the open-set problem and reduce the influence of data imbalance.
- Score: 34.95520933299555
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep embedding learning is expected to learn a metric space in which features
have smaller maximal intra-class distance than minimal inter-class distance. In
recent years, one research focus is to solve the open-set problem by
discriminative deep embedding learning in the field of face recognition (FR)
and person re-identification (re-ID). Apart from open-set problem, we find that
imbalanced training data is another main factor causing the performance
degradation of FR and re-ID, and data imbalance widely exists in the real
applications. However, very little research explores why and how data imbalance
influences the performance of FR and re-ID with softmax or its variants. In
this work, we deeply investigate data imbalance in the perspective of neural
network optimisation and feature distribution about softmax. We find one main
reason of performance degradation caused by data imbalance is that the weights
(from the penultimate fully-connected layer) are far from their class centers
in feature space. Based on this investigation, we propose a unified framework,
Imbalance-Robust Softmax (IR-Softmax), which can simultaneously solve the
open-set problem and reduce the influence of data imbalance. IR-Softmax can
generalise to any softmax and its variants (which are discriminative for
open-set problem) by directly setting the weights as their class centers,
naturally solving the data imbalance problem. In this work, we explicitly
re-formulate two discriminative softmax (A-Softmax and AM-Softmax) under the
framework of IR-Softmax. We conduct extensive experiments on FR databases (LFW,
MegaFace) and re-ID database (Market-1501, Duke), and IR-Softmax outperforms
many state-of-the-art methods.
Related papers
- Statistical Mechanics of Min-Max Problems [0.0]
This study introduces a statistical mechanical formalism for analyzing the equilibrium values of min-max problems in the high-dimensional limit.
As a first step, we apply this formalism to bi-linear min-max games and simple GANs.
This formalism provides a groundwork for a deeper theoretical analysis of the equilibrium properties in various machine learning methods.
arXiv Detail & Related papers (2024-09-09T20:24:19Z) - Revisiting Logistic-softmax Likelihood in Bayesian Meta-Learning for Few-Shot Classification [4.813254903898101]
logistic-softmax is often employed as an alternative to the softmax likelihood in multi-class Gaussian process classification.
We revisit and redesign the logistic-softmax likelihood, which enables control of the textita priori confidence level through a temperature parameter.
Our approach yields well-calibrated uncertainty estimates and achieves comparable or superior results on standard benchmark datasets.
arXiv Detail & Related papers (2023-10-16T13:20:13Z) - Federated Compositional Deep AUC Maximization [58.25078060952361]
We develop a novel federated learning method for imbalanced data by directly optimizing the area under curve (AUC) score.
To the best of our knowledge, this is the first work to achieve such favorable theoretical results.
arXiv Detail & Related papers (2023-04-20T05:49:41Z) - Spectral Aware Softmax for Visible-Infrared Person Re-Identification [123.69049942659285]
Visible-infrared person re-identification (VI-ReID) aims to match specific pedestrian images from different modalities.
Existing methods still follow the softmax loss training paradigm, which is widely used in single-modality classification tasks.
We propose the spectral-aware softmax (SA-Softmax) loss, which can fully explore the embedding space with the modality information.
arXiv Detail & Related papers (2023-02-03T02:57:18Z) - Stochastic Methods for AUC Optimization subject to AUC-based Fairness
Constraints [51.12047280149546]
A direct approach for obtaining a fair predictive model is to train the model through optimizing its prediction performance subject to fairness constraints.
We formulate the training problem of a fairness-aware machine learning model as an AUC optimization problem subject to a class of AUC-based fairness constraints.
We demonstrate the effectiveness of our approach on real-world data under different fairness metrics.
arXiv Detail & Related papers (2022-12-23T22:29:08Z) - Sparse-softmax: A Simpler and Faster Alternative Softmax Transformation [2.3813678058429626]
The softmax function is widely used in artificial neural networks for the multiclass classification problems.
In this paper, we provide an empirical study on a simple and concise softmax variant, namely sparse-softmax, to alleviate the problem that occurred in traditional softmax in terms of high-dimensional classification problems.
arXiv Detail & Related papers (2021-12-23T09:53:38Z) - Breaking the Softmax Bottleneck for Sequential Recommender Systems with
Dropout and Decoupling [0.0]
We show that there are more aspects to the Softmax bottleneck in SBRSs.
We propose a simple yet effective method, Dropout and Decoupling (D&D), to alleviate these problems.
Our method significantly improves the accuracy of a variety of Softmax-based SBRS algorithms.
arXiv Detail & Related papers (2021-10-11T16:52:23Z) - Federated Deep AUC Maximization for Heterogeneous Data with a Constant
Communication Complexity [77.78624443410216]
We propose improved FDAM algorithms for detecting heterogeneous chest data.
A result of this paper is that the communication of the proposed algorithm is strongly independent of the number of machines and also independent of the accuracy level.
Experiments have demonstrated the effectiveness of our FDAM algorithm on benchmark datasets and on medical chest Xray images from different organizations.
arXiv Detail & Related papers (2021-02-09T04:05:19Z) - Non-convex Min-Max Optimization: Applications, Challenges, and Recent
Theoretical Advances [58.54078318403909]
The min-max problem, also known as the saddle point problem, is a class adversarial problem which is also studied in the context ofsum games.
arXiv Detail & Related papers (2020-06-15T05:33:42Z) - SS-IL: Separated Softmax for Incremental Learning [15.161214516549844]
We consider class incremental learning (CIL) problem, in which a learning agent continuously learns new classes from incrementally arriving training data batches.
The main challenge of the problem is the catastrophic forgetting.
We propose a new method, dubbed as Separated Softmax for Incremental Learning (SS-IL), that consists of separated softmax (SS) output layer combined with task-wise knowledge distillation (TKD) to resolve such bias.
arXiv Detail & Related papers (2020-03-31T04:36:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.