All Mistakes Are Not Equal: Comprehensive Hierarchy Aware Multi-label
Predictions (CHAMP)
- URL: http://arxiv.org/abs/2206.08653v1
- Date: Fri, 17 Jun 2022 09:32:48 GMT
- Title: All Mistakes Are Not Equal: Comprehensive Hierarchy Aware Multi-label
Predictions (CHAMP)
- Authors: Ashwin Vaswani, Gaurav Aggarwal, Praneeth Netrapalli, Narayan G Hegde
- Abstract summary: We present a framework that penalizes a misprediction depending on its severity as per the hierarchy tree.
Our method provides a framework to enhance existing multilabel classification algorithms with better mistakes.
- Score: 19.41500672342727
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper considers the problem of Hierarchical Multi-Label Classification
(HMC), where (i) several labels can be present for each example, and (ii)
labels are related via a domain-specific hierarchy tree. Guided by the
intuition that all mistakes are not equal, we present Comprehensive Hierarchy
Aware Multi-label Predictions (CHAMP), a framework that penalizes a
misprediction depending on its severity as per the hierarchy tree. While there
have been works that apply such an idea to single-label classification, to the
best of our knowledge, there are limited such works for multilabel
classification focusing on the severity of mistakes. The key reason is that
there is no clear way of quantifying the severity of a misprediction a priori
in the multilabel setting. In this work, we propose a simple but effective
metric to quantify the severity of a mistake in HMC, naturally leading to
CHAMP. Extensive experiments on six public HMC datasets across modalities
(image, audio, and text) demonstrate that incorporating hierarchical
information leads to substantial gains as CHAMP improves both AUPRC (2.6%
median percentage improvement) and hierarchical metrics (2.85% median
percentage improvement), over stand-alone hierarchical or multilabel
classification methods. Compared to standard multilabel baselines, CHAMP
provides improved AUPRC in both robustness (8.87% mean percentage improvement )
and less data regimes. Further, our method provides a framework to enhance
existing multilabel classification algorithms with better mistakes (18.1% mean
percentage increment).
Related papers
- Rethinking Consistent Multi-Label Classification under Inexact Supervision [60.79309683889278]
In partial multi-label learning, each instance is annotated with a candidate label set, among which only some labels are relevant.<n>In complementary multi-label learning, each instance is annotated with complementary labels indicating the classes to which the instance does not belong.
arXiv Detail & Related papers (2025-10-05T08:30:32Z) - Hierarchical Multi-Label Generation with Probabilistic Level-Constraint [3.1427813443719868]
Hierarchical Extreme Multi-Label Classification poses greater difficulties compared to traditional multi-label classification.<n>We employ a generative framework with Probabilistic Level Constraints (PLC) to generate hierarchical labels within a specific taxonomy.<n>Our approach achieves a new SOTA performance in the HMG task, but also has a much better performance in constrained the output of model than previous research work.
arXiv Detail & Related papers (2025-04-30T07:56:53Z) - Generating Unbiased Pseudo-labels via a Theoretically Guaranteed
Chebyshev Constraint to Unify Semi-supervised Classification and Regression [57.17120203327993]
threshold-to-pseudo label process (T2L) in classification uses confidence to determine the quality of label.
In nature, regression also requires unbiased methods to generate high-quality labels.
We propose a theoretically guaranteed constraint for generating unbiased labels based on Chebyshev's inequality.
arXiv Detail & Related papers (2023-11-03T08:39:35Z) - RLSEP: Learning Label Ranks for Multi-label Classification [0.0]
Multi-label ranking maps instances to a ranked set of predicted labels from multiple possible classes.
We propose a novel dedicated loss function to optimize models by incorporating penalties for incorrectly ranked pairs.
Our method achieves the best reported performance measures on both synthetic and real world ranked datasets.
arXiv Detail & Related papers (2022-12-08T00:59:09Z) - Decision Making for Hierarchical Multi-label Classification with
Multidimensional Local Precision Rate [4.812468844362369]
We introduce a new statistic called the multidimensional local precision rate (mLPR) for each object in each class.
We show that classification decisions made by simply sorting objects across classes in descending order of their mLPRs can, in theory, ensure the class hierarchy.
In response, we introduce HierRank, a new algorithm that maximizes an empirical version of CATCH using estimated mLPRs while respecting the hierarchy.
arXiv Detail & Related papers (2022-05-16T17:43:35Z) - Multi-Label Gold Asymmetric Loss Correction with Single-Label Regulators [6.129273021888717]
We propose a novel Gold Asymmetric Loss Correction with Single-Label Regulators (GALC-SLR) that operates robust against noisy labels.
GALC-SLR estimates the noise confusion matrix using single-label samples, then constructs an asymmetric loss correction via estimated confusion matrix to avoid overfitting to the noisy labels.
Empirical results show that our method outperforms the state-of-the-art original asymmetric loss multi-label classifier under all corruption levels.
arXiv Detail & Related papers (2021-08-04T12:57:29Z) - Rethinking Pseudo Labels for Semi-Supervised Object Detection [84.697097472401]
We introduce certainty-aware pseudo labels tailored for object detection.
We dynamically adjust the thresholds used to generate pseudo labels and reweight loss functions for each category to alleviate the class imbalance problem.
Our approach improves supervised baselines by up to 10% AP using only 1-10% labeled data from COCO.
arXiv Detail & Related papers (2021-06-01T01:32:03Z) - PLM: Partial Label Masking for Imbalanced Multi-label Classification [59.68444804243782]
Neural networks trained on real-world datasets with long-tailed label distributions are biased towards frequent classes and perform poorly on infrequent classes.
We propose a method, Partial Label Masking (PLM), which utilizes this ratio during training.
Our method achieves strong performance when compared to existing methods on both multi-label (MultiMNIST and MSCOCO) and single-label (imbalanced CIFAR-10 and CIFAR-100) image classification datasets.
arXiv Detail & Related papers (2021-05-22T18:07:56Z) - Coherent Hierarchical Multi-Label Classification Networks [56.41950277906307]
C-HMCNN(h) is a novel approach for HMC problems, which exploits hierarchy information in order to produce predictions coherent with the constraint and improve performance.
We conduct an extensive experimental analysis showing the superior performance of C-HMCNN(h) when compared to state-of-the-art models.
arXiv Detail & Related papers (2020-10-20T09:37:02Z) - Joint Visual and Temporal Consistency for Unsupervised Domain Adaptive
Person Re-Identification [64.37745443119942]
This paper jointly enforces visual and temporal consistency in the combination of a local one-hot classification and a global multi-class classification.
Experimental results on three large-scale ReID datasets demonstrate the superiority of proposed method in both unsupervised and unsupervised domain adaptive ReID tasks.
arXiv Detail & Related papers (2020-07-21T14:31:27Z) - Learning Gradient Boosted Multi-label Classification Rules [4.842945656927122]
We propose an algorithm for learning multi-label classification rules that is able to minimize decomposable as well as non-decomposable loss functions.
We analyze the abilities and limitations of our approach on synthetic data and evaluate its predictive performance on multi-label benchmarks.
arXiv Detail & Related papers (2020-06-23T21:39:23Z) - Unsupervised Person Re-identification via Multi-label Classification [55.65870468861157]
This paper formulates unsupervised person ReID as a multi-label classification task to progressively seek true labels.
Our method starts by assigning each person image with a single-class label, then evolves to multi-label classification by leveraging the updated ReID model for label prediction.
To boost the ReID model training efficiency in multi-label classification, we propose the memory-based multi-label classification loss (MMCL)
arXiv Detail & Related papers (2020-04-20T12:13:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.