Class-constrained t-SNE: Combining Data Features and Class Probabilities
- URL: http://arxiv.org/abs/2308.13837v1
- Date: Sat, 26 Aug 2023 10:05:07 GMT
- Title: Class-constrained t-SNE: Combining Data Features and Class Probabilities
- Authors: Linhao Meng, Stef van den Elzen, Nicola Pezzotti, and Anna Vilanova
- Abstract summary: We propose a class-constrained t-SNE that combines data features and class probabilities in the same DR result.
We illustrate its application potential in model evaluation and visual-interactive labeling.
- Score: 1.3285222309805058
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Data features and class probabilities are two main perspectives when, e.g.,
evaluating model results and identifying problematic items. Class probabilities
represent the likelihood that each instance belongs to a particular class,
which can be produced by probabilistic classifiers or even human labeling with
uncertainty. Since both perspectives are multi-dimensional data, dimensionality
reduction (DR) techniques are commonly used to extract informative
characteristics from them. However, existing methods either focus solely on the
data feature perspective or rely on class probability estimates to guide the DR
process. In contrast to previous work where separate views are linked to
conduct the analysis, we propose a novel approach, class-constrained t-SNE,
that combines data features and class probabilities in the same DR result.
Specifically, we combine them by balancing two corresponding components in a
cost function to optimize the positions of data points and iconic
representation of classes -- class landmarks. Furthermore, an interactive
user-adjustable parameter balances these two components so that users can focus
on the weighted perspectives of interest and also empowers a smooth visual
transition between varying perspectives to preserve the mental map. We
illustrate its application potential in model evaluation and visual-interactive
labeling. A comparative analysis is performed to evaluate the DR results.
Related papers
- Learning Decomposable and Debiased Representations via Attribute-Centric Information Bottlenecks [21.813755593742858]
Biased attributes, spuriously correlated with target labels in a dataset, can problematically lead to neural networks that learn improper shortcuts for classifications.
We propose a novel debiasing framework, Debiasing Global Workspace, introducing attention-based information bottlenecks for learning compositional representations of attributes.
We conduct comprehensive evaluations on biased datasets, along with both quantitative and qualitative analyses, to showcase our approach's efficacy.
arXiv Detail & Related papers (2024-03-21T05:33:49Z) - Common-Sense Bias Modeling for Classification Tasks [15.683471433842492]
We propose a novel framework to extract comprehensive biases in image datasets based on textual descriptions.
Our method uncovers novel model biases in multiple image benchmark datasets.
The discovered bias can be mitigated by simple data re-weighting to de-correlate the features.
arXiv Detail & Related papers (2024-01-24T03:56:07Z) - CS-Shapley: Class-wise Shapley Values for Data Valuation in
Classification [24.44357623723746]
We propose CS-Shapley, a Shapley value with a new value function that discriminates between training instances' in-class and out-of-class contributions.
Our results suggest Shapley-based data valuation is transferable for application across different models.
arXiv Detail & Related papers (2022-11-13T03:32:33Z) - ACTIVE:Augmentation-Free Graph Contrastive Learning for Partial
Multi-View Clustering [52.491074276133325]
We propose an augmentation-free graph contrastive learning framework to solve the problem of partial multi-view clustering.
The proposed approach elevates instance-level contrastive learning and missing data inference to the cluster-level, effectively mitigating the impact of individual missing data on clustering.
arXiv Detail & Related papers (2022-03-01T02:32:25Z) - Generalized One-Class Learning Using Pairs of Complementary Classifiers [41.64645294104883]
One-class learning is the classic problem of fitting a model to the data for which annotations are available only for a single class.
In this paper, we explore novel objectives for one-class learning, which we collectively refer to as Generalized One-class Discriminative Subspaces (GODS)
arXiv Detail & Related papers (2021-06-24T18:52:05Z) - From Canonical Correlation Analysis to Self-supervised Graph Neural
Networks [99.44881722969046]
We introduce a conceptually simple yet effective model for self-supervised representation learning with graph data.
We optimize an innovative feature-level objective inspired by classical Canonical Correlation Analysis.
Our method performs competitively on seven public graph datasets.
arXiv Detail & Related papers (2021-06-23T15:55:47Z) - Auto-weighted Multi-view Feature Selection with Graph Optimization [90.26124046530319]
We propose a novel unsupervised multi-view feature selection model based on graph learning.
The contributions are threefold: (1) during the feature selection procedure, the consensus similarity graph shared by different views is learned.
Experiments on various datasets demonstrate the superiority of the proposed method compared with the state-of-the-art methods.
arXiv Detail & Related papers (2021-04-11T03:25:25Z) - Weakly-Supervised Aspect-Based Sentiment Analysis via Joint
Aspect-Sentiment Topic Embedding [71.2260967797055]
We propose a weakly-supervised approach for aspect-based sentiment analysis.
We learn sentiment, aspect> joint topic embeddings in the word embedding space.
We then use neural models to generalize the word-level discriminative information.
arXiv Detail & Related papers (2020-10-13T21:33:24Z) - A Few-Shot Sequential Approach for Object Counting [63.82757025821265]
We introduce a class attention mechanism that sequentially attends to objects in the image and extracts their relevant features.
The proposed technique is trained on point-level annotations and uses a novel loss function that disentangles class-dependent and class-agnostic aspects of the model.
We present our results on a variety of object-counting/detection datasets, including FSOD and MS COCO.
arXiv Detail & Related papers (2020-07-03T18:23:39Z) - Bayesian Sparse Factor Analysis with Kernelized Observations [67.60224656603823]
Multi-view problems can be faced with latent variable models.
High-dimensionality and non-linear issues are traditionally handled by kernel methods.
We propose merging both approaches into single model.
arXiv Detail & Related papers (2020-06-01T14:25:38Z) - FDive: Learning Relevance Models using Pattern-based Similarity Measures [27.136998442865217]
We present FDive, a visual active learning system that helps to create visually explorable relevance models.
Based on the best-ranked similarity measure, the system calculates an interactive Self-Organizing Map-based relevance model.
It also automatically prompts further relevance feedback to improve its accuracy.
arXiv Detail & Related papers (2019-07-29T15:37:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.