An Interactive Interface for Novel Class Discovery in Tabular Data
- URL: http://arxiv.org/abs/2306.12919v1
- Date: Thu, 22 Jun 2023 14:32:53 GMT
- Title: An Interactive Interface for Novel Class Discovery in Tabular Data
- Authors: Colin Troisemaine, Joachim Flocon-Cholet, St\'ephane Gosselin,
Alexandre Reiffers-Masson, Sandrine Vaton, Vincent Lemaire
- Abstract summary: Novel Class Discovery (NCD) is the problem of trying to discover novel classes in an unlabeled set, given a labeled set of different but related classes.
The majority of NCD methods proposed so far only deal with image data.
This interface allows a domain expert to easily run state-of-the-art algorithms for NCD in tabular data.
- Score: 54.11148718494725
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Novel Class Discovery (NCD) is the problem of trying to discover novel
classes in an unlabeled set, given a labeled set of different but related
classes. The majority of NCD methods proposed so far only deal with image data,
despite tabular data being among the most widely used type of data in practical
applications. To interpret the results of clustering or NCD algorithms, data
scientists need to understand the domain- and application-specific attributes
of tabular data. This task is difficult and can often only be performed by a
domain expert. Therefore, this interface allows a domain expert to easily run
state-of-the-art algorithms for NCD in tabular data. With minimal knowledge in
data science, interpretable results can be generated.
Related papers
- Exclusive Style Removal for Cross Domain Novel Class Discovery [15.868889486516306]
Novel Class Discovery (NCD) is a promising field in open-world learning.
We introduce an exclusive style removal module for extracting style information that is distinctive from the baseline features.
This module is easy to integrate with other NCD methods, acting as a plug-in to improve performance on novel classes with different distributions.
arXiv Detail & Related papers (2024-06-26T07:44:27Z) - A Practical Approach to Novel Class Discovery in Tabular Data [38.41548083078336]
Novel Class Discovery (NCD) is a problem of extracting knowledge from a labeled set of known classes to accurately partition an unlabeled set of novel classes.
In this work, we propose to tune the hyper parameters of NCD methods by adapting the $k$-fold cross-validation process and hiding some of the known classes in each fold.
We find that the latent space of this method can be used to reliably estimate the number of novel classes.
arXiv Detail & Related papers (2023-11-09T15:24:44Z) - Dynamic Conceptional Contrastive Learning for Generalized Category
Discovery [76.82327473338734]
Generalized category discovery (GCD) aims to automatically cluster partially labeled data.
Unlabeled data contain instances that are not only from known categories of the labeled data but also from novel categories.
One effective way for GCD is applying self-supervised learning to learn discriminate representation for unlabeled data.
We propose a Dynamic Conceptional Contrastive Learning framework, which can effectively improve clustering accuracy.
arXiv Detail & Related papers (2023-03-30T14:04:39Z) - D\'ecouvrir de nouvelles classes dans des donn\'ees tabulaires [54.11148718494725]
In Novel Class Discovery (NCD), the goal is to find new classes in an unlabeled set given a labeled set of known but different classes.
We show a way to extract knowledge from already known classes to guide the discovery process of novel classes in heterogeneous data.
arXiv Detail & Related papers (2022-11-28T09:48:55Z) - A Method for Discovering Novel Classes in Tabular Data [54.11148718494725]
In Novel Class Discovery (NCD), the goal is to find new classes in an unlabeled set given a labeled set of known but different classes.
We show a way to extract knowledge from already known classes to guide the discovery process of novel classes in heterogeneous data.
arXiv Detail & Related papers (2022-09-02T11:45:24Z) - XCon: Learning with Experts for Fine-grained Category Discovery [4.787507865427207]
We present a novel method called Expert-Contrastive Learning (XCon) to help the model to mine useful information from the images.
Experiments on fine-grained datasets show a clear improved performance over the previous best methods, indicating the effectiveness of our method.
arXiv Detail & Related papers (2022-08-03T08:03:12Z) - Spacing Loss for Discovering Novel Categories [72.52222295216062]
Novel Class Discovery (NCD) is a learning paradigm, where a machine learning model is tasked to semantically group instances from unlabeled data.
We first characterize existing NCD approaches into single-stage and two-stage methods based on whether they require access to labeled and unlabeled data together.
We devise a simple yet powerful loss function that enforces separability in the latent space using cues from multi-dimensional scaling.
arXiv Detail & Related papers (2022-04-22T09:37:11Z) - CvS: Classification via Segmentation For Small Datasets [52.821178654631254]
This paper presents CvS, a cost-effective classifier for small datasets that derives the classification labels from predicting the segmentation maps.
We evaluate the effectiveness of our framework on diverse problems showing that CvS is able to achieve much higher classification results compared to previous methods when given only a handful of examples.
arXiv Detail & Related papers (2021-10-29T18:41:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.