Classifying with Uncertain Data Envelopment Analysis
- URL: http://arxiv.org/abs/2209.01052v1
- Date: Fri, 2 Sep 2022 13:41:19 GMT
- Title: Classifying with Uncertain Data Envelopment Analysis
- Authors: Casey Garner and Allen Holder
- Abstract summary: We propose a new classification scheme premised on the reality of imperfect data.
Our model uses uncertain data envelopment analysis to define a classification's proximity to equitable efficiency.
We conclude by classifying the thirty stocks in the Dow Jones Industrial average into performant tiers and by classifying prostate treatments into clinically effectual categories.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Classifications organize entities into categories that identify similarities
within a category and discern dissimilarities among categories, and they
powerfully classify information in support of analysis. We propose a new
classification scheme premised on the reality of imperfect data. Our
computational model uses uncertain data envelopment analysis to define a
classification's proximity to equitable efficiency, which is an aggregate
measure of intra-similarity within a classification's categories. Our
classification process has two overriding computational challenges, those being
a loss of convexity and a combinatorially explosive search space. We overcome
the first by establishing lower and upper bounds on the proximity value, and
then by searching this range with a first-order algorithm. We overcome the
second by adapting the p-median problem to initiate our exploration, and by
then employing an iterative neighborhood search to finalize a classification.
We conclude by classifying the thirty stocks in the Dow Jones Industrial
average into performant tiers and by classifying prostate treatments into
clinically effectual categories.
Related papers
- Learn to Categorize or Categorize to Learn? Self-Coding for Generalized
Category Discovery [49.1865089933055]
We propose a novel, efficient and self-supervised method capable of discovering previously unknown categories at test time.
A salient feature of our approach is the assignment of minimum length category codes to individual data instances.
Experimental evaluations, bolstered by state-of-the-art benchmark comparisons, testify to the efficacy of our solution.
arXiv Detail & Related papers (2023-10-30T17:45:32Z) - Anomaly Detection using Ensemble Classification and Evidence Theory [62.997667081978825]
We present a novel approach for novel detection using ensemble classification and evidence theory.
A pool selection strategy is presented to build a solid ensemble classifier.
We use uncertainty for the anomaly detection approach.
arXiv Detail & Related papers (2022-12-23T00:50:41Z) - Parametric Classification for Generalized Category Discovery: A Baseline
Study [70.73212959385387]
Generalized Category Discovery (GCD) aims to discover novel categories in unlabelled datasets using knowledge learned from labelled samples.
We investigate the failure of parametric classifiers, verify the effectiveness of previous design choices when high-quality supervision is available, and identify unreliable pseudo-labels as a key problem.
We propose a simple yet effective parametric classification method that benefits from entropy regularisation, achieves state-of-the-art performance on multiple GCD benchmarks and shows strong robustness to unknown class numbers.
arXiv Detail & Related papers (2022-11-21T18:47:11Z) - Uncertainty-based Network for Few-shot Image Classification [17.912365063048263]
We propose Uncertainty-Based Network, which models the uncertainty of classification results with the help of mutual information.
We show that Uncertainty-Based Network achieves comparable performance in classification accuracy compared to state-of-the-art method.
arXiv Detail & Related papers (2022-05-17T07:49:32Z) - Out-of-Category Document Identification Using Target-Category Names as
Weak Supervision [64.671654559798]
Out-of-category detection aims to distinguish documents according to their semantic relevance to the inlier (or target) categories.
We present an out-of-category detection framework, which effectively measures how confidently each document belongs to one of the target categories.
arXiv Detail & Related papers (2021-11-24T21:01:25Z) - On Clustering Categories of Categorical Predictors in Generalized Linear
Models [0.0]
We propose a method to reduce the complexity of Generalized Linear Models in the presence of categorical predictors.
The traditional one-hot encoding, where each category is represented by a dummy variable, can be wasteful, difficult to interpret, and prone to overfitting.
This paper addresses these challenges by finding a reduced representation of the categorical predictors by clustering their categories.
arXiv Detail & Related papers (2021-10-19T15:36:35Z) - A Framework for Multi-View Classification of Features [6.660458629649826]
In solving the data classification problems, when the feature set is too large, typical approaches will not be able to solve the problem.
In this research, an innovative framework for multi-view ensemble classification, inspired by the problem of object recognition in the multiple views theory of humans, is proposed.
arXiv Detail & Related papers (2021-08-02T16:27:43Z) - Inducing a hierarchy for multi-class classification problems [11.58041597483471]
In applications where categorical labels follow a natural hierarchy, classification methods that exploit the label structure often outperform those that do not.
In this paper, we investigate a class of methods that induce a hierarchy that can similarly improve classification performance over flat classifiers.
We demonstrate the effectiveness of the class of methods both for discovering a latent hierarchy and for improving accuracy in principled simulation settings and three real data applications.
arXiv Detail & Related papers (2021-02-20T05:40:42Z) - Theoretical Insights Into Multiclass Classification: A High-dimensional
Asymptotic View [82.80085730891126]
We provide the first modernally precise analysis of linear multiclass classification.
Our analysis reveals that the classification accuracy is highly distribution-dependent.
The insights gained may pave the way for a precise understanding of other classification algorithms.
arXiv Detail & Related papers (2020-11-16T05:17:29Z) - Deep Hierarchical Classification for Category Prediction in E-commerce
System [16.6932395109085]
In e-commerce system, category prediction is to automatically predict categories of given texts.
We propose a Deep Hierarchical Classification framework, which incorporates the multi-scale hierarchical information in neural networks.
We also define a novel combined loss function to punish hierarchical prediction losses.
arXiv Detail & Related papers (2020-05-14T02:29:14Z) - Fine-Grained Visual Classification with Efficient End-to-end
Localization [49.9887676289364]
We present an efficient localization module that can be fused with a classification network in an end-to-end setup.
We evaluate the new model on the three benchmark datasets CUB200-2011, Stanford Cars and FGVC-Aircraft.
arXiv Detail & Related papers (2020-05-11T14:07:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.