Discriminant Analysis in Contrasting Dimensions for Polycystic Ovary
Syndrome Prognostication
- URL: http://arxiv.org/abs/2201.03029v1
- Date: Sun, 9 Jan 2022 14:55:17 GMT
- Title: Discriminant Analysis in Contrasting Dimensions for Polycystic Ovary
Syndrome Prognostication
- Authors: Abhishek Gupta, Himanshu Soni, Raunak Joshi, Ronald Melwin Laban
- Abstract summary: Polycystic Ovary Syndrome also known as PCOS is a binary classification problem.
We present Discriminant Analysis in different dimensions with Linear and Quadratic form for binary classification along with metrics.
We were able to achieve good accuracy and less variation with Discriminant Analysis.
- Score: 3.1871776847712523
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: A lot of prognostication methodologies have been formulated for early
detection of Polycystic Ovary Syndrome also known as PCOS using Machine
Learning. PCOS is a binary classification problem. Dimensionality Reduction
methods impact the performance of Machine Learning to a greater extent and
using a Supervised Dimensionality Reduction method can give us a new edge to
tackle this problem. In this paper we present Discriminant Analysis in
different dimensions with Linear and Quadratic form for binary classification
along with metrics. We were able to achieve good accuracy and less variation
with Discriminant Analysis as compared to many commonly used classification
algorithms with training accuracy reaching 97.37% and testing accuracy of
95.92% using Quadratic Discriminant Analysis. Paper also gives the analysis of
data with visualizations for deeper understanding of problem.
Related papers
- An AI-Guided Data Centric Strategy to Detect and Mitigate Biases in
Healthcare Datasets [32.25265709333831]
We generate a data-centric, model-agnostic, task-agnostic approach to evaluate dataset bias by investigating the relationship between how easily different groups are learned at small sample sizes (AEquity)
We then apply a systematic analysis of AEq values across subpopulations to identify and manifestations of racial bias in two known cases in healthcare.
AEq is a novel and broadly applicable metric that can be applied to advance equity by diagnosing and remediating bias in healthcare datasets.
arXiv Detail & Related papers (2023-11-06T17:08:41Z) - Deep Linear Discriminant Analysis with Variation for Polycystic Ovary
Syndrome Classification [3.1871776847712523]
The simple machine learning algorithms can be improved with advanced frameworks using Deep Learning.
The Linear Discriminant Analysis is a linear dimensionality reduction algorithm for classification that can be boosted in terms of performance using Deep LDA.
arXiv Detail & Related papers (2023-03-25T08:39:06Z) - An Experimental Study of Dimension Reduction Methods on Machine Learning
Algorithms with Applications to Psychometrics [77.34726150561087]
We show that dimension reduction can decrease, increase, or provide the same accuracy as no reduction of variables.
Our tentative results find that dimension reduction tends to lead to better performance when used for classification tasks.
arXiv Detail & Related papers (2022-10-19T22:07:13Z) - Binary Classification for High Dimensional Data using Supervised
Non-Parametric Ensemble Method [3.1871776847712523]
The dataset for Polycystic Ovary Syndrome is available, which is termed as an endocrinological disorder in women.
Non-Parametric Supervised Ensemble machine learning methods can be used for prediction of the disorder in early stages.
arXiv Detail & Related papers (2022-02-15T23:06:49Z) - Riemannian-based Discriminant Analysis for Feature Extraction and
Classification [2.1485350418225244]
Discriminant analysis is a widely used approach in machine learning to extract low-dimensional features from the high-dimensional data.
Traditional Euclidean-based algorithms for discriminant analysis are easily convergent to a spurious local minima.
We propose a novel method named Riemannian-based Discriminant Analysis (RDA), which transforms the traditional Euclidean-based methods to the Riemannian manifold space.
arXiv Detail & Related papers (2021-01-20T09:13:34Z) - An Uncertainty-Driven GCN Refinement Strategy for Organ Segmentation [53.425900196763756]
We propose a segmentation refinement method based on uncertainty analysis and graph convolutional networks.
We employ the uncertainty levels of the convolutional network in a particular input volume to formulate a semi-supervised graph learning problem.
We show that our method outperforms the state-of-the-art CRF refinement method by improving the dice score by 1% for the pancreas and 2% for spleen.
arXiv Detail & Related papers (2020-12-06T18:55:07Z) - Theoretical Insights Into Multiclass Classification: A High-dimensional
Asymptotic View [82.80085730891126]
We provide the first modernally precise analysis of linear multiclass classification.
Our analysis reveals that the classification accuracy is highly distribution-dependent.
The insights gained may pave the way for a precise understanding of other classification algorithms.
arXiv Detail & Related papers (2020-11-16T05:17:29Z) - Approaching Bio Cellular Classification for Malaria Infected Cells Using
Machine Learning and then Deep Learning to compare & analyze K-Nearest
Neighbours and Deep CNNs [0.0]
Malaria is a deadly disease which claims the lives of hundreds of thousands of people every year.
This paper examines different machine learning methods in the context of classifying the presence of malaria in cell images.
arXiv Detail & Related papers (2020-05-22T23:02:36Z) - Compressing Large Sample Data for Discriminant Analysis [78.12073412066698]
We consider the computational issues due to large sample size within the discriminant analysis framework.
We propose a new compression approach for reducing the number of training samples for linear and quadratic discriminant analysis.
arXiv Detail & Related papers (2020-05-08T05:09:08Z) - Asymptotic Analysis of an Ensemble of Randomly Projected Linear
Discriminants [94.46276668068327]
In [1], an ensemble of randomly projected linear discriminants is used to classify datasets.
We develop a consistent estimator of the misclassification probability as an alternative to the computationally-costly cross-validation estimator.
We also demonstrate the use of our estimator for tuning the projection dimension on both real and synthetic data.
arXiv Detail & Related papers (2020-04-17T12:47:04Z) - Saliency-based Weighted Multi-label Linear Discriminant Analysis [101.12909759844946]
We propose a new variant of Linear Discriminant Analysis (LDA) to solve multi-label classification tasks.
The proposed method is based on a probabilistic model for defining the weights of individual samples.
The Saliency-based weighted Multi-label LDA approach is shown to lead to performance improvements in various multi-label classification problems.
arXiv Detail & Related papers (2020-04-08T19:40:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.