Binary Classification for High Dimensional Data using Supervised
Non-Parametric Ensemble Method
- URL: http://arxiv.org/abs/2202.07779v1
- Date: Tue, 15 Feb 2022 23:06:49 GMT
- Title: Binary Classification for High Dimensional Data using Supervised
Non-Parametric Ensemble Method
- Authors: Nandan Kanvinde, Abhishek Gupta, Raunak Joshi
- Abstract summary: The dataset for Polycystic Ovary Syndrome is available, which is termed as an endocrinological disorder in women.
Non-Parametric Supervised Ensemble machine learning methods can be used for prediction of the disorder in early stages.
- Score: 3.1871776847712523
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Medical Research data used for prognostication deals with binary
classification problems in most of the cases. The endocrinological disorders
have data available and it can be leveraged using Machine Learning. The dataset
for Polycystic Ovary Syndrome is available, which is termed as an
endocrinological disorder in women. Non-Parametric Supervised Ensemble machine
learning methods can be used for prediction of the disorder in early stages. In
this paper we present the Bootstrap Aggregation Supervised Ensemble
Non-parametric method for prognostication that competes state-of-the-art
performance with accuracy of over 92% along with in depth analysis of the data.
Related papers
- General targeted machine learning for modern causal mediation analysis [3.813608775141218]
Causal mediation analyses investigate the mechanisms through which causes exert their effects.
We show that the identification formulas for six popular non-parametric approaches to mediation analysis can be recovered from just two statistical estimands.
We propose an all-purpose one-step estimation algorithm that can be coupled with machine learning in any mediation study.
arXiv Detail & Related papers (2024-08-26T20:31:26Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Few-shot learning for COVID-19 Chest X-Ray Classification with
Imbalanced Data: An Inter vs. Intra Domain Study [49.5374512525016]
Medical image datasets are essential for training models used in computer-aided diagnosis, treatment planning, and medical research.
Some challenges are associated with these datasets, including variability in data distribution, data scarcity, and transfer learning issues when using models pre-trained from generic images.
We propose a methodology based on Siamese neural networks in which a series of techniques are integrated to mitigate the effects of data scarcity and distribution imbalance.
arXiv Detail & Related papers (2024-01-18T16:59:27Z) - STEM Rebalance: A Novel Approach for Tackling Imbalanced Datasets using
SMOTE, Edited Nearest Neighbour, and Mixup [0.20482269513546458]
Imbalanced datasets in medical imaging are characterized by skewed class proportions and scarcity of abnormal cases.
This paper investigates the potential of using Mixup augmentation to generate new data points as a generic vicinal distribution.
We focus on the breast cancer problem, where imbalanced datasets are prevalent.
arXiv Detail & Related papers (2023-11-13T17:45:28Z) - A Perceptron-based Fine Approximation Technique for Linear Separation [0.0]
This paper presents a novel online learning method that aims at finding a separator hyperplane between data points labelled as either positive or negative.
Weights and biases of artificial neurons can directly be related to hyperplanes in high-dimensional spaces.
The presented method has proven converge; empirical results show that it can be more efficient than the Perceptron algorithm.
arXiv Detail & Related papers (2023-09-12T08:35:24Z) - Discriminant Analysis in Contrasting Dimensions for Polycystic Ovary
Syndrome Prognostication [3.1871776847712523]
Polycystic Ovary Syndrome also known as PCOS is a binary classification problem.
We present Discriminant Analysis in different dimensions with Linear and Quadratic form for binary classification along with metrics.
We were able to achieve good accuracy and less variation with Discriminant Analysis.
arXiv Detail & Related papers (2022-01-09T14:55:17Z) - Non-stationary Gaussian process discriminant analysis with variable
selection for high-dimensional functional data [0.0]
High-dimensional classification and feature selection are ubiquitous with the recent advancement in data acquisition technology.
These structures pose additional challenges to commonly used methods that rely mainly on a two-stage approach performing variable selection and classification separately.
We propose in this work a novel Gaussian process discriminant analysis (GPDA) that combines these steps in a unified framework.
arXiv Detail & Related papers (2021-09-29T03:35:49Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Learning Binary Semantic Embedding for Histology Image Classification
and Retrieval [56.34863511025423]
We propose a novel method for Learning Binary Semantic Embedding (LBSE)
Based on the efficient and effective embedding, classification and retrieval are performed to provide interpretable computer-assisted diagnosis for histology images.
Experiments conducted on three benchmark datasets validate the superiority of LBSE under various scenarios.
arXiv Detail & Related papers (2020-10-07T08:36:44Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.