The Relevance Feature and Vector Machine for health applications
- URL: http://arxiv.org/abs/2402.07079v1
- Date: Sun, 11 Feb 2024 01:21:56 GMT
- Title: The Relevance Feature and Vector Machine for health applications
- Authors: Albert Belenguer-Llorens, Carlos Sevilla-Salcedo, Emilio
Parrado-Hern\'andez and Vanessa G\'omez-Verdejo
- Abstract summary: This paper presents a novel model that addresses the challenges of the fat-data problem when dealing with clinical prospective studies.
The model capabilities are tested against state-of-the-art models in several medical datasets with fat-data problems.
- Score: 0.11538034264098687
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This paper presents the Relevance Feature and Vector Machine (RFVM), a novel
model that addresses the challenges of the fat-data problem when dealing with
clinical prospective studies. The fat-data problem refers to the limitations of
Machine Learning (ML) algorithms when working with databases in which the
number of features is much larger than the number of samples (a common scenario
in certain medical fields). To overcome such limitations, the RFVM incorporates
different characteristics: (1) A Bayesian formulation which enables the model
to infer its parameters without overfitting thanks to the Bayesian model
averaging. (2) A joint optimisation that overcomes the limitations arising from
the fat-data characteristic by simultaneously including the variables that
define the primal space (features) and those that define the dual space
(observations). (3) An integrated prunning that removes the irrelevant features
and samples during the training iterative optimization. Also, this last point
turns out crucial when performing medical prospective studies, enabling
researchers to exclude unnecessary medical tests, reducing costs and
inconvenience for patients, and identifying the critical patients/subjects that
characterize the disorder and, subsequently, optimize the patient recruitment
process that leads to a balanced cohort. The model capabilities are tested
against state-of-the-art models in several medical datasets with fat-data
problems. These experimental works show that RFVM is capable of achieving
competitive classification accuracies while providing the most compact subset
of data (in both terms of features and samples). Moreover, the selected
features (medical tests) seem to be aligned with the existing medical
literature.
Related papers
- Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Question-Answering Model for Schizophrenia Symptoms and Their Impact on
Daily Life using Mental Health Forums Data [0.0]
The Mental Health'' forum was used, a forum dedicated to people suffering from schizophrenia and different mental disorders.
It is shown how to pre-process the dataset to convert it into a QA dataset.
The BiBERT, DistilBERT, RoBERTa, and BioBERT models were fine-tuned and evaluated via F1-Score, Exact Match, Precision and Recall.
arXiv Detail & Related papers (2023-09-30T17:50:50Z) - Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review
and Replicability Study [60.56194508762205]
We reproduce, compare, and analyze state-of-the-art automated medical coding machine learning models.
We show that several models underperform due to weak configurations, poorly sampled train-test splits, and insufficient evaluation.
We present the first comprehensive results on the newly released MIMIC-IV dataset using the reproduced models.
arXiv Detail & Related papers (2023-04-21T11:54:44Z) - A Multimodal Data-driven Framework for Anxiety Screening [15.002401707506941]
We propose a data-driven anxiety screening framework, namely MMD-AS, and conduct experiments on the collected health data of over 200 seafarers by smartphones.
The proposed framework's feature extraction, dimension reduction, feature selection, and anxiety inference are jointly trained to improve the model's performance.
arXiv Detail & Related papers (2023-03-16T02:25:05Z) - Bayesian Models of Functional Connectomics and Behavior [0.0]
We present a fully bayesian formulation for joint representation learning and prediction.
We present preliminary results on a subset of a publicly available clinical rs-fMRI study on patients with Autism Spectrum Disorder.
arXiv Detail & Related papers (2023-01-15T20:42:31Z) - Benchmarking Heterogeneous Treatment Effect Models through the Lens of
Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem.
Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools.
We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z) - Active Selection of Classification Features [0.0]
Auxiliary data, such as demographics, might help in selecting a smaller sample that comprises the individuals with the most informative MRI scans.
We propose two utility-based approaches for this problem, and evaluate their performance on three public real-world benchmark datasets.
arXiv Detail & Related papers (2021-02-26T18:19:08Z) - FIT: a Fast and Accurate Framework for Solving Medical Inquiring and
Diagnosing Tasks [10.687562550605739]
Self-diagnosis provides low-cost and accessible healthcare via an agent that queries the patient and makes predictions about possible diseases.
We propose a competitive framework, called FIT, which uses an information-theoretic reward to determine what data to collect next.
Our results in two simulated datasets show that FIT can effectively deal with large search space problems, outperforming existing baselines.
arXiv Detail & Related papers (2020-12-02T10:12:49Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Predicting Clinical Diagnosis from Patients Electronic Health Records
Using BERT-based Neural Networks [62.9447303059342]
We show the importance of this problem in medical community.
We present a modification of Bidirectional Representations from Transformers (BERT) model for classification sequence.
We use a large-scale Russian EHR dataset consisting of about 4 million unique patient visits.
arXiv Detail & Related papers (2020-07-15T09:22:55Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.