Simultaneous imputation and disease classification in incomplete medical
datasets using Multigraph Geometric Matrix Completion (MGMC)
- URL: http://arxiv.org/abs/2005.06935v1
- Date: Thu, 14 May 2020 13:11:35 GMT
- Title: Simultaneous imputation and disease classification in incomplete medical
datasets using Multigraph Geometric Matrix Completion (MGMC)
- Authors: Gerome Vivar, Anees Kazi, Hendrik Burwinkel, Andreas Zwergal, Nassir
Navab, Seyed-Ahmad Ahmadi
- Abstract summary: We propose an end-to-end learning of imputation and disease prediction of incomplete medical datasets via Multigraph Geometric Matrix Completion.
We empirically show the superiority of our proposed approach in terms of classification and imputation performance when compared with state-of-the-art approaches.
- Score: 37.160335232396406
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Large-scale population-based studies in medicine are a key resource towards
better diagnosis, monitoring, and treatment of diseases. They also serve as
enablers of clinical decision support systems, in particular Computer Aided
Diagnosis (CADx) using machine learning (ML). Numerous ML approaches for CADx
have been proposed in literature. However, these approaches assume full data
availability, which is not always feasible in clinical data. To account for
missing data, incomplete data samples are either removed or imputed, which
could lead to data bias and may negatively affect classification performance.
As a solution, we propose an end-to-end learning of imputation and disease
prediction of incomplete medical datasets via Multigraph Geometric Matrix
Completion (MGMC). MGMC uses multiple recurrent graph convolutional networks,
where each graph represents an independent population model based on a key
clinical meta-feature like age, sex, or cognitive function. Graph signal
aggregation from local patient neighborhoods, combined with multigraph signal
fusion via self-attention, has a regularizing effect on both matrix
reconstruction and classification performance. Our proposed approach is able to
impute class relevant features as well as perform accurate classification on
two publicly available medical datasets. We empirically show the superiority of
our proposed approach in terms of classification and imputation performance
when compared with state-of-the-art approaches. MGMC enables disease prediction
in multimodal and incomplete medical datasets. These findings could serve as
baseline for future CADx approaches which utilize incomplete datasets.
Related papers
- Graph-Ensemble Learning Model for Multi-label Skin Lesion Classification
using Dermoscopy and Clinical Images [7.159532626507458]
This study introduces a Graph Convolution Network (GCN) to exploit prior co-occurrence between each category as a correlation matrix into the deep learning model for the multi-label classification.
We propose a Graph-Ensemble Learning Model (GELN) that views the prediction from GCN as complementary information of the predictions from the fusion model.
arXiv Detail & Related papers (2023-07-04T13:19:57Z) - LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical
Imaging via Second-order Graph Matching [59.01894976615714]
We introduce LVM-Med, the first family of deep networks trained on large-scale medical datasets.
We have collected approximately 1.3 million medical images from 55 publicly available datasets.
LVM-Med empirically outperforms a number of state-of-the-art supervised, self-supervised, and foundation models.
arXiv Detail & Related papers (2023-06-20T22:21:34Z) - Unsupervised pre-training of graph transformers on patient population
graphs [48.02011627390706]
We propose a graph-transformer-based network to handle heterogeneous clinical data.
We show the benefit of our pre-training method in a self-supervised and a transfer learning setting.
arXiv Detail & Related papers (2022-07-21T16:59:09Z) - Multi-modal Graph Learning for Disease Prediction [35.156975779372836]
We propose an end-to-end Multi-modal Graph Learning framework (MMGL) for disease prediction with multi-modality.
Instead of defining the graph manually, the latent graph structure is captured through an effective way of adaptive graph learning.
An extensive group of experiments on two disease prediction tasks demonstrates that the proposed MMGL achieves more favorable performance.
arXiv Detail & Related papers (2022-03-11T12:33:20Z) - A Real Use Case of Semi-Supervised Learning for Mammogram Classification
in a Local Clinic of Costa Rica [0.5541644538483946]
Training a deep learning model requires a considerable amount of labeled images.
A number of publicly available datasets have been built with data from different hospitals and clinics.
The use of the semi-supervised deep learning approach known as MixMatch, to leverage the usage of unlabeled data is proposed and evaluated.
arXiv Detail & Related papers (2021-07-24T22:26:50Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z) - A Deep Learning Pipeline for Patient Diagnosis Prediction Using
Electronic Health Records [0.5672132510411464]
We develop and publish a Python package to transform public health dataset into easy to access universal format.
We propose two novel model architectures to predict multiple diagnoses simultaneously.
Both models can predict multiple diagnoses simultaneously with high accuracy.
arXiv Detail & Related papers (2020-06-23T14:58:58Z) - Dynamic Graph Correlation Learning for Disease Diagnosis with Incomplete
Labels [66.57101219176275]
Disease diagnosis on chest X-ray images is a challenging multi-label classification task.
We propose a Disease Diagnosis Graph Convolutional Network (DD-GCN) that presents a novel view of investigating the inter-dependency among different diseases.
Our method is the first to build a graph over the feature maps with a dynamic adjacency matrix for correlation learning.
arXiv Detail & Related papers (2020-02-26T17:10:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.