MIRACLE: Multi-task Learning based Interpretable Regulation of
Autoimmune Diseases through Common Latent Epigenetics
- URL: http://arxiv.org/abs/2306.13866v2
- Date: Thu, 3 Aug 2023 04:34:00 GMT
- Title: MIRACLE: Multi-task Learning based Interpretable Regulation of
Autoimmune Diseases through Common Latent Epigenetics
- Authors: Pengcheng Xu, Jinpu Cai, Yulin Gao, Ziqi Rong
- Abstract summary: MIRACLE is a novel interpretable neural network that integrates multiple datasets and jointly identifies common patterns in DNA methylation.
It was tested on six datasets covering rheumatoid arthritis, systemic lupus erythematosus, multiple sclerosis, inflammatory bowel disease, psoriasis, and type 1 diabetes.
- Score: 1.8632273262541308
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: DNA methylation is a crucial regulator of gene transcription and has been
linked to various diseases, including autoimmune diseases and cancers. However,
diagnostics based on DNA methylation face challenges due to large feature sets
and small sample sizes, resulting in overfitting and suboptimal performance. To
address these issues, we propose MIRACLE, a novel interpretable neural network
that leverages autoencoder-based multi-task learning to integrate multiple
datasets and jointly identify common patterns in DNA methylation.
MIRACLE's architecture reflects the relationships between methylation sites,
genes, and pathways, ensuring biological interpretability and meaningfulness.
The network comprises an encoder and a decoder, with a bottleneck layer
representing pathway information as the basic unit of heredity. A custom
MaskedLinear layer is constrained by the adjacency matrix of the
site-gene-pathway graph, which provides explainability and makes the
site-gene-pathway hierarchy explicit. Separate task-specific classifiers
then predict each disease from the shared embedding.
Tested on six datasets, including rheumatoid arthritis, systemic lupus
erythematosus, multiple sclerosis, inflammatory bowel disease, psoriasis, and
type 1 diabetes, MIRACLE demonstrates robust performance in identifying common
functions of DNA methylation across different phenotypes, with higher accuracy
in predicting diseases than baseline methods. By incorporating biological
prior knowledge, MIRACLE offers a meaningful and interpretable framework for
DNA methylation data analysis in the context of autoimmune diseases.
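The masked-layer idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy site-gene and gene-pathway masks, the layer sizes, the `tanh` activations, the task names, and every function name below are assumptions made for illustration, and the decoder and training loop are omitted. The sketch only shows how a 0/1 adjacency mask removes connections that the prior knowledge does not support, and how several task heads can share one pathway embedding.

```python
import math
import random


def masked_linear(x, weights, mask, bias):
    """Linear map whose weights are multiplied elementwise by a 0/1 mask.

    y_j = sum_i x_i * W[i][j] * M[i][j] + b_j, so input i can influence
    output j only where mask[i][j] == 1.
    """
    return [
        sum(x[i] * weights[i][j] * mask[i][j] for i in range(len(x))) + bias[j]
        for j in range(len(bias))
    ]


def init_weights(n_in, n_out, rng):
    """Random dense weights; the mask, not the initializer, enforces sparsity."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(n_out)] for _ in range(n_in)]


# Hypothetical prior knowledge: 4 methylation sites -> 3 genes -> 2 pathways.
SITE_GENE = [
    [1, 0, 0],  # site 0 belongs to gene 0
    [1, 0, 0],  # site 1 belongs to gene 0
    [0, 1, 0],  # site 2 belongs to gene 1
    [0, 0, 1],  # site 3 belongs to gene 2
]
GENE_PATHWAY = [
    [1, 0],  # gene 0 is in pathway 0
    [1, 1],  # gene 1 is in both pathways
    [0, 1],  # gene 2 is in pathway 1
]

rng = random.Random(0)
W_SITE_GENE = init_weights(4, 3, rng)
W_GENE_PATHWAY = init_weights(3, 2, rng)

# One small logistic head per task, all reading the same pathway embedding.
# The task names are placeholders, not the datasets used in the paper.
HEADS = {
    name: ([rng.uniform(-0.5, 0.5) for _ in range(2)], 0.0)
    for name in ("task_a", "task_b")
}


def encode(sites):
    """Encoder: sites -> genes -> pathways, each step restricted by its mask."""
    genes = [
        math.tanh(v)
        for v in masked_linear(sites, W_SITE_GENE, SITE_GENE, [0.0] * 3)
    ]
    return [
        math.tanh(v)
        for v in masked_linear(genes, W_GENE_PATHWAY, GENE_PATHWAY, [0.0] * 2)
    ]


def predict(sites):
    """Return one probability per task, computed from the shared embedding."""
    z = encode(sites)
    out = {}
    for name, (w, b) in HEADS.items():
        logit = sum(w_k * z_k for w_k, z_k in zip(w, z)) + b
        out[name] = 1.0 / (1.0 + math.exp(-logit))
    return out


sample = [0.2, 0.4, 0.6, 0.8]
print(encode(sample))
print(predict(sample))
```

In this toy graph, site 3 maps only to gene 2, and gene 2 maps only to pathway 1, so changing site 3 cannot alter the pathway 0 unit. That traceability from pathway back to sites is the kind of interpretability the masking is meant to provide.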
Related papers
- Stacked ensemble-based mutagenicity prediction model using multiple modalities with graph attention network [0.9736758288065405]
Mutagenicity is a concern due to its association with genetic mutations which can result in a variety of negative consequences.
In this work, we introduce a novel stacked ensemble based mutagenicity prediction model.
arXiv Detail & Related papers (2024-09-03T09:14:21Z)
- MMIL: A novel algorithm for disease associated cell type discovery [58.044870442206914]
Single-cell datasets often lack individual cell labels, making it challenging to identify cells associated with disease.
We introduce Mixture Modeling for Multiple Instance Learning (MMIL), an expectation maximization method that enables the training and calibration of cell-level classifiers.
arXiv Detail & Related papers (2024-06-12T15:22:56Z)
- GestaltMML: Enhancing Rare Genetic Disease Diagnosis through Multimodal Machine Learning Combining Facial Images and Clinical Texts [8.805728428427457]
We introduce a multimodal machine learning (MML) approach solely based on the Transformer architecture.
It integrates facial images, demographic information (age, sex, ethnicity), and clinical notes to improve prediction accuracy.
arXiv Detail & Related papers (2023-12-23T18:40:25Z)
- Genetic InfoMax: Exploring Mutual Information Maximization in High-Dimensional Imaging Genetics Studies [50.11449968854487]
Genome-wide association studies (GWAS) are used to identify relationships between genetic variations and specific traits.
Representation learning for imaging genetics is largely under-explored due to the unique challenges posed by GWAS.
We introduce a trans-modal learning framework Genetic InfoMax (GIM) to address the specific challenges of GWAS.
arXiv Detail & Related papers (2023-09-26T03:59:21Z)
- Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z)
- Domain Invariant Model with Graph Convolutional Network for Mammogram Classification [49.691629817104925]
We propose a novel framework, namely Domain Invariant Model with Graph Convolutional Network (DIM-GCN).
We first propose a Bayesian network, which explicitly decomposes the latent variables into disease-related and other disease-irrelevant parts that are provable to be disentangled from each other.
To better capture the macroscopic features, we leverage the observed clinical attributes as a goal for reconstruction, via Graph Convolutional Network (GCN).
arXiv Detail & Related papers (2022-04-21T08:23:44Z)
- Data-Driven Logistic Regression Ensembles With Applications in Genomics [0.0]
We propose a new approach for dealing with high-dimensional binary classification problems that combines ideas from regularization and ensembling.
We demonstrate the good performance of our method in terms of prediction accuracy and identification of key biomarkers using several medical datasets involving common diseases such as cancer, multiple sclerosis and psoriasis.
arXiv Detail & Related papers (2021-02-17T05:57:26Z)
- G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z)
- A Cross-Level Information Transmission Network for Predicting Phenotype from New Genotype: Application to Cancer Precision Medicine [37.442717660492384]
We propose a novel Cross-LEvel Information Transmission network (CLEIT) framework.
Inspired by domain adaptation, CLEIT first learns the latent representation of high-level domain then uses it as ground-truth embedding.
We demonstrate the effectiveness and performance boost of CLEIT in predicting anti-cancer drug sensitivity from somatic mutations.
arXiv Detail & Related papers (2020-10-09T22:01:00Z)
- Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, which is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.