MG-NET: Leveraging Pseudo-Imaging for Multi-Modal Metagenome Analysis
- URL: http://arxiv.org/abs/2107.09883v1
- Date: Wed, 21 Jul 2021 05:53:01 GMT
- Title: MG-NET: Leveraging Pseudo-Imaging for Multi-Modal Metagenome Analysis
- Authors: Sathyanarayanan N. Aakur, Sai Narayanan, Vineela Indla, Arunkumar
Bagavathi, Vishalini Laguduva Ramnath, Akhilesh Ramachandran
- Abstract summary: We propose MG-Net, a self-supervised representation learning framework.
We show that MG-Net can learn robust representations from unlabeled data.
Experiments show that the learned features outperform current baseline metagenome representations.
- Score: 5.04905391284093
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The emergence of novel pathogens and zoonotic diseases like the SARS-CoV-2
have underlined the need for developing novel diagnosis and intervention
pipelines that can learn rapidly from small amounts of labeled data. Combined
with technological advances in next-generation sequencing, metagenome-based
diagnostic tools hold much promise to revolutionize rapid point-of-care
diagnosis. However, there are significant challenges in developing such an
approach, the chief among which is to learn self-supervised representations
that can help detect novel pathogen signatures with very low amounts of labeled
data. This is particularly a difficult task given that closely related
pathogens can share more than 90% of their genome structure. In this work, we
address these challenges by proposing MG-Net, a self-supervised representation
learning framework that leverages multi-modal context using pseudo-imaging data
derived from clinical metagenome sequences. We show that the proposed framework
can learn robust representations from unlabeled data that can be used for
downstream tasks such as metagenome sequence classification with limited access
to labeled data. Extensive experiments show that the learned features
outperform current baseline metagenome representations, given only 1000 samples
per class.
Related papers
- Scalable Pathogen Detection from Next Generation DNA Sequencing with
Deep Learning [3.8175773487333857]
We propose MG2Vec, a deep learning-based solution that uses the transformer network as its backbone.
We show that the proposed approach can help detect pathogens from uncurated, real-world clinical samples.
We provide a comprehensive evaluation of a novel representation learning framework for metagenome-based disease diagnostics with deep learning.
arXiv Detail & Related papers (2022-11-30T00:13:59Z) - Metagenome2Vec: Building Contextualized Representations for Scalable
Metagenome Analysis [4.807955518532493]
We propose Metagenome2Vec - a contextualized representation that captures the global structural properties inherent in metagenome data.
We show that the learned representations can help detect six (6) related pathogens from clinical samples with less than 100 labeled sequences.
arXiv Detail & Related papers (2021-11-09T23:21:10Z) - METGAN: Generative Tumour Inpainting and Modality Synthesis in Light
Sheet Microscopy [4.872960046536882]
We introduce a novel generative method which leverages real anatomical information to generate realistic image-label pairs of tumours.
We construct a dual-pathway generator, for the anatomical image and label, trained in a cycle-consistent setup, constrained by an independent, pretrained segmentor.
The generated images yield significant quantitative improvement compared to existing methods.
arXiv Detail & Related papers (2021-04-22T11:18:17Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Weakly-Supervised Cross-Domain Adaptation for Endoscopic Lesions
Segmentation [79.58311369297635]
We propose a new weakly-supervised lesions transfer framework, which can explore transferable domain-invariant knowledge across different datasets.
A Wasserstein quantified transferability framework is developed to highlight widerange transferable contextual dependencies.
A novel self-supervised pseudo label generator is designed to equally provide confident pseudo pixel labels for both hard-to-transfer and easy-to-transfer target samples.
arXiv Detail & Related papers (2020-12-08T02:26:03Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Genome Sequence Classification for Animal Diagnostics with Graph
Representations and Deep Neural Networks [4.339839287869652]
Bovine Respiratory Disease Complex (BRDC) is a complex respiratory disease in cattle with multiple etiologies, including bacterial and viral.
Current animal disease diagnostics is based on traditional tests such as bacterial culture, serolog, and Polymerase Chain Reaction (PCR) tests.
We show that networks-based machine learning approaches can detect pathogen signature with up to 89.7% accuracy.
arXiv Detail & Related papers (2020-07-24T22:30:18Z) - Semi-supervised Medical Image Classification with Relation-driven
Self-ensembling Model [71.80319052891817]
We present a relation-driven semi-supervised framework for medical image classification.
It exploits the unlabeled data by encouraging the prediction consistency of given input under perturbations.
Our method outperforms many state-of-the-art semi-supervised learning methods on both single-label and multi-label image classification scenarios.
arXiv Detail & Related papers (2020-05-15T06:57:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.