Genome Sequence Classification for Animal Diagnostics with Graph
Representations and Deep Neural Networks
- URL: http://arxiv.org/abs/2007.12791v1
- Date: Fri, 24 Jul 2020 22:30:18 GMT
- Title: Genome Sequence Classification for Animal Diagnostics with Graph
Representations and Deep Neural Networks
- Authors: Sai Narayanan, Akhilesh Ramachandran, Sathyanarayanan N. Aakur,
Arunkumar Bagavathi
- Abstract summary: Bovine Respiratory Disease Complex (BRDC) is a complex respiratory disease in cattle with multiple etiologies, including bacterial and viral.
Current animal disease diagnostics is based on traditional tests such as bacterial culture, serolog, and Polymerase Chain Reaction (PCR) tests.
We show that networks-based machine learning approaches can detect pathogen signature with up to 89.7% accuracy.
- Score: 4.339839287869652
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Bovine Respiratory Disease Complex (BRDC) is a complex respiratory disease in
cattle with multiple etiologies, including bacterial and viral. It is estimated
that mortality, morbidity, therapy, and quarantine resulting from BRDC account
for significant losses in the cattle industry. Early detection and management
of BRDC are crucial in mitigating economic losses. Current animal disease
diagnostics is based on traditional tests such as bacterial culture, serolog,
and Polymerase Chain Reaction (PCR) tests. Even though these tests are
validated for several diseases, their main challenge is their limited ability
to detect the presence of multiple pathogens simultaneously. Advancements of
data analytics and machine learning and applications over metagenome sequencing
are setting trends on several applications. In this work, we demonstrate a
machine learning approach to identify pathogen signatures present in bovine
metagenome sequences using k-mer-based network embedding followed by a deep
learning-based classification task. With experiments conducted on two different
simulated datasets, we show that networks-based machine learning approaches can
detect pathogen signature with up to 89.7% accuracy. We will make the data
available publicly upon request to tackle this important problem in a difficult
domain.
Related papers
- PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation Model [9.285895422810704]
PathoLM is a cutting-edge pathogen language model optimized for the identification of pathogenicity in bacterial and viral sequences.
We developed a comprehensive data set comprising approximately 30 species of viruses and bacteria, including ESKAPEE pathogens.
In comparative assessments, PathoLM dramatically outperforms existing models like DciPatho, demonstrating robust zero-shot and few-shot capabilities.
arXiv Detail & Related papers (2024-06-19T00:53:48Z) - Highly Accurate Disease Diagnosis and Highly Reproducible Biomarker
Identification with PathFormer [32.26944736442376]
Graph neural networks (GNNs) have been the dominant deep learning model for analyzing graph-structured data.
The root of the challenges is the unique graph structure of biological signaling pathways.
We present a novel GNN model architecture, named PathFormer, which integrates signaling network, priori knowledge and omics data to rank biomarkers and predict disease diagnosis.
arXiv Detail & Related papers (2024-02-11T18:23:54Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - Scalable Pathogen Detection from Next Generation DNA Sequencing with
Deep Learning [3.8175773487333857]
We propose MG2Vec, a deep learning-based solution that uses the transformer network as its backbone.
We show that the proposed approach can help detect pathogens from uncurated, real-world clinical samples.
We provide a comprehensive evaluation of a novel representation learning framework for metagenome-based disease diagnostics with deep learning.
arXiv Detail & Related papers (2022-11-30T00:13:59Z) - Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence
Classification [109.81283748940696]
We introduce several ways to perturb SARS-CoV-2 genome sequences to mimic the error profiles of common sequencing platforms such as Illumina and PacBio.
We show that some simulation-based approaches are more robust (and accurate) than others for specific embedding methods to certain adversarial attacks to the input sequences.
arXiv Detail & Related papers (2022-07-18T19:16:56Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - Deep neural networks approach to microbial colony detection -- a
comparative analysis [52.77024349608834]
This study investigates the performance of three deep learning approaches for object detection on the AGAR dataset.
The achieved results may serve as a benchmark for future experiments.
arXiv Detail & Related papers (2021-08-23T12:06:00Z) - MG-NET: Leveraging Pseudo-Imaging for Multi-Modal Metagenome Analysis [5.04905391284093]
We propose MG-Net, a self-supervised representation learning framework.
We show that MG-Net can learn robust representations from unlabeled data.
Experiments show that the learned features outperform current baseline metagenome representations.
arXiv Detail & Related papers (2021-07-21T05:53:01Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.