Towards Interpreting Zoonotic Potential of Betacoronavirus Sequences
With Attention
- URL: http://arxiv.org/abs/2108.08077v1
- Date: Wed, 18 Aug 2021 10:11:11 GMT
- Title: Towards Interpreting Zoonotic Potential of Betacoronavirus Sequences
With Attention
- Authors: Kahini Wadhawan and Payel Das and Barbara A. Han and Ilya R. Fischhoff
and Adrian C. Castellanos and Arvind Varsani and Kush R. Varshney
- Abstract summary: We apply an attention-enhanced long-short-term memory (LSTM) deep neural net classifier to a highly conserved viral protein target to predict zoonotic potential across betacoronaviruses.
Analysis and visualization of attention at the sequence and structure-level features indicate possible association between important protein-protein interactions governing viral replication in zoonotic betacoronaviruses and zoonotic transmission.
- Score: 17.406451433347527
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Current methods for viral discovery target evolutionarily conserved proteins
that accurately identify virus families but remain unable to distinguish the
zoonotic potential of newly discovered viruses. Here, we apply an
attention-enhanced long-short-term memory (LSTM) deep neural net classifier to
a highly conserved viral protein target to predict zoonotic potential across
betacoronaviruses. The classifier performs with a 94% accuracy. Analysis and
visualization of attention at the sequence and structure-level features
indicate possible association between important protein-protein interactions
governing viral replication in zoonotic betacoronaviruses and zoonotic
transmission.
Related papers
- Opponent Shaping for Antibody Development [49.26728828005039]
Anti-viral therapies are typically designed to target only the current strains of a virus.
therapy-induced selective pressures act on viruses to drive the emergence of mutated strains, against which initial therapies have reduced efficacy.
We build on a computational model of binding between antibodies and viral antigens to implement a genetic simulation of viral evolutionary escape.
arXiv Detail & Related papers (2024-09-16T14:56:27Z) - DNA: Differentially private Neural Augmentation for contact tracing [62.740950398187664]
Contact tracing is an effective way to reduce infection rates by detecting potential virus carriers early.
We substantially improve the privacy guarantees of the current state of the art in decentralized contact tracing.
This work marks an important first step in integrating deep learning into contact tracing while maintaining essential privacy guarantees.
arXiv Detail & Related papers (2024-04-20T13:43:28Z) - Virus2Vec: Viral Sequence Classification Using Machine Learning [48.40285316053593]
We propose Virus2Vec, a feature-vector representation for viral sequences that enable machine learning models to identify viral hosts.
We empirically evaluate Virus2Vec on real-world spike sequences of Coronaviridae and rabies virus sequence data to predict the host.
Our results demonstrate that Virus2Vec outperforms the predictive accuracies of baseline and state-of-the-art methods.
arXiv Detail & Related papers (2023-04-24T08:17:16Z) - PCD2Vec: A Poisson Correction Distance-Based Approach for Viral Host
Classification [0.966840768820136]
Coronaviruses are membrane-enveloped, non-segmented positive-strand RNA viruses belonging to the Coronaviridae family.
In the Coronavirus genome, an essential structural region is the spike region, and it's responsible for attaching the virus to the host cell membrane.
We propose a novel method for predicting the host specificity of coronaviruses by analyzing spike protein sequences from different viral subgenera and species.
arXiv Detail & Related papers (2023-04-13T03:02:22Z) - Accurate Virus Identification with Interpretable Raman Signatures by
Machine Learning [12.184128048998906]
We present a machine learning approach for analyzing Raman spectra of human and avian viruses.
A Convolutional Neural Network (CNN) classifier specifically designed for spectral data achieves very high accuracy for a variety of virus type or subtype identification tasks.
arXiv Detail & Related papers (2022-06-05T22:31:14Z) - Persistent Animal Identification Leveraging Non-Visual Markers [71.14999745312626]
We aim to locate and provide a unique identifier for each mouse in a cluttered home-cage environment through time.
This is a very challenging problem due to (i) the lack of distinguishing visual features for each mouse, and (ii) the close confines of the scene with constant occlusion.
Our approach achieves 77% accuracy on this animal identification problem, and is able to reject spurious detections when the animals are hidden.
arXiv Detail & Related papers (2021-12-13T17:11:32Z) - A multitask transfer learning framework for the prediction of
virus-human protein-protein interactions [0.30586855806896046]
We develop a transfer learning approach that exploits the information of around 24 million protein sequences and the interaction patterns from the human interactome.
We employ an additional objective which aims to maximize the probability of observing human protein-protein interactions.
Experimental results show that our proposed model works effectively for both virus-human and bacteria-human protein-protein interaction prediction tasks.
arXiv Detail & Related papers (2021-11-26T07:53:51Z) - Classification of Influenza Hemagglutinin Protein Sequences using
Convolutional Neural Networks [8.397189036839956]
This paper focuses on accurately predicting if an Influenza type A virus can infect specific hosts, and more specifically, Human, Avian and Swine hosts, using only the protein sequence of the HA gene.
We propose encoding the protein sequences into numerical signals using the Hydrophobicity Index and subsequently utilising a Convolutional Neural Network-based predictive model.
As the results show, the proposed model can distinguish HA protein sequences with high accuracy whenever the virus under investigation can infect Human, Avian or Swine hosts.
arXiv Detail & Related papers (2021-08-09T10:42:26Z) - A k-mer Based Approach for SARS-CoV-2 Variant Identification [55.78588835407174]
We show that preserving the order of the amino acids helps the underlying classifiers to achieve better performance.
We also show the importance of the different amino acids which play a key role in identifying variants and how they coincide with those reported by the USA's Centers for Disease Control and Prevention (CDC)
arXiv Detail & Related papers (2021-08-07T15:08:15Z) - Multi-Task Neural Networks with Spatial Activation for Retinal Vessel
Segmentation and Artery/Vein Classification [49.64863177155927]
We propose a multi-task deep neural network with spatial activation mechanism to segment full retinal vessel, artery and vein simultaneously.
The proposed network achieves pixel-wise accuracy of 95.70% for vessel segmentation, and A/V classification accuracy of 94.50%, which is the state-of-the-art performance for both tasks.
arXiv Detail & Related papers (2020-07-18T05:46:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.