Predicting the Binding of SARS-CoV-2 Peptides to the Major
Histocompatibility Complex with Recurrent Neural Networks
- URL: http://arxiv.org/abs/2104.08237v1
- Date: Fri, 16 Apr 2021 17:16:35 GMT
- Title: Predicting the Binding of SARS-CoV-2 Peptides to the Major
Histocompatibility Complex with Recurrent Neural Networks
- Authors: Johanna Vielhaben, Markus Wenzel, Eva Weicken, Nils Strodthoff
- Abstract summary: We adapt and extend USMPep, a proposed, conceptually simple prediction algorithm based on recurrent neural networks.
We evaluate the performance on a recently released SARS-CoV-2 dataset with binding stability measurements.
- Score: 0.40040974874482094
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Predicting the binding of viral peptides to the major histocompatibility
complex with machine learning can potentially extend the computational
immunology toolkit for vaccine development, and serve as a key component in the
fight against a pandemic. In this work, we adapt and extend USMPep, a recently
proposed, conceptually simple prediction algorithm based on recurrent neural
networks. Most notably, we combine regressors (binding affinity data) and
classifiers (mass spectrometry data) from qualitatively different data sources
to obtain a more comprehensive prediction tool. We evaluate the performance on
a recently released SARS-CoV-2 dataset with binding stability measurements.
USMPep not only sets new benchmarks on selected single alleles, but
consistently turns out to be among the best-performing methods or, for some
metrics, to be even the overall best-performing method for this task.
Related papers
- AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope Prediction [12.433560411515575]
We introduce a filtered antibody-antigen complex structure dataset, AsEP.
AsEP is the largest of its kind and provides clustered groups, allowing the community to develop prediction methods.
We propose a new method, WALLE, that leverages both protein language models and graph neural networks.
arXiv Detail & Related papers (2024-07-25T16:43:56Z) - NovoBench: Benchmarking Deep Learning-based De Novo Peptide Sequencing Methods in Proteomics [58.03989832372747]
We present the first unified benchmark NovoBench for emphde novo peptide sequencing.
It comprises diverse mass spectrum data, integrated models, and comprehensive evaluation metrics.
Recent methods, including DeepNovo, PointNovo, Casanovo, InstaNovo, AdaNovo and $pi$-HelixNovo are integrated into our framework.
arXiv Detail & Related papers (2024-06-16T08:23:21Z) - Convolutional Monge Mapping Normalization for learning on sleep data [63.22081662149488]
We propose a new method called Convolutional Monge Mapping Normalization (CMMN)
CMMN consists in filtering the signals in order to adapt their power spectrum density (PSD) to a Wasserstein barycenter estimated on training data.
Numerical experiments on sleep EEG data show that CMMN leads to significant and consistent performance gains independent from the neural network architecture.
arXiv Detail & Related papers (2023-05-30T08:24:01Z) - Forecast reconciliation for vaccine supply chain optimization [61.13962963550403]
Vaccine supply chain optimization can benefit from hierarchical time series forecasting.
Forecasts of different hierarchy levels become incoherent when higher levels do not match the sum of the lower levels forecasts.
We tackle the vaccine sale forecasting problem by modeling sales data from GSK between 2010 and 2021 as a hierarchical time series.
arXiv Detail & Related papers (2023-05-02T14:34:34Z) - A Supervised Machine Learning Approach for Sequence Based
Protein-protein Interaction (PPI) Prediction [4.916874464940376]
Computational protein-protein interaction (PPI) prediction techniques can contribute greatly in reducing time, cost and false-positive interactions.
We have described our submitted solution with the results of the SeqPIP competition.
arXiv Detail & Related papers (2022-03-23T18:27:25Z) - Multi-modality fusion using canonical correlation analysis methods:
Application in breast cancer survival prediction from histology and genomics [16.537929113715432]
We study the use of canonical correlation analysis (CCA) and penalized variants of CCA for the fusion of two modalities.
We analytically show that, with known model parameters, posterior mean estimators that jointly use both modalities outperform arbitrary linear mixing of single modality posterior estimators in latent variable prediction.
arXiv Detail & Related papers (2021-11-27T21:18:01Z) - Toward Robust Drug-Target Interaction Prediction via Ensemble Modeling
and Transfer Learning [0.0]
We introduce an ensemble of deep learning models (EnsembleDLM) for robust DTI prediction.
EnsembleDLM only uses the sequence information of chemical compounds and proteins, and it aggregates the predictions from multiple deep neural networks.
It achieves state-of-the-art performance in Davis and KIBA datasets.
arXiv Detail & Related papers (2021-07-02T04:00:03Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z) - Adaptive Invariance for Molecule Property Prediction [38.637412590671865]
We introduce a novel approach to learn predictors that can generalize or extrapolate beyond the heterogeneous data.
Our method builds on and extends recently proposed invariant risk minimization.
Our predictor outperforms state-of-the-art transfer learning methods by significant margin.
arXiv Detail & Related papers (2020-05-05T19:47:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.