BeeTLe: A Framework for Linear B-Cell Epitope Prediction and
Classification
- URL: http://arxiv.org/abs/2309.02071v1
- Date: Tue, 5 Sep 2023 09:18:29 GMT
- Title: BeeTLe: A Framework for Linear B-Cell Epitope Prediction and
Classification
- Authors: Xiao Yuan
- Abstract summary: This paper presents a new deep learning-based framework for linear B-cell prediction as well as antibody type-specific classification.
We propose an amino acid encoding method based on eigen decomposition to help the model learn the representations of antibodies.
Experimental results on data curated from the largest public database demonstrate the validity of the proposed methods.
- Score: 0.43512163406551996
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The process of identifying and characterizing B-cell epitopes, which are the
portions of antigens recognized by antibodies, is important for our
understanding of the immune system, and for many applications including vaccine
development, therapeutics, and diagnostics. Computational epitope prediction is
challenging yet rewarding as it significantly reduces the time and cost of
laboratory work. Most of the existing tools do not have satisfactory
performance and only discriminate epitopes from non-epitopes. This paper
presents a new deep learning-based multi-task framework for linear B-cell
epitope prediction as well as antibody type-specific epitope classification.
Specifically, a sequenced-based neural network model using recurrent layers and
Transformer blocks is developed. We propose an amino acid encoding method based
on eigen decomposition to help the model learn the representations of epitopes.
We introduce modifications to standard cross-entropy loss functions by
extending a logit adjustment technique to cope with the class imbalance.
Experimental results on data curated from the largest public epitope database
demonstrate the validity of the proposed methods and the superior performance
compared to competing ones.
Related papers
- AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope Prediction [12.433560411515575]
We introduce a filtered antibody-antigen complex structure dataset, AsEP.
AsEP is the largest of its kind and provides clustered groups, allowing the community to develop prediction methods.
We propose a new method, WALLE, that leverages both protein language models and graph neural networks.
arXiv Detail & Related papers (2024-07-25T16:43:56Z) - PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation Model [9.285895422810704]
PathoLM is a cutting-edge pathogen language model optimized for the identification of pathogenicity in bacterial and viral sequences.
We developed a comprehensive data set comprising approximately 30 species of viruses and bacteria, including ESKAPEE pathogens.
In comparative assessments, PathoLM dramatically outperforms existing models like DciPatho, demonstrating robust zero-shot and few-shot capabilities.
arXiv Detail & Related papers (2024-06-19T00:53:48Z) - Regressor-free Molecule Generation to Support Drug Response Prediction [83.25894107956735]
Conditional generation based on the target IC50 score can obtain a more effective sampling space.
Regressor-free guidance combines a diffusion model's score estimation with a regression controller model's gradient based on number labels.
arXiv Detail & Related papers (2024-05-23T13:22:17Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - An Efficient Consolidation of Word Embedding and Deep Learning
Techniques for Classifying Anticancer Peptides: FastText+BiLSTM [0.0]
Anticancer peptides (ACPs) are peptides with higher degree of selectivity and safety.
Recent scientific advancements generate an interest in peptide-based therapies.
ACPs offer the advantage of efficiently treating intended cells without negatively impacting normal cells.
arXiv Detail & Related papers (2023-09-21T13:25:11Z) - AI driven B-cell Immunotherapy Design [0.0]
The effectiveness of antigen neutralisation and elimination hinges upon the strength, sensitivity, and specificity of the paratope-epitope interaction.
In recent years, artificial intelligence and machine learning methods have made significant strides, revolutionising the prediction of protein structures and their complexes.
This review focuses on the progress of machine learning-based tools and their frameworks in the domain of B-cell immunotherapy design.
arXiv Detail & Related papers (2023-09-03T09:14:10Z) - Reprogramming Pretrained Language Models for Antibody Sequence Infilling [72.13295049594585]
Computational design of antibodies involves generating novel and diverse sequences, while maintaining structural consistency.
Recent deep learning models have shown impressive results, however the limited number of known antibody sequence/structure pairs frequently leads to degraded performance.
In our work we address this challenge by leveraging Model Reprogramming (MR), which repurposes pretrained models on a source language to adapt to the tasks that are in a different language and have scarce data.
arXiv Detail & Related papers (2022-10-05T20:44:55Z) - Benchmarking Heterogeneous Treatment Effect Models through the Lens of
Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem.
Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools.
We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z) - SPLDExtraTrees: Robust machine learning approach for predicting kinase
inhibitor resistance [1.0674604700001966]
We propose a robust machine learning method, SPLDExtraTrees, which can accurately predict ligand binding affinity changes upon protein mutation.
The proposed method ranks training data following a specific scheme that starts with easy-to-learn samples.
Experiments substantiate the capability of the proposed method for predicting kinase inhibitor resistance under three scenarios.
arXiv Detail & Related papers (2021-11-15T09:07:45Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.