Related papers: BeeTLe: A Framework for Linear B-Cell Epitope Prediction and Classification

URL: http://arxiv.org/abs/2309.02071v1
Date: Tue, 5 Sep 2023 09:18:29 GMT
Title: BeeTLe: A Framework for Linear B-Cell Epitope Prediction and Classification
Authors: Xiao Yuan
Abstract summary: This paper presents a new deep learning-based framework for linear B-cell epitope prediction as well as antibody type-specific epitope classification. We propose an amino acid encoding method based on eigen decomposition to help the model learn the representations of epitopes. Experimental results on data curated from the largest public epitope database demonstrate the validity of the proposed methods.
Score: 0.43512163406551996
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The process of identifying and characterizing B-cell epitopes, which are the portions of antigens recognized by antibodies, is important for our understanding of the immune system, and for many applications including vaccine development, therapeutics, and diagnostics. Computational epitope prediction is challenging yet rewarding as it significantly reduces the time and cost of laboratory work. Most of the existing tools do not have satisfactory performance and only discriminate epitopes from non-epitopes. This paper presents a new deep learning-based multi-task framework for linear B-cell epitope prediction as well as antibody type-specific epitope classification. Specifically, a sequence-based neural network model using recurrent layers and Transformer blocks is developed. We propose an amino acid encoding method based on eigen decomposition to help the model learn the representations of epitopes. We introduce modifications to standard cross-entropy loss functions by extending a logit adjustment technique to cope with the class imbalance. Experimental results on data curated from the largest public epitope database demonstrate the validity of the proposed methods and the superior performance compared to competing ones.
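
The abstract mentions a logit adjustment technique for class imbalance without giving details. As a rough illustration only, the sketch below shows the standard (unextended) form of logit-adjusted cross-entropy, in which tau * log(class prior) is added to each logit during training. It is not the paper's modified loss; the function name, the two-class priors, and the example logits are all hypothetical.

```python
import numpy as np

def logit_adjusted_cross_entropy(logits, labels, class_priors, tau=1.0):
    """Mean cross-entropy after adding tau * log(prior) to each class logit.

    Standard logit adjustment, shown as an assumption-laden sketch;
    the paper's extended version is not reproduced here.
    """
    # Shift every class logit by its log prior (broadcast over the batch).
    adjusted = logits + tau * np.log(class_priors)
    # Numerically stable log-softmax over the class axis.
    shifted = adjusted - adjusted.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    # Pick the log-probability of the true class for each sample.
    return float(-log_probs[np.arange(len(labels)), labels].mean())

# Hypothetical imbalanced setting: 90% non-epitope (class 0), 10% epitope (class 1).
priors = np.array([0.9, 0.1])
logits = np.array([[2.0, 0.5], [0.3, 1.2], [1.0, 1.0]])
labels = np.array([0, 1, 1])

plain_loss = logit_adjusted_cross_entropy(logits, labels, priors, tau=0.0)
adjusted_loss = logit_adjusted_cross_entropy(logits, labels, priors, tau=1.0)
```

With tau set to 0 the function reduces to ordinary cross-entropy. With tau above 0, samples from the rare class incur a larger training loss for the same raw logits, which pushes the model toward a larger margin on that class.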

Related papers

Investigating the Impact of Histopathological Foundation Models on Regressive Prediction of Homologous Recombination Deficiency [52.50039435394964]
We systematically evaluate foundation models for regression-based tasks. We extract patch-level features from whole slide images (WSI) using five state-of-the-art foundation models. Models are trained to predict continuous HRD scores based on these extracted features across breast, endometrial, and lung cancer cohorts.
arXiv Detail & Related papers (2026-01-29T14:06:50Z)
Overlap-weighted orthogonal meta-learner for treatment effect estimation over time [90.46786193198744]
We introduce a novel overlap-weighted meta-learner for estimating heterogeneous treatment effects (HTEs). Our WO-learner has the favorable property of Neyman-orthogonality, meaning that it is robust against misspecification in the nuisance functions. We show that our WO-learner is fully model-agnostic and can be applied to any machine learning model.
arXiv Detail & Related papers (2025-10-22T14:47:57Z)
ABConformer: Physics-inspired Sliding Attention for Antibody-Antigen Interface Prediction [3.947298454012977]
We present ABCONFORMER, a model based on the Conformer backbone that captures both local and global features of a biosequence. ABCONFORMER can accurately predict paratopes and epitopes given the antibody and antigen sequence, and predict pan-epitopes on the antigen without antibody information.
arXiv Detail & Related papers (2025-09-27T11:12:04Z)
epiGPTope: A machine learning-based epitope generator and classifier [0.0]
Epitopes are short antigenic peptide sequences recognized by antibodies or immune cell receptors. The design of synthetic epitope libraries is challenging due to the large sequence space, $20^n$ combinations for linear epitopes of n amino acids, making screening and testing unfeasible. We present a large language model, epiGPTope, which is fine-tuned on linear epitopes and can generate novel epitope-like sequences.
arXiv Detail & Related papers (2025-09-03T14:36:06Z)
BConformeR: A Conformer Based on Mutual Sampling for Unified Prediction of Continuous and Discontinuous Antibody Binding Sites [3.947298454012977]
In this work, we propose a conformer-based model trained on antigen sequences derived from 1,080 antigen-antibody complexes. The CNN enhances the prediction of linear epitopes, and the Transformer module improves the prediction of conformational epitopes. Experimental results show that our model outperforms existing baselines in terms of PCC, ROC-AUC, PR-AUC, and F1 scores on both linear and conformational epitopes.
arXiv Detail & Related papers (2025-08-16T12:31:39Z)
Deep Neural Network-Based Prediction of B-Cell Epitopes for SARS-CoV and SARS-CoV-2: Enhancing Vaccine Design through Machine Learning [4.728153103738193]
The accurate prediction of B-cell epitopes is critical for guiding vaccine development against infectious diseases, including SARS and COVID-19. Traditional sequence-based methods often struggle with large, complex datasets, but deep learning offers promising improvements in predictive accuracy. Results indicate an overall accuracy of 82% in predicting COVID-19 negative and positive cases, with room for improvement in detecting positive samples.
arXiv Detail & Related papers (2024-11-28T01:54:43Z)
AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope Prediction [12.433560411515575]
We introduce a filtered antibody-antigen complex structure dataset, AsEP. AsEP is the largest of its kind and provides clustered groups. We propose a novel method, WALLE, which leverages both protein language models and structural modeling from graph neural networks.
arXiv Detail & Related papers (2024-07-25T16:43:56Z)
PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation Model [9.285895422810704]
PathoLM is a cutting-edge pathogen language model optimized for the identification of pathogenicity in bacterial and viral sequences. We developed a comprehensive data set comprising approximately 30 species of viruses and bacteria, including ESKAPEE pathogens. In comparative assessments, PathoLM dramatically outperforms existing models like DciPatho, demonstrating robust zero-shot and few-shot capabilities.
arXiv Detail & Related papers (2024-06-19T00:53:48Z)
Regressor-free Molecule Generation to Support Drug Response Prediction [83.25894107956735]
Conditional generation based on the target IC50 score can obtain a more effective sampling space. Regressor-free guidance combines a diffusion model's score estimation with a regression controller model's gradient based on number labels.
arXiv Detail & Related papers (2024-05-23T13:22:17Z)
Seeing Unseen: Discover Novel Biomedical Concepts via Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues. We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space. A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z)
An Efficient Consolidation of Word Embedding and Deep Learning Techniques for Classifying Anticancer Peptides: FastText+BiLSTM [0.0]
Anticancer peptides (ACPs) are peptides with a higher degree of selectivity and safety. Recent scientific advancements have generated interest in peptide-based therapies. ACPs offer the advantage of efficiently treating intended cells without negatively impacting normal cells.
arXiv Detail & Related papers (2023-09-21T13:25:11Z)
AI driven B-cell Immunotherapy Design [0.0]
The effectiveness of antigen neutralisation and elimination hinges upon the strength, sensitivity, and specificity of the paratope-epitope interaction. In recent years, artificial intelligence and machine learning methods have made significant strides, revolutionising the prediction of protein structures and their complexes. This review focuses on the progress of machine learning-based tools and their frameworks in the domain of B-cell immunotherapy design.
arXiv Detail & Related papers (2023-09-03T09:14:10Z)
Reprogramming Pretrained Language Models for Antibody Sequence Infilling [72.13295049594585]
Computational design of antibodies involves generating novel and diverse sequences while maintaining structural consistency. Recent deep learning models have shown impressive results; however, the limited number of known antibody sequence/structure pairs frequently leads to degraded performance. In our work we address this challenge by leveraging Model Reprogramming (MR), which repurposes models pretrained on a source language to adapt to tasks that are in a different language and have scarce data.
arXiv Detail & Related papers (2022-10-05T20:44:55Z)
Benchmarking Heterogeneous Treatment Effect Models through the Lens of Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem. Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools. We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z)
SPLDExtraTrees: Robust machine learning approach for predicting kinase inhibitor resistance [1.0674604700001966]
We propose a robust machine learning method, SPLDExtraTrees, which can accurately predict ligand binding affinity changes upon protein mutation. The proposed method ranks training data following a specific scheme that starts with easy-to-learn samples. Experiments substantiate the capability of the proposed method for predicting kinase inhibitor resistance under three scenarios.
arXiv Detail & Related papers (2021-11-15T09:07:45Z)
STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously. STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations. We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z)
Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype Prediction [55.94378672172967]
We focus on the few-shot disease subtype prediction problem, identifying subgroups of similar patients. We introduce meta-learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks. Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta-learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)

This list is automatically generated from the titles and abstracts of the papers on this site.