TITAN: T Cell Receptor Specificity Prediction with Bimodal Attention
Networks
- URL: http://arxiv.org/abs/2105.03323v1
- Date: Wed, 21 Apr 2021 09:25:14 GMT
- Title: TITAN: T Cell Receptor Specificity Prediction with Bimodal Attention
Networks
- Authors: Anna Weber, Jannis Born and Mar\'ia Rodr\'iguez Mart\'inez
- Abstract summary: We propose a bimodal neural network that encodes both TCR sequences and epITopes to enable independent study of capabilities to unseen sequences and transfer/ors.
Tcr-distance-distance neural network exhibits competitive performance on unseen TCRs.
Tcr-distance-distance neural network also exhibits competitive performance on unseen TCRs.
- Score: 0.5371337604556311
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Motivation: The activity of the adaptive immune system is governed by T-cells
and their specific T-cell receptors (TCR), which selectively recognize foreign
antigens. Recent advances in experimental techniques have enabled sequencing of
TCRs and their antigenic targets (epitopes), allowing to research the missing
link between TCR sequence and epitope binding specificity. Scarcity of data and
a large sequence space make this task challenging, and to date only models
limited to a small set of epitopes have achieved good performance. Here, we
establish a k-nearest-neighbor (K-NN) classifier as a strong baseline and then
propose TITAN (Tcr epITope bimodal Attention Networks), a bimodal neural
network that explicitly encodes both TCR sequences and epitopes to enable the
independent study of generalization capabilities to unseen TCRs and/or
epitopes. Results: By encoding epitopes at the atomic level with SMILES
sequences, we leverage transfer learning and data augmentation to enrich the
input data space and boost performance. TITAN achieves high performance in the
prediction of specificity of unseen TCRs (ROC-AUC 0.87 in 10-fold CV) and
surpasses the results of the current state-of-the-art (ImRex) by a large
margin. Notably, our Levenshtein-distance-based K-NN classifier also exhibits
competitive performance on unseen TCRs. While the generalization to unseen
epitopes remains challenging, we report two major breakthroughs. First, by
dissecting the attention heatmaps, we demonstrate that the sparsity of
available epitope data favors an implicit treatment of epitopes as classes.
This may be a general problem that limits unseen epitope performance for
sufficiently complex models. Second, we show that TITAN nevertheless exhibits
significantly improved performance on unseen epitopes and is capable of
focusing attention on chemically meaningful molecular structures.
Related papers
- Dumpling GNN: Hybrid GNN Enables Better ADC Payload Activity Prediction Based on Chemical Structure [53.76752789814785]
DumplingGNN is a hybrid Graph Neural Network architecture specifically designed for predicting ADC payload activity based on chemical structure.
We evaluate it on a comprehensive ADC payload dataset focusing on DNA Topoisomerase I inhibitors.
It demonstrates exceptional accuracy (91.48%), sensitivity (95.08%), and specificity (97.54%) on our specialized ADC payload dataset.
arXiv Detail & Related papers (2024-09-23T17:11:04Z) - A large language model for predicting T cell receptor-antigen binding specificity [4.120928123714289]
We propose a Masked Language Model (MLM) to overcome limitations in model generalization.
Specifically, we randomly masked sequence segments and train tcrLM to infer the masked segment, thereby extract expressive feature from TCR sequences.
Our extensive experimental results demonstrate that tcrLM achieved AUC values of 0.937 and 0.933 on independent test sets and external validation sets.
arXiv Detail & Related papers (2024-06-24T08:36:40Z) - Active Learning Framework for Cost-Effective TCR-Epitope Binding
Affinity Prediction [6.3044887592852845]
ActiveTCR is a framework that incorporates active learning and TCR-epitope binding affinity prediction models.
It aims to maximize performance gains while minimizing the cost of annotation.
Our work is the first systematic investigation of data optimization for TCR-epitope binding affinity prediction.
arXiv Detail & Related papers (2023-10-16T23:53:07Z) - AIRIVA: A Deep Generative Model of Adaptive Immune Repertoires [6.918664738267051]
We present an Adaptive Immune Repertoire-Invariant Variational Autoencoder (AIRIVA) that learns a low-dimensional, interpretable, and compositional representation of TCR repertoires to disentangle systematic effects in repertoires.
arXiv Detail & Related papers (2023-04-26T14:40:35Z) - Tissue Classification During Needle Insertion Using Self-Supervised
Contrastive Learning and Optical Coherence Tomography [53.38589633687604]
We propose a deep neural network that classifies the tissues from the phase and intensity data of complex OCT signals acquired at the needle tip.
We show that with 10% of the training set, our proposed pretraining strategy helps the model achieve an F1 score of 0.84 whereas the model achieves an F1 score of 0.60 without it.
arXiv Detail & Related papers (2023-04-26T14:11:04Z) - Data-Efficient Vision Transformers for Multi-Label Disease
Classification on Chest Radiographs [55.78588835407174]
Vision Transformers (ViTs) have not been applied to this task despite their high classification performance on generic images.
ViTs do not rely on convolutions but on patch-based self-attention and in contrast to CNNs, no prior knowledge of local connectivity is present.
Our results show that while the performance between ViTs and CNNs is on par with a small benefit for ViTs, DeiTs outperform the former if a reasonably large data set is available for training.
arXiv Detail & Related papers (2022-08-17T09:07:45Z) - TCR: A Transformer Based Deep Network for Predicting Cancer Drugs
Response [12.86640026993276]
We proposeTransformer based network for Cancer drug Response (TCR) to predict anti-cancer drug response.
By utilizing an attention mechanism, TCR is able to learn the interactions between drug atom/sub-structure and molecular signatures efficiently.
Our study highlights the prediction power of TCR and its potential value for cancer drug repurpose and precision oncology treatment.
arXiv Detail & Related papers (2022-07-10T13:01:54Z) - Attention-aware contrastive learning for predicting T cell
receptor-antigen binding specificity [7.365824008999903]
It has been verified that only a small fraction of the neoantigens presented by MHC class I molecules on the cell surface can elicit T cells.
We propose an attentive-mask contrastive learning model, ATMTCR, for inferring TCR-antigen binding specificity.
arXiv Detail & Related papers (2022-05-17T10:53:32Z) - Dual-Consistency Semi-Supervised Learning with Uncertainty
Quantification for COVID-19 Lesion Segmentation from CT Images [49.1861463923357]
We propose an uncertainty-guided dual-consistency learning network (UDC-Net) for semi-supervised COVID-19 lesion segmentation from CT images.
Our proposed UDC-Net improves the fully supervised method by 6.3% in Dice and outperforms other competitive semi-supervised approaches by significant margins.
arXiv Detail & Related papers (2021-04-07T16:23:35Z) - CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors
and Efficient Neural Networks [51.589769497681175]
The novel coronavirus (SARS-CoV-2) has led to a pandemic.
The current testing regime based on Reverse Transcription-Polymerase Chain Reaction for SARS-CoV-2 has been unable to keep up with testing demands.
We propose a framework called CovidDeep that combines efficient DNNs with commercially available WMSs for pervasive testing of the virus.
arXiv Detail & Related papers (2020-07-20T21:47:28Z) - Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images [152.34988415258988]
Automated detection of lung infections from computed tomography (CT) images offers a great potential to augment the traditional healthcare strategy for tackling COVID-19.
segmenting infected regions from CT slices faces several challenges, including high variation in infection characteristics, and low intensity contrast between infections and normal tissues.
To address these challenges, a novel COVID-19 Deep Lung Infection Network (Inf-Net) is proposed to automatically identify infected regions from chest CT slices.
arXiv Detail & Related papers (2020-04-22T07:30:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.