DePS: An improved deep learning model for de novo peptide sequencing
- URL: http://arxiv.org/abs/2203.08820v1
- Date: Wed, 16 Mar 2022 16:45:48 GMT
- Title: DePS: An improved deep learning model for de novo peptide sequencing
- Authors: Cheng Ge, Yi Lu, Jia Qu, Liangxu Xie, Feng Wang, Hong Zhang, Ren Kong
and Shan Chang
- Abstract summary: In this study, we proposed an enhanced model, DePS, which can improve the accuracy of de novo peptide sequencing.
For the same test set of DeepNovoV2, the DePS model achieved excellent results of 74.22%, 74.21% and 41.68% for amino acid recall, amino acid precision and peptide recall respectively.
- Score: 7.468176246958974
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: De novo peptide sequencing from mass spectrometry data is an important method
for protein identification. Recently, various deep learning approaches were
applied for de novo peptide sequencing and DeepNovoV2 is one of the
represetative models. In this study, we proposed an enhanced model, DePS, which
can improve the accuracy of de novo peptide sequencing even with missing signal
peaks or large number of noisy peaks in tandem mass spectrometry data. It is
showed that, for the same test set of DeepNovoV2, the DePS model achieved
excellent results of 74.22%, 74.21% and 41.68% for amino acid recall, amino
acid precision and peptide recall respectively. Furthermore, the results
suggested that DePS outperforms DeepNovoV2 on the cross species dataset.
Related papers
- NovoBench: Benchmarking Deep Learning-based De Novo Peptide Sequencing Methods in Proteomics [58.03989832372747]
We present the first unified benchmark NovoBench for emphde novo peptide sequencing.
It comprises diverse mass spectrum data, integrated models, and comprehensive evaluation metrics.
Recent methods, including DeepNovo, PointNovo, Casanovo, InstaNovo, AdaNovo and $pi$-HelixNovo are integrated into our framework.
arXiv Detail & Related papers (2024-06-16T08:23:21Z) - AdaNovo: Adaptive \emph{De Novo} Peptide Sequencing with Conditional Mutual Information [46.23980841020632]
We propose AdaNovo, a novel framework that calculates conditional mutual information (CMI) between the spectrum and each amino acid/peptide.
AdaNovo excels in identifying amino acids with post-translational modifications (PTMs) and exhibits robustness against data noise.
arXiv Detail & Related papers (2024-03-09T11:54:58Z) - Transformer-based de novo peptide sequencing for data-independent acquisition mass spectrometry [1.338778493151964]
We introduce DiaTrans, a deep-learning model based on transformer architecture.
It deciphers peptide sequences from DIA mass spectrometry data.
Our results show significant improvements over existing STOA methods.
arXiv Detail & Related papers (2024-02-17T19:04:23Z) - PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for
Efficient and Generalizable Compound-Protein Interaction Prediction [63.50967073653953]
Compound-Protein Interaction prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery.
Existing deep learning-based methods utilize only the single modality of protein sequences or structures.
We propose a novel multi-scale Protein Sequence-structure Contrasting framework for CPI prediction.
arXiv Detail & Related papers (2024-02-13T03:51:10Z) - ContraNovo: A Contrastive Learning Approach to Enhance De Novo Peptide
Sequencing [70.12220342151113]
ContraNovo is a pioneering algorithm that leverages contrastive learning to extract the relationship between spectra and peptides.
ContraNovo consistently outshines contemporary state-of-the-art solutions.
arXiv Detail & Related papers (2023-12-18T12:49:46Z) - Efficiently Predicting Protein Stability Changes Upon Single-point
Mutation with Large Language Models [51.57843608615827]
The ability to precisely predict protein thermostability is pivotal for various subfields and applications in biochemistry.
We introduce an ESM-assisted efficient approach that integrates protein sequence and structural features to predict the thermostability changes in protein upon single-point mutations.
arXiv Detail & Related papers (2023-12-07T03:25:49Z) - Efficient Prediction of Peptide Self-assembly through Sequential and
Graphical Encoding [57.89530563948755]
This work provides a benchmark analysis of peptide encoding with advanced deep learning models.
It serves as a guide for a wide range of peptide-related predictions such as isoelectric points, hydration free energy, etc.
arXiv Detail & Related papers (2023-07-17T00:43:33Z) - Target specific peptide design using latent space approximate trajectory
collector [1.7289819674602296]
We propose a novel machine based machine learning design, called Approximate Space Tray Collector (LSATC)
It consists of a series of samplers that approximates peptides with desired binding properties in a space $284%$.
We latent by the design of peptide extensions targeting Beta-catenin, a key effector protein involved in canonical Wnt signalling.
All the four binding peptides extended by LSATC show improved Beta-catenin binding by at least $2%$, two of the peptides show a $3$ fold increase in affinity as compared to the base peptide.
arXiv Detail & Related papers (2023-02-02T21:56:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.