DePS: An improved deep learning model for de novo peptide sequencing
        - URL: http://arxiv.org/abs/2203.08820v1
- Date: Wed, 16 Mar 2022 16:45:48 GMT
- Title: DePS: An improved deep learning model for de novo peptide sequencing
- Authors: Cheng Ge, Yi Lu, Jia Qu, Liangxu Xie, Feng Wang, Hong Zhang, Ren Kong
  and Shan Chang
- Abstract summary: In this study, we proposed an enhanced model, DePS, which can improve the accuracy of de novo peptide sequencing.
For the same test set of DeepNovoV2, the DePS model achieved excellent results of 74.22%, 74.21% and 41.68% for amino acid recall, amino acid precision and peptide recall respectively.
- Score: 7.468176246958974
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   De novo peptide sequencing from mass spectrometry data is an important method
for protein identification. Recently, various deep learning approaches were
applied for de novo peptide sequencing and DeepNovoV2 is one of the
represetative models. In this study, we proposed an enhanced model, DePS, which
can improve the accuracy of de novo peptide sequencing even with missing signal
peaks or large number of noisy peaks in tandem mass spectrometry data. It is
showed that, for the same test set of DeepNovoV2, the DePS model achieved
excellent results of 74.22%, 74.21% and 41.68% for amino acid recall, amino
acid precision and peptide recall respectively. Furthermore, the results
suggested that DePS outperforms DeepNovoV2 on the cross species dataset.
 
      
        Related papers
        - PLAME: Leveraging Pretrained Language Models to Generate Enhanced   Protein Multiple Sequence Alignments [53.55710514466851]
 Protein structure prediction is essential for drug discovery and understanding biological functions.<n>Most folding models rely heavily on multiple sequence alignments (MSAs) to boost prediction performance.<n>We propose PLAME, a novel MSA design model that leverages evolutionary embeddings from pretrained protein language models.
 arXiv  Detail & Related papers  (2025-06-17T04:11:30Z)
- A general language model for peptide identification [4.044600688588866]
 PDeepPP is a unified deep learning framework that integrates pretrained protein language models with a hybrid transformer-convolutional architecture.<n>By enabling large-scale, accurate peptide analysis, PDeepPP supports biomedical research and the discovery of novel therapeutic targets for disease treatment.
 arXiv  Detail & Related papers  (2025-02-21T17:31:22Z)
- Disentangling the Complex Multiplexed DIA Spectra in De Novo Peptide   Sequencing [7.24090686599962]
 Data-Independent Acquisition (DIA) was introduced to improve sensitivity to cover all peptides in a range rather than only sampling high-intensity peaks.
It is not very clear how useful DIA data is for de novo peptide sequencing as the DIA data are marred with coeluted peptides, high noises, and varying data quality.
 arXiv  Detail & Related papers  (2024-11-24T02:10:29Z)
- NovoBench: Benchmarking Deep Learning-based De Novo Peptide Sequencing   Methods in Proteomics [58.03989832372747]
 We present the first unified benchmark NovoBench for emphde novo peptide sequencing.
It comprises diverse mass spectrum data, integrated models, and comprehensive evaluation metrics.
Recent methods, including DeepNovo, PointNovo, Casanovo, InstaNovo, AdaNovo and $pi$-HelixNovo are integrated into our framework.
 arXiv  Detail & Related papers  (2024-06-16T08:23:21Z)
- AdaNovo: Adaptive \emph{De Novo} Peptide Sequencing with Conditional   Mutual Information [46.23980841020632]
 We propose AdaNovo, a novel framework that calculates conditional mutual information (CMI) between the spectrum and each amino acid/peptide.
AdaNovo excels in identifying amino acids with post-translational modifications (PTMs) and exhibits robustness against data noise.
 arXiv  Detail & Related papers  (2024-03-09T11:54:58Z)
- Transformer-based de novo peptide sequencing for data-independent   acquisition mass spectrometry [1.338778493151964]
 We introduce DiaTrans, a deep-learning model based on transformer architecture.
It deciphers peptide sequences from DIA mass spectrometry data.
Our results show significant improvements over existing STOA methods.
 arXiv  Detail & Related papers  (2024-02-17T19:04:23Z)
- PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for
  Efficient and Generalizable Compound-Protein Interaction Prediction [63.50967073653953]
 Compound-Protein Interaction prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery.
Existing deep learning-based methods utilize only the single modality of protein sequences or structures.
We propose a novel multi-scale Protein Sequence-structure Contrasting framework for CPI prediction.
 arXiv  Detail & Related papers  (2024-02-13T03:51:10Z)
- ContraNovo: A Contrastive Learning Approach to Enhance De Novo Peptide
  Sequencing [70.12220342151113]
 ContraNovo is a pioneering algorithm that leverages contrastive learning to extract the relationship between spectra and peptides.
ContraNovo consistently outshines contemporary state-of-the-art solutions.
 arXiv  Detail & Related papers  (2023-12-18T12:49:46Z)
- Efficiently Predicting Protein Stability Changes Upon Single-point
  Mutation with Large Language Models [51.57843608615827]
 The ability to precisely predict protein thermostability is pivotal for various subfields and applications in biochemistry.
We introduce an ESM-assisted efficient approach that integrates protein sequence and structural features to predict the thermostability changes in protein upon single-point mutations.
 arXiv  Detail & Related papers  (2023-12-07T03:25:49Z)
- Efficient Prediction of Peptide Self-assembly through Sequential and
  Graphical Encoding [57.89530563948755]
 This work provides a benchmark analysis of peptide encoding with advanced deep learning models.
It serves as a guide for a wide range of peptide-related predictions such as isoelectric points, hydration free energy, etc.
 arXiv  Detail & Related papers  (2023-07-17T00:43:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.