PointIso: Point Cloud Based Deep Learning Model for Detecting
Arbitrary-Precision Peptide Features in LC-MS Map through Attention Based
Segmentation
- URL: http://arxiv.org/abs/2009.07250v1
- Date: Tue, 15 Sep 2020 17:34:14 GMT
- Title: PointIso: Point Cloud Based Deep Learning Model for Detecting
Arbitrary-Precision Peptide Features in LC-MS Map through Attention Based
Segmentation
- Authors: Fatema Tuz Zohora, M Ziaur Rahman, Ngoc Hieu Tran, Lei Xin, Baozhen
Shan, Ming Li
- Abstract summary: PointIso is a point cloud based, arbitrary-precision deep learning network to address the problem of peptide feature detection.
It achieves 98% detection of high quality MS/MS identifications in a benchmark dataset.
- Score: 5.495506445661776
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A promising technique of discovering disease biomarkers is to measure the
relative protein abundance in multiple biofluid samples through liquid
chromatography with tandem mass spectrometry (LC-MS/MS) based quantitative
proteomics. The key step involves peptide feature detection in LC-MS map, along
with its charge and intensity. Existing heuristic algorithms suffer from
inaccurate parameters since different settings of the parameters result in
significantly different outcomes. Therefore, we propose PointIso, to serve the
necessity of an automated system for peptide feature detection that is able to
find out the proper parameters itself, and is easily adaptable to different
types of datasets. It consists of an attention based scanning step for
segmenting the multi-isotopic pattern of peptide features along with charge and
a sequence classification step for grouping those isotopes into potential
peptide features. PointIso is the first point cloud based, arbitrary-precision
deep learning network to address the problem and achieves 98% detection of high
quality MS/MS identifications in a benchmark dataset, which is higher than
several other widely used algorithms. Besides contributing to the proteomics
study, we believe our novel segmentation technique should serve the general
image processing domain as well.
Related papers
- MeToken: Uniform Micro-environment Token Boosts Post-Translational Modification Prediction [65.33218256339151]
Post-translational modifications (PTMs) profoundly expand the complexity and functionality of the proteome.
Existing computational approaches predominantly focus on protein sequences to predict PTM sites, driven by the recognition of sequence-dependent motifs.
We introduce the MeToken model, which tokenizes the micro-environment of each acid, integrating both sequence and structural information into unified discrete tokens.
arXiv Detail & Related papers (2024-11-04T07:14:28Z) - NovoBench: Benchmarking Deep Learning-based De Novo Peptide Sequencing Methods in Proteomics [58.03989832372747]
We present the first unified benchmark NovoBench for emphde novo peptide sequencing.
It comprises diverse mass spectrum data, integrated models, and comprehensive evaluation metrics.
Recent methods, including DeepNovo, PointNovo, Casanovo, InstaNovo, AdaNovo and $pi$-HelixNovo are integrated into our framework.
arXiv Detail & Related papers (2024-06-16T08:23:21Z) - Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation [3.5177988631063486]
We present a novel approach that adapts pre-trained natural image encoders of SAM for detection-based region proposals.
The entire base framework, SAM, requires no additional training or fine-tuning but could produce an end-to-end result for two fundamental segmentation tasks in pathology.
arXiv Detail & Related papers (2024-04-12T16:29:49Z) - Domain Adaptive Synapse Detection with Weak Point Annotations [63.97144211520869]
We present AdaSyn, a framework for domain adaptive synapse detection with weak point annotations.
In the WASPSYN challenge at I SBI 2023, our method ranks the 1st place.
arXiv Detail & Related papers (2023-08-31T05:05:53Z) - Optimal transport for automatic alignment of untargeted metabolomic data [8.692678207022084]
We introduce GromovMatcher, a flexible and user-friendly algorithm that automatically combines LC-MS datasets using optimal transport.
By capitalizing on feature intensity correlation structures, GromovMatcher delivers superior alignment accuracy and robustness.
We show how GromovMatcher facilitates the search for biomarkers associated with lifestyle risk factors linked to several cancer types.
arXiv Detail & Related papers (2023-06-05T20:08:19Z) - Active Learning Enhances Classification of Histopathology Whole Slide
Images with Attention-based Multiple Instance Learning [48.02011627390706]
We train an attention-based MIL and calculate a confidence metric for every image in the dataset to select the most uncertain WSIs for expert annotation.
With a novel attention guiding loss, this leads to an accuracy boost of the trained models with few regions annotated for each class.
It may in the future serve as an important contribution to train MIL models in the clinically relevant context of cancer classification in histopathology.
arXiv Detail & Related papers (2023-03-02T15:18:58Z) - Untargeted Region of Interest Selection for GC-MS Data using a Pseudo
F-Ratio Moving Window ($\psi$FRMV) [0.0]
We propose a new method for automated, untargeted region of interest selection in GC-MS data.
It is based on the ratio of the squared first, and second singular values from the Singular Value Decomposition of a window that moves across the chromatogram.
The sensitivity of the algorithm was tested by investigating the concentration at which it can no longer pick out chromatographic regions known to contain signal.
arXiv Detail & Related papers (2022-07-30T21:43:05Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - Ovarian Cancer Detection based on Dimensionality Reduction Techniques
and Genetic Algorithm [0.0]
We have two serum SELDI datasets to identify proteomic cancerous serums from normal serums.
We have chosen to evaluate the performance of PCA (Principal Component Analysis) and GA (Genetic algorithm)
We conclude that GA is more efficient for features selection and hence for cancerous patterns detection than PCA technique.
arXiv Detail & Related papers (2021-05-04T20:38:29Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Accurate Cell Segmentation in Digital Pathology Images via Attention
Enforced Networks [0.0]
We propose an Attention Enforced Network (AENet) to integrate local features with global dependencies and weight effective channels adaptively.
In the test stage, we present an individual color normalization method to deal with the stain variation problem.
arXiv Detail & Related papers (2020-12-14T03:39:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.