Isotopic envelope identification by analysis of the spatial distribution
of components in MALDI-MSI data
- URL: http://arxiv.org/abs/2302.06051v2
- Date: Tue, 14 Feb 2023 21:39:44 GMT
- Title: Isotopic envelope identification by analysis of the spatial distribution
of components in MALDI-MSI data
- Authors: Anna Glodek, Joanna Pola\'nska, Marta Gawin
- Abstract summary: This paper presents a method for identifying isotope envelopes in MALDI-ToF molecular imaging data based on the Mamdani-Assilan fuzzy system.
The algorithm was tested on eight datasets obtained from the MALDI-ToF experiment on samples from patients with cancer of the head and neck region.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: One of the significant steps in the process leading to the identification of
proteins is mass spectrometry, which allows for obtaining information about the
structure of proteins. Removing isotope peaks from the mass spectrum is vital
and it is done in a process called deisotoping. There are different algorithms
for deisotoping, but they have their limitations, they are dedicated to
different methods of mass spectrometry. Data from experiments performed with
the MALDI-ToF technique are characterized by high dimensionality. This paper
presents a method for identifying isotope envelopes in MALDI-ToF molecular
imaging data based on the Mamdani-Assilan fuzzy system and spatial maps of the
molecular distribution of peaks included in the isotopic envelope. Several
image texture measures were used to evaluate spatial molecular distribution
maps. The algorithm was tested on eight datasets obtained from the MALDI-ToF
experiment on samples from the National Institute of Oncology in Gliwice from
patients with cancer of the head and neck region. The data were subjected to
pre-processing and feature extraction. The results were collected and compared
with three existing deisotoping algorithms. The analysis of the obtained
results showed that the method for identifying isotopic envelopes proposed in
this paper enables the detection of overlapping envelopes by using the approach
oriented to study peak pairs. Moreover, the proposed algorithm enables the
analysis of large data sets.
Related papers
- CryoFM: A Flow-based Foundation Model for Cryo-EM Densities [50.291974465864364]
We present CryoFM, a foundation model designed as a generative model, learning the distribution of high-quality density maps.
Built on flow matching, CryoFM is trained to accurately capture the prior distribution of biomolecular density maps.
arXiv Detail & Related papers (2024-10-11T08:53:58Z) - Unraveling Molecular Structure: A Multimodal Spectroscopic Dataset for Chemistry [0.1747623282473278]
This dataset comprises simulated $1$H-NMR, $13$C-NMR, HSQC-NMR, Infrared, and Mass spectra for 790k molecules extracted from chemical reactions in patent data.
We provide benchmarks for evaluating single-modality tasks such as structure elucidation, predicting the spectra for a target molecule, and functional group predictions.
arXiv Detail & Related papers (2024-07-04T12:52:48Z) - Data-Efficient Molecular Generation with Hierarchical Textual Inversion [48.816943690420224]
We introduce Hierarchical textual Inversion for Molecular generation (HI-Mol), a novel data-efficient molecular generation method.
HI-Mol is inspired by the importance of hierarchical information, e.g., both coarse- and fine-grained features, in understanding the molecule distribution.
Compared to the conventional textual inversion method in the image domain using a single-level token embedding, our multi-level token embeddings allow the model to effectively learn the underlying low-shot molecule distribution.
arXiv Detail & Related papers (2024-05-05T08:35:23Z) - Datacube segmentation via Deep Spectral Clustering [76.48544221010424]
Extended Vision techniques often pose a challenge in their interpretation.
The huge dimensionality of data cube spectra poses a complex task in its statistical interpretation.
In this paper, we explore the possibility of applying unsupervised clustering methods in encoded space.
A statistical dimensional reduction is performed by an ad hoc trained (Variational) AutoEncoder, while the clustering process is performed by a (learnable) iterative K-Means clustering algorithm.
arXiv Detail & Related papers (2024-01-31T09:31:28Z) - A Heat Diffusion Perspective on Geodesic Preserving Dimensionality
Reduction [66.21060114843202]
We propose a more general heat kernel based manifold embedding method that we call heat geodesic embeddings.
Results show that our method outperforms existing state of the art in preserving ground truth manifold distances.
We also showcase our method on single cell RNA-sequencing datasets with both continuum and cluster structure.
arXiv Detail & Related papers (2023-05-30T13:58:50Z) - A Segmentation Method for fluorescence images without a machine learning
approach [0.0]
This study describes a deterministic computational neuroscience approach for identifying cells and nuclei.
The method is robust, based on formally correct functions, and does not suffer from tuning on specific data sets.
arXiv Detail & Related papers (2022-12-28T16:47:05Z) - SHREC 2021: Classification in cryo-electron tomograms [13.443446070180562]
cryo-electron tomography (cryo-ET) is an imaging technique that allows three-dimensional visualization of macro-molecular assemblies.
Cryo-ET comes with a number of challenges, mainly low signal-to-noise and inability to obtain images from all angles.
We generate a novel simulated dataset to benchmark different methods of localization and classification of biological macromolecules in tomograms.
arXiv Detail & Related papers (2022-03-18T16:08:22Z) - Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet
Transmission Spectra [68.8204255655161]
We focus on unsupervised techniques for analyzing spectral data from transiting exoplanets.
We show that there is a high degree of correlation in the spectral data, which calls for appropriate low-dimensional representations.
We uncover interesting structures in the principal component basis, namely, well-defined branches corresponding to different chemical regimes.
arXiv Detail & Related papers (2022-01-07T22:26:33Z) - Gaussian Process Regression for Absorption Spectra Analysis of Molecular
Dimers [68.8204255655161]
We discuss an approach based on a machine learning technique, where the parameters for the numerical calculations are chosen from Gaussian Process Regression (GPR)
This approach does not only quickly converge to an optimal parameter set, but in addition provides information about the complete parameter space.
We find that indeed the GPR gives reliable results which are in agreement with direct calculations of these parameters using quantum chemical methods.
arXiv Detail & Related papers (2021-12-14T17:46:45Z) - HyperPCA: a Powerful Tool to Extract Elemental Maps from Noisy Data
Obtained in LIBS Mapping of Materials [7.648784748888189]
We introduce HyperPCA, a new analysis tool for hyperspectral images based on a sparse representation of the data.
We show that the method presents advantages both in quantity and quality of the information recovered, thus improving the physico-chemical characterisation of analysed surfaces.
arXiv Detail & Related papers (2021-11-30T07:52:44Z) - PointIso: Point Cloud Based Deep Learning Model for Detecting
Arbitrary-Precision Peptide Features in LC-MS Map through Attention Based
Segmentation [5.495506445661776]
PointIso is a point cloud based, arbitrary-precision deep learning network to address the problem of peptide feature detection.
It achieves 98% detection of high quality MS/MS identifications in a benchmark dataset.
arXiv Detail & Related papers (2020-09-15T17:34:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.