De-novo Identification of Small Molecules from Their GC-EI-MS Spectra
- URL: http://arxiv.org/abs/2304.01634v1
- Date: Tue, 4 Apr 2023 08:46:00 GMT
- Title: De-novo Identification of Small Molecules from Their GC-EI-MS Spectra
- Authors: Adam H\'ajek and Michal Star\'y and Filip Jozefov and Helge Hecht and
Elliott Price and Ale\v{s} K\v{r}enek
- Abstract summary: Machine learning based emphde-novo methods, which derive molecular structure directly from its mass spectrum gained attention recently.
We present anovel method in this family, addressing aspecific usecase of GC-EI-MS spectra, which is particularly hard due to lack of additional information from the first stage of MS/MS experiments.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Identification of experimentally acquired mass spectra of unknown compounds
presents a~particular challenge because reliable spectral databases do not
cover the potential chemical space with sufficient density. Therefore machine
learning based \emph{de-novo} methods, which derive molecular structure
directly from its mass spectrum gained attention recently. We present a~novel
method in this family, addressing a~specific usecase of GC-EI-MS spectra, which
is particularly hard due to lack of additional information from the first stage
of MS/MS experiments, on which the previously published methods rely. We
analyze strengths and drawbacks or our approach and discuss future directions.
Related papers
- MassSpecGym: A benchmark for the discovery and identification of molecules [21.471140898806315]
We propose MassSpecGym -- the first comprehensive benchmark for the discovery and identification of molecules from MS/MS data.
Our benchmark comprises the largest publicly available collection of high-quality labeled MS/MS spectra.
It defines three MS/MS annotation challenges: textitde novo molecular structure generation, molecule retrieval, and spectrum simulation.
arXiv Detail & Related papers (2024-10-30T15:08:05Z) - Machine learning meets mass spectrometry: a focused perspective [0.0]
Mass spectrometry is a widely used method to study molecules and processes in medicine, life sciences, chemistry, and industrial product quality control, among many other applications.
One of the main features of some mass spectrometry techniques is the extensive level of characterization and a large amount of generated data per measurement.
With the development of machine learning methods, the opportunity arises to unlock the potential of these data, enabling previously inaccessible discoveries.
arXiv Detail & Related papers (2024-06-27T14:18:23Z) - Datacube segmentation via Deep Spectral Clustering [76.48544221010424]
Extended Vision techniques often pose a challenge in their interpretation.
The huge dimensionality of data cube spectra poses a complex task in its statistical interpretation.
In this paper, we explore the possibility of applying unsupervised clustering methods in encoded space.
A statistical dimensional reduction is performed by an ad hoc trained (Variational) AutoEncoder, while the clustering process is performed by a (learnable) iterative K-Means clustering algorithm.
arXiv Detail & Related papers (2024-01-31T09:31:28Z) - Deep Learning Models of the Discrete Component of the Galactic
Interstellar Gamma-Ray Emission [61.26321023273399]
A significant point-like component from the small scale (or discrete) structure in the H2 interstellar gas might be present in the Fermi-LAT data.
We show that deep learning may be effectively employed to model the gamma-ray emission traced by these rare H2 proxies within statistical significance in data-rich regions.
arXiv Detail & Related papers (2022-06-06T18:00:07Z) - Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet
Transmission Spectra [68.8204255655161]
We focus on unsupervised techniques for analyzing spectral data from transiting exoplanets.
We show that there is a high degree of correlation in the spectral data, which calls for appropriate low-dimensional representations.
We uncover interesting structures in the principal component basis, namely, well-defined branches corresponding to different chemical regimes.
arXiv Detail & Related papers (2022-01-07T22:26:33Z) - Gaussian Process Regression for Absorption Spectra Analysis of Molecular
Dimers [68.8204255655161]
We discuss an approach based on a machine learning technique, where the parameters for the numerical calculations are chosen from Gaussian Process Regression (GPR)
This approach does not only quickly converge to an optimal parameter set, but in addition provides information about the complete parameter space.
We find that indeed the GPR gives reliable results which are in agreement with direct calculations of these parameters using quantum chemical methods.
arXiv Detail & Related papers (2021-12-14T17:46:45Z) - Unsupervised Spectral Unmixing For Telluric Correction Using A Neural
Network Autoencoder [58.720142291102135]
We present a neural network autoencoder approach for extracting a telluric transmission spectrum from a large set of high-precision observed solar spectra from the HARPS-N radial velocity spectrograph.
arXiv Detail & Related papers (2021-11-17T12:54:48Z) - Automated fragment identification for electron ionisation mass
spectrometry: application to atmospheric measurements of halocarbons [0.0]
Non-target screening consists in searching a sample for all present substances, suspected or unknown.
This approach has been introduced more than a decade ago in the field of water analysis, but is still very scarce for indoor and atmospheric trace gas measurements.
We present data analysis tools to enable automated identification of unknown compounds measured by GC-EI-HRMS.
arXiv Detail & Related papers (2021-03-23T09:35:10Z) - Machine-learning-enhanced time-of-flight mass spectrometry analysis [10.16825220733013]
We introduce an approach that leverages modern machine learning technique to identify peak patterns in time-of-flight mass spectra within microseconds.
Our approach is cross-validated on mass spectra generated from different time-of-flight mass spectrometry(ToF-MS) techniques, offering the ToF-MS community an open-source, intelligent mass spectra analysis.
arXiv Detail & Related papers (2020-10-02T14:35:47Z) - A Review of 1D Convolutional Neural Networks toward Unknown Substance
Identification in Portable Raman Spectrometer [0.0]
Raman spectroscopy is a powerful analytical tool with applications ranging from quality control to cutting edge biomedical research.
They have been adopted widely by first responders and law enforcement agencies for the field analysis of unknown substances.
Field detection and identification of unknown substances with Raman spectroscopy rely heavily on the spectral matching capability of the devices on hand.
arXiv Detail & Related papers (2020-06-18T14:28:00Z) - Two-Dimensional Single- and Multiple-Quantum Correlation Spectroscopy in
Zero-Field Nuclear Magnetic Resonance [55.41644538483948]
We present single- and multiple-quantum correlation $J$-spectroscopy detected in zero magnetic field using a Rb vapor-cell magnetometer.
At zero field the spectrum of ethanol appears as a mixture of carbon isotopomers, and correlation spectroscopy is useful in separating the two composite spectra.
arXiv Detail & Related papers (2020-04-09T10:02:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.