Efficiently predicting high resolution mass spectra with graph neural
networks
- URL: http://arxiv.org/abs/2301.11419v1
- Date: Thu, 26 Jan 2023 21:10:26 GMT
- Title: Efficiently predicting high resolution mass spectra with graph neural
networks
- Authors: Michael Murphy, Stefanie Jegelka, Ernest Fraenkel, Tobias Kind, David
Healey, Thomas Butler
- Abstract summary: Identifying a small molecule from its mass spectrum is the primary open problem in computational metabolomics.
This is typically cast as information retrieval: an unknown spectrum is matched against spectra predicted computationally from a large database of chemical structures.
We resolve this tradeoff by casting spectrum prediction as a mapping from an input molecular graph to a probability distribution over molecular formulas.
- Score: 28.387227518307604
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Identifying a small molecule from its mass spectrum is the primary open
problem in computational metabolomics. This is typically cast as information
retrieval: an unknown spectrum is matched against spectra predicted
computationally from a large database of chemical structures. However, current
approaches to spectrum prediction model the output space in ways that force a
tradeoff between capturing high resolution mass information and tractable
learning. We resolve this tradeoff by casting spectrum prediction as a mapping
from an input molecular graph to a probability distribution over molecular
formulas. We discover that a large corpus of mass spectra can be closely
approximated using a fixed vocabulary constituting only 2% of all observed
formulas. This enables efficient spectrum prediction using an architecture
similar to graph classification - GrAFF-MS - achieving significantly lower
prediction error and orders-of-magnitude faster runtime than state-of-the-art
methods.
Related papers
- FraGNNet: A Deep Probabilistic Model for Mass Spectrum Prediction [5.941101105232284]
Compound to mass spectrum (C2MS) models can improve retrieval rates by augmenting real libraries with predicted spectra.
We develop a new probabilistic method for C2MS prediction, FraGNNet, that can efficiently and accurately predict high-resolution spectra.
Our model achieves state-of-the-art performance in terms of prediction error, and surpasses existing C2MS models as a tool for retrieval-based MS2C.
arXiv Detail & Related papers (2024-04-02T23:16:15Z) - Datacube segmentation via Deep Spectral Clustering [76.48544221010424]
Extended Vision techniques often pose a challenge in their interpretation.
The huge dimensionality of data cube spectra poses a complex task in its statistical interpretation.
In this paper, we explore the possibility of applying unsupervised clustering methods in encoded space.
A statistical dimensional reduction is performed by an ad hoc trained (Variational) AutoEncoder, while the clustering process is performed by a (learnable) iterative K-Means clustering algorithm.
arXiv Detail & Related papers (2024-01-31T09:31:28Z) - Mass Spectra Prediction with Structural Motif-based Graph Neural
Networks [21.71309513265843]
MoMS-Net is a system that predicts mass spectra using the information derived from structural motifs and the implementation of Graph Neural Networks (GNNs)
We have tested our model across diverse mass spectra and have observed its superiority over other existing models.
arXiv Detail & Related papers (2023-06-28T10:33:57Z) - Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - Graph Fourier MMD for Signals on Graphs [67.68356461123219]
We propose a novel distance between distributions and signals on graphs.
GFMMD is defined via an optimal witness function that is both smooth on the graph and maximizes difference in expectation.
We showcase it on graph benchmark datasets as well as on single cell RNA-sequencing data analysis.
arXiv Detail & Related papers (2023-06-05T00:01:17Z) - Prefix-Tree Decoding for Predicting Mass Spectra from Molecules [12.868704267691125]
We use a new intermediate strategy for predicting mass spectra from molecules by treating mass spectra as sets of molecular formulae, which are themselves multisets of atoms.
We show promising empirical results on mass spectra prediction tasks.
arXiv Detail & Related papers (2023-03-11T17:44:28Z) - Handling Missing Data via Max-Entropy Regularized Graph Autoencoder [37.8103274049137]
MEGAE is a regularized graph autoencoder for graph attribute imputation.
It aims at mitigating spectral concentration problem by maximizing the graph spectral entropy.
It outperforms all the other state-of-the-art imputation methods on a variety of benchmark datasets.
arXiv Detail & Related papers (2022-11-30T06:22:40Z) - Unsupervised Spectral Unmixing For Telluric Correction Using A Neural
Network Autoencoder [58.720142291102135]
We present a neural network autoencoder approach for extracting a telluric transmission spectrum from a large set of high-precision observed solar spectra from the HARPS-N radial velocity spectrograph.
arXiv Detail & Related papers (2021-11-17T12:54:48Z) - MassFormer: Tandem Mass Spectrum Prediction for Small Molecules using
Graph Transformers [3.2951121243459522]
Tandem mass spectra capture fragmentation patterns that provide key structural information about a molecule.
For over seventy years, spectrum prediction has remained a key challenge in the field.
We propose a new model, MassFormer, for accurately predicting tandem mass spectra.
arXiv Detail & Related papers (2021-11-08T20:55:15Z) - Spectral-Spatial Global Graph Reasoning for Hyperspectral Image
Classification [50.899576891296235]
Convolutional neural networks have been widely applied to hyperspectral image classification.
Recent methods attempt to address this issue by performing graph convolutions on spatial topologies.
arXiv Detail & Related papers (2021-06-26T06:24:51Z) - Block-Approximated Exponential Random Graphs [77.4792558024487]
An important challenge in the field of exponential random graphs (ERGs) is the fitting of non-trivial ERGs on large graphs.
We propose an approximative framework to such non-trivial ERGs that result in dyadic independence (i.e., edge independent) distributions.
Our methods are scalable to sparse graphs consisting of millions of nodes.
arXiv Detail & Related papers (2020-02-14T11:42:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.