Machine Learning for automatic identification of new minor species
- URL: http://arxiv.org/abs/2012.08175v1
- Date: Tue, 15 Dec 2020 09:51:19 GMT
- Title: Machine Learning for automatic identification of new minor species
- Authors: Frederic Schmidt, Guillaume Cruz Mermy, Justin Erwin, Severine Robert,
Lori Neary, Ian R. Thomas, Frank Daerden, Bojan Ristic, Manish R. Patel,
Giancarlo Bellucci, Jose-Juan Lopez-Moreno, Ann-Carine Vandaele
- Abstract summary: We propose a new method based on unsupervised machine learning, to automatically detect new minor species.
We approximate the dataset non-linearity by a linear mixture of abundance and source spectra (endmembers)
Our approach is able to detect chemical compounds present in form of 100 hidden spectra out of $104$, at 1.5 times the noise level.
- Score: 1.617647375371818
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: One of the main difficulties to analyze modern spectroscopic datasets is due
to the large amount of data. For example, in atmospheric transmittance
spectroscopy, the solar occultation channel (SO) of the NOMAD instrument
onboard the ESA ExoMars2016 satellite called Trace Gas Orbiter (TGO) had
produced $\sim$10 millions of spectra in 20000 acquisition sequences since the
beginning of the mission in April 2018 until 15 January 2020. Other datasets
are even larger with $\sim$billions of spectra for OMEGA onboard Mars Express
or CRISM onboard Mars Reconnaissance Orbiter. Usually, new lines are discovered
after a long iterative process of model fitting and manual residual analysis.
Here we propose a new method based on unsupervised machine learning, to
automatically detect new minor species. Although precise quantification is out
of scope, this tool can also be used to quickly summarize the dataset, by
giving few endmembers ("source") and their abundances. We approximate the
dataset non-linearity by a linear mixture of abundance and source spectra
(endmembers). We used unsupervised source separation in form of non-negative
matrix factorization to estimate those quantities. Several methods are tested
on synthetic and simulation data. Our approach is dedicated to detect minor
species spectra rather than precisely quantifying them. On synthetic example,
this approach is able to detect chemical compounds present in form of 100
hidden spectra out of $10^4$, at 1.5 times the noise level. Results on
simulated spectra of NOMAD-SO targeting CH$_{4}$ show that detection limits
goes in the range of 100-500 ppt in favorable conditions. Results on real
martian data from NOMAD-SO show that CO$_{2}$ and H$_{2}$O are present, as
expected, but CH$_{4}$ is absent. Nevertheless, we confirm a set of new
unexpected lines in the database, attributed by ACS instrument Team to the
CO$_{2}$ magnetic dipole.
Related papers
- Real-time gravitational-wave inference for binary neutron stars using machine learning [71.29593576787549]
We present a machine learning framework that performs complete BNS inference in just one second without making any approximations.
Our approach enhances multi-messenger observations by providing (i) accurate localization even before the merger; (ii) improved localization precision by $sim30%$ compared to approximate low-latency methods; and (iii) detailed information on luminosity distance, inclination, and masses.
arXiv Detail & Related papers (2024-07-12T18:00:02Z) - Machine Learning for Exoplanet Detection in High-Contrast Spectroscopy: Revealing Exoplanets by Leveraging Hidden Molecular Signatures in Cross-Correlated Spectra with Convolutional Neural Networks [0.0]
Cross-correlation for spectroscopy uses molecular templates to isolate a planet's spectrum from its host star.
We introduce machine learning for cross-correlation spectroscopy (MLCCS)
The method aims to leverage weak assumptions on exoplanet characterisation, such as the presence of specific molecules in atmospheres, to improve detection sensitivity for exoplanets.
arXiv Detail & Related papers (2024-05-22T09:25:58Z) - Extracting the gamma-ray source-count distribution below the Fermi-LAT detection limit with deep learning [44.99833362998488]
We train a convolutional neural network on synthetic 2-dimensional sky-maps and incorporate the Fermi-LAT instrumental response functions.
The trained neural network is then applied to the Fermi-LAT data, from which we estimate the source count distribution down to flux levels a factor of 50 below the Fermi-LAT threshold.
arXiv Detail & Related papers (2023-02-03T19:00:51Z) - Unsupervised Spectral Unmixing For Telluric Correction Using A Neural
Network Autoencoder [58.720142291102135]
We present a neural network autoencoder approach for extracting a telluric transmission spectrum from a large set of high-precision observed solar spectra from the HARPS-N radial velocity spectrograph.
arXiv Detail & Related papers (2021-11-17T12:54:48Z) - Primordial non-Gaussianity from the Completed SDSS-IV extended Baryon
Oscillation Spectroscopic Survey I: Catalogue Preparation and Systematic
Mitigation [3.2855185490071444]
We investigate the large-scale clustering of the final spectroscopic sample of quasars from the recently completed extended Baryon Oscillation Spectroscopic Survey (eBOSS)
We develop a neural network-based approach to mitigate spurious fluctuations in the density field caused by spatial variations in the quality of the imaging data used to select targets for follow-up spectroscopy.
arXiv Detail & Related papers (2021-06-25T16:01:19Z) - AGNet: Weighing Black Holes with Machine Learning [2.598391092244943]
Supermassive black holes (SMBHs) are ubiquitously found at the centers of most galaxies.
Traditional methods require spectral data which is expensive to gather.
We present an algorithm that weighs SMBHs using quasar light time series.
arXiv Detail & Related papers (2020-11-30T18:30:24Z) - DeepShadows: Separating Low Surface Brightness Galaxies from Artifacts
using Deep Learning [70.80563014913676]
We investigate the use of convolutional neural networks (CNNs) for the problem of separating low-surface-brightness galaxies from artifacts in survey images.
We show that CNNs offer a very promising path in the quest to study the low-surface-brightness universe.
arXiv Detail & Related papers (2020-11-24T22:51:08Z) - Linear-Sample Learning of Low-Rank Distributions [56.59844655107251]
We show that learning $ktimes k$, rank-$r$, matrices to normalized $L_1$ distance requires $Omega(frackrepsilon2)$ samples.
We propose an algorithm that uses $cal O(frackrepsilon2log2fracepsilon)$ samples, a number linear in the high dimension, and nearly linear in the matrices, typically low, rank proofs.
arXiv Detail & Related papers (2020-09-30T19:10:32Z) - Deep learning for gravitational-wave data analysis: A resampling
white-box approach [62.997667081978825]
We apply Convolutional Neural Networks (CNNs) to detect gravitational wave (GW) signals of compact binary coalescences, using single-interferometer data from LIGO detectors.
CNNs were quite precise to detect noise but not sensitive enough to recall GW signals, meaning that CNNs are better for noise reduction than generation of GW triggers.
arXiv Detail & Related papers (2020-09-09T03:28:57Z) - Cycle-StarNet: Bridging the gap between theory and data by leveraging
large datasets [0.0]
Current automated methods for analyzing spectra are either (a) data-driven, which requires prior knowledge of stellar parameters and elemental abundances, or (b) based on theoretical synthetic models that are susceptible to the gap between theory and practice.
We present a hybrid generative domain adaptation method that turns simulated stellar spectra into realistic spectra by applying unsupervised learning to large spectroscopic surveys.
arXiv Detail & Related papers (2020-07-06T23:06:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.