Unsupervised Machine Learning for the Classification of Astrophysical
X-ray Sources
- URL: http://arxiv.org/abs/2401.12203v1
- Date: Mon, 22 Jan 2024 18:42:31 GMT
- Title: Unsupervised Machine Learning for the Classification of Astrophysical
X-ray Sources
- Authors: V\'ictor Samuel P\'erez-D\'iaz, Juan Rafael Mart\'inez-Galarza,
Alexander Caicedo, Raffaele D'Abrusco
- Abstract summary: We develop an unsupervised machine learning approach to provide probabilistic classes to Chandra Source Catalog sources.
We provide a catalog of probabilistic classes for 8,756 sources, comprising a total of 14,507 detections.
We investigate the consistency between the distribution of features among classified objects and well-established astrophysical hypotheses.
- Score: 44.99833362998488
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The automatic classification of X-ray detections is a necessary step in
extracting astrophysical information from compiled catalogs of astrophysical
sources. Classification is useful for the study of individual objects,
statistics for population studies, as well as for anomaly detection, i.e., the
identification of new unexplored phenomena, including transients and spectrally
extreme sources. Despite the importance of this task, classification remains
challenging in X-ray astronomy due to the lack of optical counterparts and
representative training sets. We develop an alternative methodology that
employs an unsupervised machine learning approach to provide probabilistic
classes to Chandra Source Catalog sources with a limited number of labeled
sources, and without ancillary information from optical and infrared catalogs.
We provide a catalog of probabilistic classes for 8,756 sources, comprising a
total of 14,507 detections, and demonstrate the success of the method at
identifying emission from young stellar objects, as well as distinguishing
between small-scale and large-scale compact accretors with a significant level
of confidence. We investigate the consistency between the distribution of
features among classified objects and well-established astrophysical hypotheses
such as the unified AGN model. This provides interpretability to the
probabilistic classifier. Code and tables are available publicly through
GitHub. We provide a web playground for readers to explore our final
classification at https://umlcaxs-playground.streamlit.app.
Related papers
- Detecting and Classifying Flares in High-Resolution Solar Spectra with Supervised Machine Learning [0.0]
We present a standardized procedure to classify solar flares with the aid of supervised machine learning.
Using flare data from the RHESSI mission and solar spectra from the HARPS-N instrument, we trained several supervised machine learning models.
The best-trained model achieves an average aggregate accuracy score of 0.65, and categorical accuracy scores of over 0.70 for the no-flare and weak-flare classes.
arXiv Detail & Related papers (2024-06-21T18:52:03Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Learning Invariant Molecular Representation in Latent Discrete Space [52.13724532622099]
We propose a new framework for learning molecular representations that exhibit invariance and robustness against distribution shifts.
Our model achieves stronger generalization against state-of-the-art baselines in the presence of various distribution shifts.
arXiv Detail & Related papers (2023-10-22T04:06:44Z) - Searching for Novel Chemistry in Exoplanetary Atmospheres using Machine
Learning for Anomaly Detection [1.8434042562191815]
We advocate the application of machine learning (ML) techniques for anomaly (novelty) detection to exoplanet transit spectra.
We demonstrate the feasibility of two popular anomaly detection methods on a large public database of synthetic spectra.
arXiv Detail & Related papers (2023-08-15T07:19:54Z) - Semi-Supervised Domain Adaptation for Cross-Survey Galaxy Morphology
Classification and Anomaly Detection [57.85347204640585]
We develop a Universal Domain Adaptation method DeepAstroUDA.
It can be applied to datasets with different types of class overlap.
For the first time, we demonstrate the successful use of domain adaptation on two very different observational datasets.
arXiv Detail & Related papers (2022-11-01T18:07:21Z) - Improving Astronomical Time-series Classification via Data Augmentation
with Generative Adversarial Networks [1.2891210250935146]
We propose a data augmentation methodology based on Generative Adrial Networks (GANs) to generate a variety of synthetic light curves from variable stars.
The classification accuracy of variable stars is improved significantly when training with synthetic data and testing with real data.
arXiv Detail & Related papers (2022-05-13T16:39:54Z) - "Flux+Mutability": A Conditional Generative Approach to One-Class
Classification and Anomaly Detection [4.371371475735559]
Anomaly detection is at the forefront of finding new physics beyond the Standard Model.
This paper details the implementation of a novel Machine Learning architecture, called Flux+Mutability.
arXiv Detail & Related papers (2022-04-19T01:55:58Z) - Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet
Transmission Spectra [68.8204255655161]
We focus on unsupervised techniques for analyzing spectral data from transiting exoplanets.
We show that there is a high degree of correlation in the spectral data, which calls for appropriate low-dimensional representations.
We uncover interesting structures in the principal component basis, namely, well-defined branches corresponding to different chemical regimes.
arXiv Detail & Related papers (2022-01-07T22:26:33Z) - Informative Bayesian model selection for RR Lyrae star classifiers [0.0]
We develop a method based on an informative marginal likelihood to evaluate variable star classifiers.
We perform experiments with a set of Bayesian Logistic Regressions, which are trained to classify RR Lyraes.
Our methodology provides a more rigorous alternative to assess machine learning models using astronomical knowledge.
arXiv Detail & Related papers (2021-05-24T20:55:04Z) - Saliency-based Weighted Multi-label Linear Discriminant Analysis [101.12909759844946]
We propose a new variant of Linear Discriminant Analysis (LDA) to solve multi-label classification tasks.
The proposed method is based on a probabilistic model for defining the weights of individual samples.
The Saliency-based weighted Multi-label LDA approach is shown to lead to performance improvements in various multi-label classification problems.
arXiv Detail & Related papers (2020-04-08T19:40:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.