Extracting latent representations from X-ray spectra. Classification, regression, and accretion signatures of Chandra sources
- URL: http://arxiv.org/abs/2510.14102v1
- Date: Wed, 15 Oct 2025 21:20:32 GMT
- Title: Extracting latent representations from X-ray spectra. Classification, regression, and accretion signatures of Chandra sources
- Authors: Nicolò Oreste Pinciroli Vago, Juan Rafael Martínez-Galarza, Roberta Amato,
- Abstract summary: This work aims to develop a compact and physically meaningful representation of Chandra X-ray spectra using deep learning. We use a transformer-based autoencoder to compress X-ray spectra. We evaluate the learned representation in terms of spectral reconstruction accuracy, clustering performance, and correlation with physical quantities.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The study of X-ray spectra is crucial to understanding the physical nature of astrophysical sources. Machine learning methods can extract compact and informative representations of data from large datasets. The Chandra Source Catalog (CSC) provides a rich archive of X-ray spectral data, which remains largely underexplored in this context. This work aims to develop a compact and physically meaningful representation of Chandra X-ray spectra using deep learning. To verify that the learned representation captures relevant information, we evaluate it through classification, regression, and interpretability analyses. We use a transformer-based autoencoder to compress X-ray spectra. The input spectra, drawn from the CSC, include only high-significance detections. Astrophysical source types and physical summary statistics are compiled from external catalogs. We evaluate the learned representation in terms of spectral reconstruction accuracy, clustering performance on 8 known astrophysical source classes, and correlation with physical quantities such as hardness ratios and hydrogen column density ($N_H$). The autoencoder accurately reconstructs spectra with 8 latent variables. Clustering in the latent space yields a balanced classification accuracy of $\sim$40% across the 8 source classes, increasing to $\sim$69% when restricted to AGNs and stellar-mass compact objects exclusively. Moreover, latent features correlate with non-linear combinations of spectral fluxes, suggesting that the compressed representation encodes physically relevant information. The proposed autoencoder-based pipeline is a powerful tool for the representation and interpretation of X-ray spectra, providing a compact latent space that supports both classification and the estimation of physical properties. This work demonstrates the potential of deep learning for spectral studies and uncovering new patterns in X-ray data.
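Two quantities named in the abstract, balanced classification accuracy and hardness ratio, can be illustrated with a minimal sketch. The abstract does not state the exact definitions the authors use, so the code below assumes the common conventions: balanced accuracy as the mean of per-class recall, and hardness ratio as (H - S) / (H + S) for a hard-band flux H and a soft-band flux S. The function names and toy labels are hypothetical.

```python
from collections import defaultdict


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so each class counts equally regardless of size."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


def hardness_ratio(hard_flux, soft_flux):
    """Assumed convention (H - S) / (H + S): -1 is all soft, +1 is all hard."""
    return (hard_flux - soft_flux) / (hard_flux + soft_flux)


# Hypothetical toy labels: 8 AGN and 2 XRB, with a classifier that always says AGN.
y_true = ["AGN"] * 8 + ["XRB"] * 2
y_pred = ["AGN"] * 10

print(balanced_accuracy(y_true, y_pred))  # 0.5, although plain accuracy is 0.8
print(hardness_ratio(3.0, 1.0))           # 0.5
```

Under this definition, a classifier that always predicts a single class scores 1/8 = 12.5% on 8 classes, which is the baseline against which the reported ~40% should be read.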
Related papers
- Augmenting representations with scientific papers [0.820984376071696]
Astronomers have acquired vast repositories of multimodal data, including images, spectra, and time series. These data sources are rarely systematically integrated. This work introduces a contrastive learning framework designed to align X-ray spectra with domain knowledge extracted from scientific literature.
arXiv Detail & Related papers (2026-03-04T19:04:45Z)
- SpecCLIP: Aligning and Translating Spectroscopic Measurements for Stars [1.4217538206528657]
We present SpecCLIP, a foundation model framework that extends LLM-inspired methodologies to stellar spectral analysis. By training foundation models on large-scale spectral datasets, our goal is to learn robust and informative embeddings that support diverse downstream applications. We demonstrate that fine-tuning these models on moderate-sized labeled datasets improves adaptability to tasks such as stellar-parameter estimation and chemical-abundance determination.
arXiv Detail & Related papers (2025-07-02T17:49:52Z)
- CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis [69.02751635551724]
Spectral imaging offers promising applications across diverse domains, including medicine and urban scene understanding. Variability in channel dimensionality and captured wavelengths among spectral cameras impedes the development of AI-driven methodologies. We introduce CARL, a model for Camera-Agnostic Representation Learning across RGB, multispectral, and hyperspectral imaging modalities.
arXiv Detail & Related papers (2025-04-27T13:06:40Z)
- Datacube segmentation via Deep Spectral Clustering [76.48544221010424]
Extended Vision techniques often pose a challenge in their interpretation.
The huge dimensionality of data cube spectra poses a complex task in its statistical interpretation.
In this paper, we explore the possibility of applying unsupervised clustering methods in encoded space.
A statistical dimensional reduction is performed by an ad hoc trained (Variational) AutoEncoder, while the clustering process is performed by a (learnable) iterative K-Means clustering algorithm.
arXiv Detail & Related papers (2024-01-31T09:31:28Z)
- Unsupervised Machine Learning for the Classification of Astrophysical X-ray Sources [44.99833362998488]
We develop an unsupervised machine learning approach to provide probabilistic classes to Chandra Source Catalog sources.
We provide a catalog of probabilistic classes for 8,756 sources, comprising a total of 14,507 detections.
We investigate the consistency between the distribution of features among classified objects and well-established astrophysical hypotheses.
arXiv Detail & Related papers (2024-01-22T18:42:31Z)
- Hodge-Aware Contrastive Learning [101.56637264703058]
Simplicial complexes prove effective in modeling data with multiway dependencies.
We develop a contrastive self-supervised learning approach for processing simplicial data.
arXiv Detail & Related papers (2023-09-14T00:40:07Z)
- Object Detection in Hyperspectral Image via Unified Spectral-Spatial Feature Aggregation [55.9217962930169]
We present S2ADet, an object detector that harnesses the rich spectral and spatial complementary information inherent in hyperspectral images.
S2ADet surpasses existing state-of-the-art methods, achieving robust and reliable results.
arXiv Detail & Related papers (2023-06-14T09:01:50Z)
- Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet Transmission Spectra [68.8204255655161]
We focus on unsupervised techniques for analyzing spectral data from transiting exoplanets.
We show that there is a high degree of correlation in the spectral data, which calls for appropriate low-dimensional representations.
We uncover interesting structures in the principal component basis, namely, well-defined branches corresponding to different chemical regimes.
arXiv Detail & Related papers (2022-01-07T22:26:33Z)
- Unsupervised Spectral Unmixing For Telluric Correction Using A Neural Network Autoencoder [58.720142291102135]
We present a neural network autoencoder approach for extracting a telluric transmission spectrum from a large set of high-precision observed solar spectra from the HARPS-N radial velocity spectrograph.
arXiv Detail & Related papers (2021-11-17T12:54:48Z)
- Spectral Pyramid Graph Attention Network for Hyperspectral Image Classification [5.572542792318872]
Convolutional neural networks (CNN) have made significant advances in hyperspectral image (HSI) classification.
Standard convolutional kernels neglect intrinsic connections between data points, resulting in poor region delineation and small spurious predictions.
This paper presents a novel architecture which explicitly addresses these two issues.
arXiv Detail & Related papers (2020-01-20T13:49:43Z)
This list is automatically generated from the titles and abstracts of the papers on this site.