LCDC: Bridging Science and Machine Learning for Light Curve Analysis
- URL: http://arxiv.org/abs/2504.10550v1
- Date: Mon, 14 Apr 2025 07:50:55 GMT
- Title: LCDC: Bridging Science and Machine Learning for Light Curve Analysis
- Authors: Daniel Kyselica, Tomáš Hrobár, Jiří Šilha, Roman Ďurikovič, Marek Šuppa,
- Abstract summary: Python-based toolkit enables preprocessing, analysis, and machine learning applications of light curve data.<n>First standardized dataset for rocket body classification, RoBo6, used to train and evaluate several benchmark machine learning models.<n>These use cases highlight LCDC's potential to advance space debris characterization and promote sustainable space exploration.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The characterization and analysis of light curves are vital for understanding the physical and rotational properties of artificial space objects such as satellites, rocket stages, and space debris. This paper introduces the Light Curve Dataset Creator (LCDC), a Python-based toolkit designed to facilitate the preprocessing, analysis, and machine learning applications of light curve data. LCDC enables seamless integration with publicly available datasets, such as the newly introduced Mini Mega Tortora (MMT) database. Moreover, it offers data filtering, transformation, as well as feature extraction tooling. To demonstrate the toolkit's capabilities, we created the first standardized dataset for rocket body classification, RoBo6, which was used to train and evaluate several benchmark machine learning models, addressing the lack of reproducibility and comparability in recent studies. Furthermore, the toolkit enables advanced scientific analyses, such as surface characterization of the Atlas 2AS Centaur and the rotational dynamics of the Delta 4 rocket body, by streamlining data preprocessing, feature extraction, and visualization. These use cases highlight LCDC's potential to advance space debris characterization and promote sustainable space exploration. Additionally, they highlight the toolkit's ability to enable AI-focused research within the space debris community.
Related papers
- Towards a Universal Vibration Analysis Dataset: A Framework for Transfer Learning in Predictive Maintenance and Structural Health Monitoring [0.36136619420474764]
This research will include a broader range of vibration signals from multiple types of machinery.
By mirroring the success of ImageNet in visual computing, this dataset has the potential to improve the development of intelligent systems in industrial applications.
arXiv Detail & Related papers (2025-04-15T19:57:26Z) - Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation [67.23953699167274]
Self-supervised learning (SSL) has enabled the development of vision foundation models for Earth Observation (EO)<n>In EO, this challenge is amplified by the redundancy and heavy-tailed distributions common in satellite imagery.<n>We propose a dynamic dataset pruning strategy designed to improve SSL pre-training by maximizing dataset diversity and balance.
arXiv Detail & Related papers (2025-04-09T15:13:26Z) - Training Datasets Generation for Machine Learning: Application to Vision Based Navigation [0.0]
Vision Based Navigation consists in utilizing cameras as precision sensors for GNC after extracting information from images.<n>To enable the adoption of machine learning for space applications, one of obstacles is the demonstration that available training datasets are adequate to validate the algorithms.<n>The objective of the study is to generate datasets of images and metadata suitable for training machine learning algorithms.
arXiv Detail & Related papers (2024-09-17T17:34:24Z) - SuperBench: A Super-Resolution Benchmark Dataset for Scientific Machine
Learning [42.76583514565341]
We introduce SuperBench, the first benchmark dataset featuring high-resolution datasets.
We focus on validating spatial robustness SR data-centric and physics-preserved perspectives.
We identify limitations of SR methods in capturing fine-scale features and preserving fundamental physical properties and constraints in scientific data.
arXiv Detail & Related papers (2023-06-24T22:39:33Z) - Closing the loop: Autonomous experiments enabled by
machine-learning-based online data analysis in synchrotron beamline
environments [80.49514665620008]
Machine learning can be used to enhance research involving large or rapidly generated datasets.
In this study, we describe the incorporation of ML into a closed-loop workflow for X-ray reflectometry (XRR)
We present solutions that provide an elementary data analysis in real time during the experiment without introducing the additional software dependencies in the beamline control software environment.
arXiv Detail & Related papers (2023-06-20T21:21:19Z) - Exoplanet Detection by Machine Learning with Data Augmentation [0.0]
Deep learning has potential to automate parts of the exoplanet detection pipeline.
Smallness of available datasets makes it difficult to realize the level of performance one expects from powerful network architectures.
We demonstrate that data augmentation has a potential to improve model performance for the exoplanet detection problem.
arXiv Detail & Related papers (2022-11-28T17:35:16Z) - Learning to Simulate Realistic LiDARs [66.7519667383175]
We introduce a pipeline for data-driven simulation of a realistic LiDAR sensor.
We show that our model can learn to encode realistic effects such as dropped points on transparent surfaces.
We use our technique to learn models of two distinct LiDAR sensors and use them to improve simulated LiDAR data accordingly.
arXiv Detail & Related papers (2022-09-22T13:12:54Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z) - A Survey of Learning on Small Data: Generalization, Optimization, and
Challenge [101.27154181792567]
Learning on small data that approximates the generalization ability of big data is one of the ultimate purposes of AI.
This survey follows the active sampling theory under a PAC framework to analyze the generalization error and label complexity of learning on small data.
Multiple data applications that may benefit from efficient small data representation are surveyed.
arXiv Detail & Related papers (2022-07-29T02:34:19Z) - Satellite Image Time Series Analysis for Big Earth Observation Data [50.591267188664666]
This paper describes sits, an open-source R package for satellite image time series analysis using machine learning.
We show that this approach produces high accuracy for land use and land cover maps through a case study in the Cerrado biome.
arXiv Detail & Related papers (2022-04-24T15:23:25Z) - Information-Theoretic Odometry Learning [83.36195426897768]
We propose a unified information theoretic framework for learning-motivated methods aimed at odometry estimation.
The proposed framework provides an elegant tool for performance evaluation and understanding in information-theoretic language.
arXiv Detail & Related papers (2022-03-11T02:37:35Z) - GalaxAI: Machine learning toolbox for interpretable analysis of
spacecraft telemetry data [48.42042893355919]
GalaxAI is a versatile machine learning toolbox for analysis of spacecraft telemetry data.
It employs various machine learning algorithms for multivariate time series analyses, classification, regression and structured output prediction.
We show the utility and versatility of GalaxAI on two use-cases concerning two different spacecraft.
arXiv Detail & Related papers (2021-08-03T10:45:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.