Self-Driving Telescopes: Autonomous Scheduling of Astronomical
Observation Campaigns with Offline Reinforcement Learning
- URL: http://arxiv.org/abs/2311.18094v1
- Date: Wed, 29 Nov 2023 21:23:30 GMT
- Title: Self-Driving Telescopes: Autonomous Scheduling of Astronomical
Observation Campaigns with Offline Reinforcement Learning
- Authors: Franco Terranova, M. Voetberg, Brian Nord, Amanda Pagul
- Abstract summary: We use simulated data to test and compare multiple implementations of a Deep Q-Network (DQN) for the task of optimizing the schedule of observations from the Stone Edge Observatory (SEO)
We show that DQNs can achieve an average reward of 87%+-6% of the maximum achievable reward in each state on the test set.
This is the first comparison of offline RL algorithms for a particular astronomical challenge and the first open-source framework for performing such a comparison and assessment task.
- Score: 0.6976905094072174
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Modern astronomical experiments are designed to achieve multiple scientific
goals, from studies of galaxy evolution to cosmic acceleration. These goals
require data of many different classes of night-sky objects, each of which has
a particular set of observational needs. These observational needs are
typically in strong competition with one another. This poses a challenging
multi-objective optimization problem that remains unsolved. The effectiveness
of Reinforcement Learning (RL) as a valuable paradigm for training autonomous
systems has been well-demonstrated, and it may provide the basis for
self-driving telescopes capable of optimizing the scheduling for astronomy
campaigns. Simulated datasets containing examples of interactions between a
telescope and a discrete set of sky locations on the celestial sphere can be
used to train an RL model to sequentially gather data from these several
locations to maximize a cumulative reward as a measure of the quality of the
data gathered. We use simulated data to test and compare multiple
implementations of a Deep Q-Network (DQN) for the task of optimizing the
schedule of observations from the Stone Edge Observatory (SEO). We combine
multiple improvements on the DQN and adjustments to the dataset, showing that
DQNs can achieve an average reward of 87%+-6% of the maximum achievable reward
in each state on the test set. This is the first comparison of offline RL
algorithms for a particular astronomical challenge and the first open-source
framework for performing such a comparison and assessment task.
Related papers
- Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach [29.593406320684448]
Low Earth orbit (LEO) satellites are capable of gathering abundant Earth observation data (EOD) to enable different Internet of Things (IoT) applications.
To accomplish an effective EOD processing mechanism, it is imperative to investigate: 1) the challenge of processing the observed data without transmitting those large-size data to the ground because the connection between the satellites and the ground stations is intermittent, and 2) the challenge of processing the non-independent and identically distributed (non-IID) satellite data.
We propose an orbit-based spectral clustering-assisted clustered federated self-knowledge distillation (OSC-FSKD)
arXiv Detail & Related papers (2024-10-17T14:36:58Z) - Earth Observation Satellite Scheduling with Graph Neural Networks [1.1684839631276702]
This paper presents a new technique for selecting and scheduling observations based on Graph Neural Networks (GNNs) and Deep Reinforcement Learning (DRL)
Our simulations show that it is able to learn on small problem instances and generalize to larger real-world instances, with very competitive performance compared to traditional approaches.
arXiv Detail & Related papers (2024-08-27T13:10:26Z) - CUDC: A Curiosity-Driven Unsupervised Data Collection Method with
Adaptive Temporal Distances for Offline Reinforcement Learning [62.58375643251612]
We propose a Curiosity-driven Unsupervised Data Collection (CUDC) method to expand feature space using adaptive temporal distances for task-agnostic data collection.
With this adaptive reachability mechanism in place, the feature representation can be diversified, and the agent can navigate itself to collect higher-quality data with curiosity.
Empirically, CUDC surpasses existing unsupervised methods in efficiency and learning performance in various downstream offline RL tasks of the DeepMind control suite.
arXiv Detail & Related papers (2023-12-19T14:26:23Z) - Accelerating exploration and representation learning with offline
pre-training [52.6912479800592]
We show that exploration and representation learning can be improved by separately learning two different models from a single offline dataset.
We show that learning a state representation using noise-contrastive estimation and a model of auxiliary reward can significantly improve the sample efficiency on the challenging NetHack benchmark.
arXiv Detail & Related papers (2023-03-31T18:03:30Z) - Semi-Supervised Domain Adaptation for Cross-Survey Galaxy Morphology
Classification and Anomaly Detection [57.85347204640585]
We develop a Universal Domain Adaptation method DeepAstroUDA.
It can be applied to datasets with different types of class overlap.
For the first time, we demonstrate the successful use of domain adaptation on two very different observational datasets.
arXiv Detail & Related papers (2022-11-01T18:07:21Z) - Improving Astronomical Time-series Classification via Data Augmentation
with Generative Adversarial Networks [1.2891210250935146]
We propose a data augmentation methodology based on Generative Adrial Networks (GANs) to generate a variety of synthetic light curves from variable stars.
The classification accuracy of variable stars is improved significantly when training with synthetic data and testing with real data.
arXiv Detail & Related papers (2022-05-13T16:39:54Z) - First Full-Event Reconstruction from Imaging Atmospheric Cherenkov
Telescope Real Data with Deep Learning [55.41644538483948]
The Cherenkov Telescope Array is the future of ground-based gamma-ray astronomy.
Its first prototype telescope built on-site, the Large Size Telescope 1, is currently under commissioning and taking its first scientific data.
We present for the first time the development of a full-event reconstruction based on deep convolutional neural networks and its application to real data.
arXiv Detail & Related papers (2021-05-31T12:51:42Z) - DeepMerge II: Building Robust Deep Learning Algorithms for Merging
Galaxy Identification Across Domains [0.0]
In astronomy, neural networks are often trained on simulation data with the prospect of being used on telescope observations.
We show that the addition of each domain adaptation technique improves the performance of a classifier when compared to conventional deep learning algorithms.
We demonstrate this on two examples: between two Illustris-1 simulated datasets of distant merging galaxies, and between Illustris-1 simulated data of nearby merging galaxies and observed data from the Sloan Digital Sky Survey.
arXiv Detail & Related papers (2021-03-02T00:24:10Z) - Batch Exploration with Examples for Scalable Robotic Reinforcement
Learning [63.552788688544254]
Batch Exploration with Examples (BEE) explores relevant regions of the state-space guided by a modest number of human provided images of important states.
BEE is able to tackle challenging vision-based manipulation tasks both in simulation and on a real Franka robot.
arXiv Detail & Related papers (2020-10-22T17:49:25Z) - A Maximum Independent Set Method for Scheduling Earth Observing
Satellite Constellations [41.013477422930755]
This paper introduces a new approach for solving the satellite scheduling problem by generating an infeasibility-based graph representation of the problem.
It is tested on a scenarios of up to 10,000 requested imaging locations for the Skysat constellation of optical satellites as well as simulated constellations of up to 24 satellites.
arXiv Detail & Related papers (2020-08-15T19:32:21Z) - ClusterVO: Clustering Moving Instances and Estimating Visual Odometry
for Self and Surroundings [54.33327082243022]
ClusterVO is a stereo Visual Odometry which simultaneously clusters and estimates the motion of both ego and surrounding rigid clusters/objects.
Unlike previous solutions relying on batch input or imposing priors on scene structure or dynamic object models, ClusterVO is online, general and thus can be used in various scenarios including indoor scene understanding and autonomous driving.
arXiv Detail & Related papers (2020-03-29T09:06:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.