Classifying CMB time-ordered data through deep neural networks
- URL: http://arxiv.org/abs/2004.06226v1
- Date: Mon, 13 Apr 2020 22:34:30 GMT
- Title: Classifying CMB time-ordered data through deep neural networks
- Authors: Felipe Rojas, Lo\"ic Maurin, Rolando D\"unner, Karim Pichara
- Abstract summary: We propose a supervised machine learning model to classify detectors of CMB experiments.
The model corresponds to a deep convolutional neural network.
Our model is about 10x faster than the current pipeline, making it potentially suitable for real-time implementations.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The Cosmic Microwave Background (CMB) has been measured over a wide range of
multipoles. Experiments with arc-minute resolution like the Atacama Cosmology
Telescope (ACT) have contributed to the measurement of primary and secondary
anisotropies, leading to remarkable scientific discoveries. Such findings
require careful data selection in order to remove poorly-behaved detectors and
unwanted contaminants. The current data classification methodology used by ACT
relies on several statistical parameters that are assessed and fine-tuned by an
expert. This method is highly time-consuming and band or season-specific, which
makes it less scalable and efficient for future CMB experiments. In this work,
we propose a supervised machine learning model to classify detectors of CMB
experiments. The model corresponds to a deep convolutional neural network. We
tested our method on real ACT data, using the 2008 season, 148 GHz, as training
set with labels provided by the ACT data selection software. The model learns
to classify time-streams starting directly from the raw data. For the season
and frequency considered during the training, we find that our classifier
reaches a precision of 99.8%. For 220 and 280 GHz data, season 2008, we
obtained 99.4% and 97.5% of precision, respectively. Finally, we performed a
cross-season test over 148 GHz data from 2009 and 2010 for which our model
reaches a precision of 99.8% and 99.5%, respectively. Our model is about 10x
faster than the current pipeline, making it potentially suitable for real-time
implementations.
Related papers
- DataDecide: How to Predict Best Pretraining Data with Small Experiments [67.95896457895404]
We release models, data, and evaluations in DataDecide -- the most extensive open suite of models over differences in data and scale.
We conduct controlled pretraining experiments across 25 corpora with differing sources, deduplication, and filtering up to 100B tokens, model sizes up to 1B parameters, and 3 random seeds.
arXiv Detail & Related papers (2025-04-15T17:02:15Z) - Improved detection of discarded fish species through BoxAL active learning [0.2544632696242629]
In this study, we present an active learning technique, named BoxAL, which includes estimation of epistemic certainty of the Faster R-CNN object-detection model.
The method allows selecting the most uncertain training images from an unlabeled pool, which are then used to train the object-detection model.
Our study additionally showed that the sampled new data is more valuable for training than the remaining unlabeled data.
arXiv Detail & Related papers (2024-10-07T10:01:30Z) - Comparison of gait phase detection using traditional machine learning
and deep learning techniques [3.11526333124308]
This study proposes a few Machine Learning (ML) based models on lower-limb EMG data for human walking.
The results show up to 75% average accuracy for traditional ML models and 79% for Deep Learning (DL) model.
arXiv Detail & Related papers (2024-03-07T10:05:09Z) - Deepbet: Fast brain extraction of T1-weighted MRI using Convolutional
Neural Networks [0.40125518029941076]
deepbet builds a fast, high-precision brain extraction tool called deepbet.
Deepbet uses LinkNet, a modern UNet architecture, in a two stage prediction process.
Model accelerates brain extraction by a factor of 10 compared to current methods.
arXiv Detail & Related papers (2023-08-14T08:39:09Z) - Machine Learning Force Fields with Data Cost Aware Training [94.78998399180519]
Machine learning force fields (MLFF) have been proposed to accelerate molecular dynamics (MD) simulation.
Even for the most data-efficient MLFFs, reaching chemical accuracy can require hundreds of frames of force and energy labels.
We propose a multi-stage computational framework -- ASTEROID, which lowers the data cost of MLFFs by leveraging a combination of cheap inaccurate data and expensive accurate data.
arXiv Detail & Related papers (2023-06-05T04:34:54Z) - Post-training Model Quantization Using GANs for Synthetic Data
Generation [57.40733249681334]
We investigate the use of synthetic data as a substitute for the calibration with real data for the quantization method.
We compare the performance of models quantized using data generated by StyleGAN2-ADA and our pre-trained DiStyleGAN, with quantization using real data and an alternative data generation method based on fractal images.
arXiv Detail & Related papers (2023-05-10T11:10:09Z) - Efficient Testing of Deep Neural Networks via Decision Boundary Analysis [28.868479656437145]
We propose a novel technique, named Aries, that can estimate the performance of DNNs on new unlabeled data.
The estimated accuracy by Aries is only 0.03% -- 2.60% (on average 0.61%) off the true accuracy.
arXiv Detail & Related papers (2022-07-22T08:39:10Z) - ALT-MAS: A Data-Efficient Framework for Active Testing of Machine
Learning Algorithms [58.684954492439424]
We propose a novel framework to efficiently test a machine learning model using only a small amount of labeled test data.
The idea is to estimate the metrics of interest for a model-under-test using Bayesian neural network (BNN)
arXiv Detail & Related papers (2021-04-11T12:14:04Z) - Classification of COVID-19 in CT Scans using Multi-Source Transfer
Learning [91.3755431537592]
We propose the use of Multi-Source Transfer Learning to improve upon traditional Transfer Learning for the classification of COVID-19 from CT scans.
With our multi-source fine-tuning approach, our models outperformed baseline models fine-tuned with ImageNet.
Our best performing model was able to achieve an accuracy of 0.893 and a Recall score of 0.897, outperforming its baseline Recall score by 9.3%.
arXiv Detail & Related papers (2020-09-22T11:53:06Z) - Complete parameter inference for GW150914 using deep learning [0.0]
LIGO and Virgo gravitational-wave observatories have detected many exciting events over the past five years.
As the rate of detections grows with detector sensitivity, this poses a growing computational challenge for data analysis.
We apply deep learning techniques to perform fast likelihood-free Bayesian inference for gravitational waves.
arXiv Detail & Related papers (2020-08-07T18:00:02Z) - Assessing Graph-based Deep Learning Models for Predicting Flash Point [52.931492216239995]
Graph-based deep learning (GBDL) models were implemented in predicting flash point for the first time.
Average R2 and Mean Absolute Error (MAE) scores of MPNN are, respectively, 2.3% lower and 2.0 K higher than previous comparable studies.
arXiv Detail & Related papers (2020-02-26T06:10:12Z) - Radioactive data: tracing through training [130.2266320167683]
We propose a new technique, emphradioactive data, that makes imperceptible changes to this dataset such that any model trained on it will bear an identifiable mark.
Given a trained model, our technique detects the use of radioactive data and provides a level of confidence (p-value)
Our method is robust to data augmentation and backdoority of deep network optimization.
arXiv Detail & Related papers (2020-02-03T18:41:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.