Efficient labeling of solar flux evolution videos by a deep learning
model
- URL: http://arxiv.org/abs/2308.14976v1
- Date: Tue, 29 Aug 2023 02:05:40 GMT
- Title: Efficient labeling of solar flux evolution videos by a deep learning
model
- Authors: Subhamoy Chatterjee, Andr\'es Mu\~noz-Jaramillo, and Derek A. Lamb
- Abstract summary: We show that convolutional neural networks (CNNs) can be leveraged to improve the quality of data labeling.
We train CNNs using crude labels, manually verify, correct labeling vs. CNN disagreements, and repeat this process until convergence.
We find that a high-quality labeled dataset, derived through this iterative process, reduces the necessary manual verification by 50%.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning (ML) is becoming a critical tool for interrogation of large
complex data. Labeling, defined as the process of adding meaningful
annotations, is a crucial step of supervised ML. However, labeling datasets is
time consuming. Here we show that convolutional neural networks (CNNs), trained
on crudely labeled astronomical videos, can be leveraged to improve the quality
of data labeling and reduce the need for human intervention. We use videos of
the solar magnetic field, crudely labeled into two classes: emergence or
non-emergence of bipolar magnetic regions (BMRs), based on their first
detection on the solar disk. We train CNNs using crude labels, manually verify,
correct labeling vs. CNN disagreements, and repeat this process until
convergence. Traditionally, flux emergence labelling is done manually. We find
that a high-quality labeled dataset, derived through this iterative process,
reduces the necessary manual verification by 50%. Furthermore, by gradually
masking the videos and looking for maximum change in CNN inference, we locate
BMR emergence time without retraining the CNN. This demonstrates the
versatility of CNNs for simplifying the challenging task of labeling complex
dynamic events.
Related papers
- Granular-ball Representation Learning for Deep CNN on Learning with Label Noise [14.082510085545582]
We propose a general granular-ball computing (GBC) module that can be embedded into a CNN model.
In this study, we split the input samples as $gb$ samples at feature-level, each of which can correspond to multiple samples with varying numbers and share one single label.
Experiments demonstrate that the proposed method can improve the robustness of CNN models with no additional data or optimization.
arXiv Detail & Related papers (2024-09-05T05:18:31Z) - Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection with Single Point Supervision [32.27190522495652]
We make the first attempt to achieve infrared small target detection with point-level supervision.
We propose a label evolution framework named label evolution with single point supervision (LESPS) to progressively expand the point label.
We conduct extensive experiments with insightful visualizations to validate the effectiveness of our method.
arXiv Detail & Related papers (2023-04-04T02:55:57Z) - Label Noise-Resistant Mean Teaching for Weakly Supervised Fake News
Detection [93.6222609806278]
We propose a novel label noise-resistant mean teaching approach (LNMT) for weakly supervised fake news detection.
LNMT leverages unlabeled news and feedback comments of users to enlarge the amount of training data.
LNMT establishes a mean teacher framework equipped with label propagation and label reliability estimation.
arXiv Detail & Related papers (2022-06-10T16:01:58Z) - Informative Pseudo-Labeling for Graph Neural Networks with Few Labels [12.83841767562179]
Graph Neural Networks (GNNs) have achieved state-of-the-art results for semi-supervised node classification on graphs.
The challenge of how to effectively learn GNNs with very few labels is still under-explored.
We propose a novel informative pseudo-labeling framework, called InfoGNN, to facilitate learning of GNNs with extremely few labels.
arXiv Detail & Related papers (2022-01-20T01:49:30Z) - R$^2$-Net: Relation of Relation Learning Network for Sentence Semantic
Matching [58.72111690643359]
We propose a Relation of Relation Learning Network (R2-Net) for sentence semantic matching.
We first employ BERT to encode the input sentences from a global perspective.
Then a CNN-based encoder is designed to capture keywords and phrase information from a local perspective.
To fully leverage labels for better relation information extraction, we introduce a self-supervised relation of relation classification task.
arXiv Detail & Related papers (2020-12-16T13:11:30Z) - A CNN-based Feature Space for Semi-supervised Incremental Learning in
Assisted Living Applications [2.1485350418225244]
We propose using the feature space that results from the training dataset to automatically label problematic images.
The resulting semi-supervised incremental learning process allows improving the classification accuracy of new instances by 40%.
arXiv Detail & Related papers (2020-11-11T12:31:48Z) - Deep learning for gravitational-wave data analysis: A resampling
white-box approach [62.997667081978825]
We apply Convolutional Neural Networks (CNNs) to detect gravitational wave (GW) signals of compact binary coalescences, using single-interferometer data from LIGO detectors.
CNNs were quite precise to detect noise but not sensitive enough to recall GW signals, meaning that CNNs are better for noise reduction than generation of GW triggers.
arXiv Detail & Related papers (2020-09-09T03:28:57Z) - Temporal Calibrated Regularization for Robust Noisy Label Learning [60.90967240168525]
Deep neural networks (DNNs) exhibit great success on many tasks with the help of large-scale well annotated datasets.
However, labeling large-scale data can be very costly and error-prone so that it is difficult to guarantee the annotation quality.
We propose a Temporal Calibrated Regularization (TCR) in which we utilize the original labels and the predictions in the previous epoch together.
arXiv Detail & Related papers (2020-07-01T04:48:49Z) - Labelling unlabelled videos from scratch with multi-modal
self-supervision [82.60652426371936]
unsupervised labelling of a video dataset does not come for free from strong feature encoders.
We propose a novel clustering method that allows pseudo-labelling of a video dataset without any human annotations.
An extensive analysis shows that the resulting clusters have high semantic overlap to ground truth human labels.
arXiv Detail & Related papers (2020-06-24T12:28:17Z) - 3D medical image segmentation with labeled and unlabeled data using
autoencoders at the example of liver segmentation in CT images [58.720142291102135]
This work investigates the potential of autoencoder-extracted features to improve segmentation with a convolutional neural network.
A convolutional autoencoder was used to extract features from unlabeled data and a multi-scale, fully convolutional CNN was used to perform the target task of 3D liver segmentation in CT images.
arXiv Detail & Related papers (2020-03-17T20:20:43Z) - VideoSSL: Semi-Supervised Learning for Video Classification [30.348819309923098]
We propose a semi-supervised learning approach for video classification, VideoSSL, using convolutional neural networks (CNN)
To minimize the dependence on a large annotated dataset, our proposed method trains from a small number of labeled examples.
We show that, under the supervision of these guiding signals from unlabeled examples, a video classification CNN can achieve impressive performances.
arXiv Detail & Related papers (2020-02-29T07:13:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.