Transferable Cross-Tokamak Disruption Prediction with Deep Hybrid Neural
Network Feature Extractor
- URL: http://arxiv.org/abs/2208.09594v1
- Date: Sat, 20 Aug 2022 03:29:10 GMT
- Title: Transferable Cross-Tokamak Disruption Prediction with Deep Hybrid Neural
Network Feature Extractor
- Authors: Wei Zheng, Fengming Xue, Ming Zhang, Zhongyong Chen, Chengshuo Shen,
Xinkun Ai, Nengchao Wang, Dalong Chen, Bihao Guo, Yonghua Ding, Zhipeng Chen,
Zhoujun Yang, Biao Shen, Bingjia Xiao, Yuan Pan
- Abstract summary: Future tokamaks can hardly tolerate disruptions at high performance discharge.
A machine learning method capable of transferring a disruption prediction model trained on one tokamak to another is required.
- Score: 4.736144727269566
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Predicting disruptions across different tokamaks is a great obstacle to
overcome. Future tokamaks can hardly tolerate disruptions at high performance
discharge. Few disruption discharges at high performance can hardly compose an
abundant training set, which makes it difficult for current data-driven methods
to obtain an acceptable result. A machine learning method capable of
transferring a disruption prediction model trained on one tokamak to another is
required to solve the problem. The key is a disruption prediction model
containing a feature extractor that is able to extract common disruption
precursor traces in tokamak diagnostic data, and a transferable disruption
classifier. Based on the concerns above, the paper first presents a deep fusion
feature extractor designed specifically for extracting disruption precursor
features from common diagnostics on tokamaks according to currently known
precursors of disruption, providing a promising foundation for transferable
models. The fusion feature extractor is proved by comparing with manual feature
extraction on J-TEXT. Based on the feature extractor trained on J-TEXT, the
disruption prediction model was transferred to EAST data with mere 20
discharges from EAST experiment. The performance is comparable with a model
trained with 1896 discharges from EAST. From the comparison among other model
training scenarios, transfer learning showed its potential in predicting
disruptions across different tokamaks.
Related papers
- Benchmarking Few-shot Transferability of Pre-trained Models with Improved Evaluation Protocols [123.73663884421272]
Few-shot transfer has been revolutionized by stronger pre-trained models and improved adaptation algorithms.<n>We establish FEWTRANS, a comprehensive benchmark containing 10 diverse datasets.<n>By releasing FEWTRANS, we aim to provide a rigorous "ruler" to streamline reproducible advances in few-shot transfer learning research.
arXiv Detail & Related papers (2026-02-28T05:41:57Z) - Exploiting Edge Features for Transferable Adversarial Attacks in Distributed Machine Learning [54.26807397329468]
This work explores a previously overlooked vulnerability in distributed deep learning systems.<n>An adversary who intercepts the intermediate features transmitted between them can still pose a serious threat.<n>We propose an exploitation strategy specifically designed for distributed settings.
arXiv Detail & Related papers (2025-07-09T20:09:00Z) - Data Curation Matters: Model Collapse and Spurious Shift Performance Prediction from Training on Uncurated Text Embeddings [0.0]
Training models on uncurated Text Embeddings (TEs) can lead to a severe failure mode known as model collapse.<n>We introduce a set of metrics that capture the extent of model collapse, offering a new perspective on TE quality as a proxy for data curation.<n>These findings highlight the need for more nuanced curation and evaluation of embedding-based representations.
arXiv Detail & Related papers (2025-06-22T11:01:41Z) - Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers [95.22517830759193]
This paper studies the transferability of such an adversarial vulnerability from a pre-trained ViT model to downstream tasks.
We show that DTA achieves an average attack success rate (ASR) exceeding 90%, surpassing existing methods by a huge margin.
arXiv Detail & Related papers (2024-08-03T08:07:03Z) - Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples [28.947545367473086]
We propose a two-stage adversarial fine-tuning approach aimed at enhancing the robustness of downstream models.
Our experiments, conducted across ten self-supervised training methods and six datasets, demonstrate that Gen-AF attains high testing accuracy and robust testing accuracy against state-of-the-art DAEs.
arXiv Detail & Related papers (2024-03-16T04:23:46Z) - Task-customized Masked AutoEncoder via Mixture of Cluster-conditional
Experts [104.9871176044644]
Masked Autoencoder(MAE) is a prevailing self-supervised learning method that achieves promising results in model pre-training.
We propose a novel MAE-based pre-training paradigm, Mixture of Cluster-conditional Experts (MoCE)
MoCE trains each expert only with semantically relevant images by using cluster-conditional gates.
arXiv Detail & Related papers (2024-02-08T03:46:32Z) - Leveraging Diffusion-Based Image Variations for Robust Training on
Poisoned Data [26.551317580666353]
Backdoor attacks pose a serious security threat for training neural networks.
We propose a novel approach that enables model training on potentially poisoned datasets by utilizing the power of recent diffusion models.
arXiv Detail & Related papers (2023-10-10T07:25:06Z) - Cross-tokamak Disruption Prediction based on Physics-Guided Feature
Extraction and domain adaptation [3.0854960284180133]
In this paper, we demonstrate a novel approach to predict disruption in a future tokamak using only a few discharges.
The first step is to use the existing understanding of physics to extract physics-guided features from the diagnostic signals of each tokamak.
The second step is to align a few data from the future tokamak and a large amount of data from existing tokamak.
arXiv Detail & Related papers (2023-09-11T10:13:30Z) - Disruption Precursor Onset Time Study Based on Semi-supervised Anomaly
Detection [6.287037295927318]
We present a disruption prediction method based on anomaly detection that overcomes the drawbacks of unbalanced positive and negative data samples.
We optimize precursor labeling using the onset times inferred by the anomaly detection predictor and test the optimized labels on supervised learning disruption predictors.
arXiv Detail & Related papers (2023-03-27T07:54:56Z) - Robust Transferable Feature Extractors: Learning to Defend Pre-Trained
Networks Against White Box Adversaries [69.53730499849023]
We show that adversarial examples can be successfully transferred to another independently trained model to induce prediction errors.
We propose a deep learning-based pre-processing mechanism, which we refer to as a robust transferable feature extractor (RTFE)
arXiv Detail & Related papers (2022-09-14T21:09:34Z) - Self-supervised Transformer for Deepfake Detection [112.81127845409002]
Deepfake techniques in real-world scenarios require stronger generalization abilities of face forgery detectors.
Inspired by transfer learning, neural networks pre-trained on other large-scale face-related tasks may provide useful features for deepfake detection.
In this paper, we propose a self-supervised transformer based audio-visual contrastive learning method.
arXiv Detail & Related papers (2022-03-02T17:44:40Z) - Explain, Edit, and Understand: Rethinking User Study Design for
Evaluating Model Explanations [97.91630330328815]
We conduct a crowdsourcing study, where participants interact with deception detection models that have been trained to distinguish between genuine and fake hotel reviews.
We observe that for a linear bag-of-words model, participants with access to the feature coefficients during training are able to cause a larger reduction in model confidence in the testing phase when compared to the no-explanation control.
arXiv Detail & Related papers (2021-12-17T18:29:56Z) - Learning to Learn Transferable Attack [77.67399621530052]
Transfer adversarial attack is a non-trivial black-box adversarial attack that aims to craft adversarial perturbations on the surrogate model and then apply such perturbations to the victim model.
We propose a Learning to Learn Transferable Attack (LLTA) method, which makes the adversarial perturbations more generalized via learning from both data and model augmentation.
Empirical results on the widely-used dataset demonstrate the effectiveness of our attack method with a 12.85% higher success rate of transfer attack compared with the state-of-the-art methods.
arXiv Detail & Related papers (2021-12-10T07:24:21Z) - Better sampling in explanation methods can prevent dieselgate-like
deception [0.0]
Interpretability of prediction models is necessary to determine their biases and causes of errors.
Popular techniques, such as IME, LIME, and SHAP, use perturbation of instance features to explain individual predictions.
We show that the improved sampling increases the robustness of the LIME and SHAP, while previously untested method IME is already the most robust of all.
arXiv Detail & Related papers (2021-01-26T13:41:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.