Cross-tokamak Disruption Prediction based on Physics-Guided Feature
Extraction and domain adaptation
- URL: http://arxiv.org/abs/2309.05361v2
- Date: Wed, 1 Nov 2023 09:18:35 GMT
- Title: Cross-tokamak Disruption Prediction based on Physics-Guided Feature
Extraction and domain adaptation
- Authors: Chengshuo Shen, Wei Zheng, Bihao Guo, Yonghua Ding, Dalong Chen,
Xinkun Ai, Fengming Xue, Yu Zhong, Nengchao Wang, Biao Shen, Binjia Xiao,
Zhongyong Chen, Yuan Pan and J-TEXT team
- Abstract summary: In this paper, we demonstrate a novel approach to predict disruption in a future tokamak using only a few discharges.
The first step is to use the existing understanding of physics to extract physics-guided features from the diagnostic signals of each tokamak.
The second step is to align a few data from the future tokamak and a large amount of data from existing tokamak.
- Score: 3.0854960284180133
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The high acquisition cost and the significant demand for disruptive
discharges for data-driven disruption prediction models in future tokamaks pose
an inherent contradiction in disruption prediction research. In this paper, we
demonstrated a novel approach to predict disruption in a future tokamak using
only a few discharges. The first step is to use the existing understanding of
physics to extract physics-guided features from the diagnostic signals of each
tokamak, called physics-guided feature extraction (PGFE). The second step is to
align a few data from the future tokamak (target domain) and a large amount of
data from existing tokamak (source domain) based on a domain adaptation
algorithm called CORrelation ALignment (CORAL). It is the first attempt at
applying domain adaptation in the task of disruption prediction. PGFE has been
successfully applied in J-TEXT to predict disruption with excellent
performance. PGFE can also reduce the data volume requirements due to
extracting the less device-specific features, thereby establishing a solid
foundation for cross-tokamak disruption prediction. We have further improved
CORAL (supervised CORAL, S-CORAL) to enhance its appropriateness in feature
alignment for the disruption prediction task. To simulate the existing and
future tokamak case, we selected J-TEXT as the existing tokamak and EAST as the
future tokamak, which has a large gap in the ranges of plasma parameters. The
utilization of the S-CORAL improves the disruption prediction performance on
future tokamak. Through interpretable analysis, we discovered that the learned
knowledge of the disruption prediction model through this approach exhibits
more similarities to the model trained on large data volumes of future tokamak.
Related papers
- AGMA: Adaptive Gaussian Mixture Anchors for Prior-Guided Multimodal Human Trajectory Forecasting [28.338648039645822]
We propose AGMA (Adaptive Gaussian Mixture Anchors), which constructs expressive priors through two stages.<n>Experiments on ETH-UCY, Stanford Drone, and JRDB datasets demonstrate that AGMA achieves state-of-the-art performance.
arXiv Detail & Related papers (2026-02-04T04:42:57Z) - Temporal Continual Learning with Prior Compensation for Human Motion Prediction [32.764096325603994]
Human Motion Prediction (HMP) aims to predict future poses at different moments according to past motion sequences.<n>Previous approaches have treated the prediction of various moments equally, resulting in two main limitations.<n>We introduce a novel multi-stage training framework called Temporal Continual Learning (TCL) to address the above challenges.
arXiv Detail & Related papers (2025-07-05T14:48:30Z) - Future-Guided Learning: A Predictive Approach To Enhance Time-Series Forecasting [4.866362841501992]
We introduce Future-Guided Learning, an approach that enhances time-series event forecasting.
Our approach involves two models: a detection model that analyzes future data to identify critical events and a forecasting model that predicts these events based on present data.
When discrepancies arise between the forecasting and detection models, the forecasting model undergoes more substantial updates.
arXiv Detail & Related papers (2024-10-19T21:22:55Z) - A Causally Informed Pretraining Approach for Multimodal Foundation Models: Applications in Remote Sensing [16.824262496666893]
Self-supervised learning has emerged as a powerful paradigm for pretraining foundation models using large-scale data.
We propose Causally Informed Variable-Step Forecasting (CI-VSF), a novel pretraining task that models forecasting as a conditional generation task.
We demonstrate that pretraining in such a fashion leads to enhanced performance when finetuned on both prediction and forecasting.
arXiv Detail & Related papers (2024-07-29T02:49:55Z) - F-FOMAML: GNN-Enhanced Meta-Learning for Peak Period Demand Forecasting with Proxy Data [65.6499834212641]
We formulate the demand prediction as a meta-learning problem and develop the Feature-based First-Order Model-Agnostic Meta-Learning (F-FOMAML) algorithm.
By considering domain similarities through task-specific metadata, our model improved generalization, where the excess risk decreases as the number of training tasks increases.
Compared to existing state-of-the-art models, our method demonstrates a notable improvement in demand prediction accuracy, reducing the Mean Absolute Error by 26.24% on an internal vending machine dataset and by 1.04% on the publicly accessible JD.com dataset.
arXiv Detail & Related papers (2024-06-23T21:28:50Z) - Fusion of Movement and Naive Predictions for Point Forecasting in Univariate Random Walks [6.935130578959931]
Method is derived from a variant definition of random walks, where the random error term for the future value is expressed as a positive random error multiplied by a direction sign.
It reliably outperforms naive forecasts with moderate movement prediction accuracies, such as 0.55.
Method is particularly advantageous when accurate point predictions are challenging but accurate movement predictions are attainable.
arXiv Detail & Related papers (2024-06-20T16:32:18Z) - GVFs in the Real World: Making Predictions Online for Water Treatment [23.651798878534635]
We investigate the use of reinforcement-learning based prediction approaches for a real drinking-water treatment plant.
We first describe this dataset, and highlight challenges with seasonality, nonstationarity, partial observability.
We show the importance of learning in deployment, by comparing a TD agent trained purely offline with no online updating to a TD agent that learns online.
arXiv Detail & Related papers (2023-12-04T04:49:10Z) - Performative Time-Series Forecasting [71.18553214204978]
We formalize performative time-series forecasting (PeTS) from a machine-learning perspective.
We propose a novel approach, Feature Performative-Shifting (FPS), which leverages the concept of delayed response to anticipate distribution shifts.
We conduct comprehensive experiments using multiple time-series models on COVID-19 and traffic forecasting tasks.
arXiv Detail & Related papers (2023-10-09T18:34:29Z) - EquiDiff: A Conditional Equivariant Diffusion Model For Trajectory
Prediction [11.960234424309265]
We propose EquiDiff, a deep generative model for predicting future vehicle trajectories.
EquiDiff is based on the conditional diffusion model, which generates future trajectories by incorporating historical information and random Gaussian noise.
Our results demonstrate that EquiDiff outperforms other baseline models in short-term prediction, but has slightly higher errors for long-term prediction.
arXiv Detail & Related papers (2023-08-12T13:17:09Z) - Prediction-Oriented Bayesian Active Learning [51.426960808684655]
Expected predictive information gain (EPIG) is an acquisition function that measures information gain in the space of predictions rather than parameters.
EPIG leads to stronger predictive performance compared with BALD across a range of datasets and models.
arXiv Detail & Related papers (2023-04-17T10:59:57Z) - Transferable Cross-Tokamak Disruption Prediction with Deep Hybrid Neural
Network Feature Extractor [4.736144727269566]
Future tokamaks can hardly tolerate disruptions at high performance discharge.
A machine learning method capable of transferring a disruption prediction model trained on one tokamak to another is required.
arXiv Detail & Related papers (2022-08-20T03:29:10Z) - Domain-Adjusted Regression or: ERM May Already Learn Features Sufficient
for Out-of-Distribution Generalization [52.7137956951533]
We argue that devising simpler methods for learning predictors on existing features is a promising direction for future research.
We introduce Domain-Adjusted Regression (DARE), a convex objective for learning a linear predictor that is provably robust under a new model of distribution shift.
Under a natural model, we prove that the DARE solution is the minimax-optimal predictor for a constrained set of test distributions.
arXiv Detail & Related papers (2022-02-14T16:42:16Z) - Scenario adaptive disruption prediction study for next generation
burning-plasma tokamaks [0.0]
Next generation high performance (HP) tokamaks risk damage from unmitigated disruptions at high current and power.
We demonstrate how the operational regimes of tokamaks can affect the power of a trained disruption predictor.
arXiv Detail & Related papers (2021-09-18T15:48:02Z) - Back2Future: Leveraging Backfill Dynamics for Improving Real-time
Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z) - Learning Output Embeddings in Structured Prediction [73.99064151691597]
A powerful and flexible approach to structured prediction consists in embedding the structured objects to be predicted into a feature space of possibly infinite dimension.
A prediction in the original space is computed by solving a pre-image problem.
In this work, we propose to jointly learn a finite approximation of the output embedding and the regression function into the new feature space.
arXiv Detail & Related papers (2020-07-29T09:32:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.