Empirical Models for Multidimensional Regression of Fission Systems
- URL: http://arxiv.org/abs/2105.14645v1
- Date: Sun, 30 May 2021 22:53:39 GMT
- Title: Empirical Models for Multidimensional Regression of Fission Systems
- Authors: Akshay J. Dave (1), Jiankai Yu (1), Jarod Wilson (1), Bren Phillips
(1), Kaichao Sun (1), Benoit Forget (1) ((1) Massachusetts Institute of
Technology)
- Abstract summary: We develop guidelines for developing empirical models for multidimensional regression of neutron transport.
An assessment of the accuracy and precision finds that the SVR, followed closely by ANN, performs the best.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The development of next-generation autonomous control of fission systems,
such as nuclear power plants, will require leveraging advancements in machine
learning. For fission systems, accurate prediction of nuclear transport is
important to quantify the safety margin and optimize performance. The
state-of-the-art approach to this problem is costly Monte Carlo (MC)
simulations to approximate solutions of the neutron transport equation. Such an
approach is feasible for offline calculations e.g., for design or licensing,
but is precluded from use as a model-based controller. In this work, we explore
the use of Artificial Neural Networks (ANN), Gradient Boosting Regression
(GBR), Gaussian Process Regression (GPR) and Support Vector Regression (SVR) to
generate empirical models. The empirical model can then be deployed, e.g., in a
model predictive controller. Two fission systems are explored: the subcritical
MIT Graphite Exponential Pile (MGEP), and the critical MIT Research Reactor
(MITR).
Findings from this work establish guidelines for developing empirical models
for multidimensional regression of neutron transport. An assessment of the
accuracy and precision finds that the SVR, followed closely by ANN, performs
the best. For both MGEP and MITR, the optimized SVR model exhibited a
domain-averaged, test, mean absolute percentage error of 0.17 %. A spatial
distribution of performance metrics indicates that physical regions of poor
performance coincide with locations of largest neutron flux perturbation --
this outcome is mitigated by ANN and SVR. Even at local maxima, ANN and SVR
bias is within experimental uncertainty bounds. A comparison of the performance
vs. training dataset size found that SVR is more data-efficient than ANN. Both
ANN and SVR achieve a greater than 7 order reduction in evaluation time vs. a
MC simulation.
Related papers
- An Investigation on Machine Learning Predictive Accuracy Improvement and Uncertainty Reduction using VAE-based Data Augmentation [2.517043342442487]
Deep generative learning uses certain ML models to learn the underlying distribution of existing data and generate synthetic samples that resemble the real data.
In this study, our objective is to evaluate the effectiveness of data augmentation using variational autoencoder (VAE)-based deep generative models.
We investigated whether the data augmentation leads to improved accuracy in the predictions of a deep neural network (DNN) model trained using the augmented data.
arXiv Detail & Related papers (2024-10-24T18:15:48Z) - A Causal Graph-Enhanced Gaussian Process Regression for Modeling Engine-out NOx [0.0]
The objective of this paper is to develop and validate a probabilistic model to predict engine-out NOx emissions using Gaussian process regression.
We employ three variants of Gaussian process models: the first with a standard radial basis function kernel with input window, the second incorporating a deep kernel using convolutional neural networks to capture temporal dependencies, and the third enriching the deep kernel with a causal graph derived via graph convolutional networks.
All models are compared against a virtual ECM sensor using both quantitative and qualitative metrics. We conclude that our model provides an improvement in predictive performance when using an input window and a deep kernel structure.
arXiv Detail & Related papers (2024-10-24T04:23:57Z) - Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation [49.49868273653921]
Diffusion models are promising for joint trajectory prediction and controllable generation in autonomous driving.
We introduce Optimal Gaussian Diffusion (OGD) and Estimated Clean Manifold (ECM) Guidance.
Our methodology streamlines the generative process, enabling practical applications with reduced computational overhead.
arXiv Detail & Related papers (2024-08-01T17:59:59Z) - Investigating the Robustness of Counterfactual Learning to Rank Models: A Reproducibility Study [61.64685376882383]
Counterfactual learning to rank (CLTR) has attracted extensive attention in the IR community for its ability to leverage massive logged user interaction data to train ranking models.
This paper investigates the robustness of existing CLTR models in complex and diverse situations.
We find that the DLA models and IPS-DCM show better robustness under various simulation settings than IPS-PBM and PRS with offline propensity estimation.
arXiv Detail & Related papers (2024-04-04T10:54:38Z) - Complexity Matters: Rethinking the Latent Space for Generative Modeling [65.64763873078114]
In generative modeling, numerous successful approaches leverage a low-dimensional latent space, e.g., Stable Diffusion.
In this study, we aim to shed light on this under-explored topic by rethinking the latent space from the perspective of model complexity.
arXiv Detail & Related papers (2023-07-17T07:12:29Z) - Fitting a Directional Microstructure Model to Diffusion-Relaxation MRI
Data with Self-Supervised Machine Learning [2.8167227950959206]
Self-supervised machine learning is emerging as an attractive alternative to supervised learning.
In this paper, we demonstrate self-supervised machine learning model fitting for a directional microstructural model.
Our approach shows clear improvements in parameter estimation and computational time, compared to standard non-linear least squares fitting.
arXiv Detail & Related papers (2022-10-05T15:51:39Z) - Real-to-Sim: Predicting Residual Errors of Robotic Systems with Sparse
Data using a Learning-based Unscented Kalman Filter [65.93205328894608]
We learn the residual errors between a dynamic and/or simulator model and the real robot.
We show that with the learned residual errors, we can further close the reality gap between dynamic models, simulations, and actual hardware.
arXiv Detail & Related papers (2022-09-07T15:15:12Z) - Value-Consistent Representation Learning for Data-Efficient
Reinforcement Learning [105.70602423944148]
We propose a novel method, called value-consistent representation learning (VCR), to learn representations that are directly related to decision-making.
Instead of aligning this imagined state with a real state returned by the environment, VCR applies a $Q$-value head on both states and obtains two distributions of action values.
It has been demonstrated that our methods achieve new state-of-the-art performance for search-free RL algorithms.
arXiv Detail & Related papers (2022-06-25T03:02:25Z) - Quantum-Assisted Support Vector Regression for Detecting Facial
Landmarks [0.0]
We devise algorithms, namely simulated and quantum-classical hybrid, for training two SVR models.
We compare their empirical performances against the SVR implementation of Python's scikit-learn package.
Our work is a proof-of-concept example for applying quantu-assisted SVR to a supervised learning task.
arXiv Detail & Related papers (2021-11-17T18:57:10Z) - Flow based features and validation metric for machine learning
reconstruction of PIV data [0.0]
Reconstruction of flow field from real data by a physics-oriented approach is a current challenge for fluid scientists in the AI community.
The present article applies machine learning approach to study contribution of different flow-based features.
A metric is proposed that reflects mass conservation law as an important requirement for a physical flow reproduction.
arXiv Detail & Related papers (2021-05-27T20:05:41Z) - ANNETTE: Accurate Neural Network Execution Time Estimation with Stacked
Models [56.21470608621633]
We propose a time estimation framework to decouple the architectural search from the target hardware.
The proposed methodology extracts a set of models from micro- kernel and multi-layer benchmarks and generates a stacked model for mapping and network execution time estimation.
We compare estimation accuracy and fidelity of the generated mixed models, statistical models with the roofline model, and a refined roofline model for evaluation.
arXiv Detail & Related papers (2021-05-07T11:39:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.