BubbleML: A Multi-Physics Dataset and Benchmarks for Machine Learning
- URL: http://arxiv.org/abs/2307.14623v2
- Date: Fri, 25 Aug 2023 03:17:29 GMT
- Title: BubbleML: A Multi-Physics Dataset and Benchmarks for Machine Learning
- Authors: Sheikh Md Shakeel Hassan, Arthur Feeney, Akash Dhruv, Jihoon Kim,
Youngjoon Suh, Jaiyoung Ryu, Yoonjin Won, Aparna Chandramowlishwaran
- Abstract summary: This dataset covers a wide range of parameters, including varying gravity conditions, flow rates, sub-cooling levels, and wall superheat, comprising 79 simulations.
BubbleML is validated against experimental observations and trends, establishing it as an invaluable resource for machine learning (ML) research.
We showcase its potential to facilitate exploration of diverse downstream tasks by introducing two benchmarks: (a) optical flow analysis to capture bubble dynamics, and (b) operator networks for learning temperature dynamics.
- Score: 15.681756756338546
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the field of phase change phenomena, the lack of accessible and diverse
datasets suitable for machine learning (ML) training poses a significant
challenge. Existing experimental datasets are often restricted, with limited
availability and sparse ground truth data, impeding our understanding of this
complex multiphysics phenomena. To bridge this gap, we present the BubbleML
Dataset
\footnote{\label{git_dataset}\url{https://github.com/HPCForge/BubbleML}} which
leverages physics-driven simulations to provide accurate ground truth
information for various boiling scenarios, encompassing nucleate pool boiling,
flow boiling, and sub-cooled boiling. This extensive dataset covers a wide
range of parameters, including varying gravity conditions, flow rates,
sub-cooling levels, and wall superheat, comprising 79 simulations. BubbleML is
validated against experimental observations and trends, establishing it as an
invaluable resource for ML research. Furthermore, we showcase its potential to
facilitate exploration of diverse downstream tasks by introducing two
benchmarks: (a) optical flow analysis to capture bubble dynamics, and (b)
operator networks for learning temperature dynamics. The BubbleML dataset and
its benchmarks serve as a catalyst for advancements in ML-driven research on
multiphysics phase change phenomena, enabling the development and comparison of
state-of-the-art techniques and models.
Related papers
- Physics-Learning AI Datamodel (PLAID) datasets: a collection of physics simulations for machine learning [0.15469999759898032]
PLAID is a framework for representing and sharing datasets of physics simulations.<n> PLAID defines a unified standard for describing simulation data.<n>We release six datasets under the PLAID standard, covering structural mechanics and computational fluid dynamics.
arXiv Detail & Related papers (2025-05-05T18:59:17Z) - Robust Multi-View Learning via Representation Fusion of Sample-Level Attention and Alignment of Simulated Perturbation [61.64052577026623]
Real-world multi-view datasets are often heterogeneous and imperfect.
We propose a novel robust MVL method (namely RML) with simultaneous representation fusion and alignment.
In experiments, we employ it in unsupervised multi-view clustering, noise-label classification, and as a plug-and-play module for cross-modal hashing retrieval.
arXiv Detail & Related papers (2025-03-06T07:01:08Z) - HyperFLINT: Hypernetwork-based Flow Estimation and Temporal Interpolation for Scientific Ensemble Visualization [26.472939569860607]
HyperFLINT is a novel deep learning-based approach for estimating flow fields, temporally interpolating fields, and facilitating parameter space exploration in ensemble data.
A series of experiments demonstrates HyperFLINT's significantly improved performance in flow field estimation, as well as its potential in enabling parameter space exploration.
arXiv Detail & Related papers (2024-12-05T12:01:20Z) - The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning [4.812580392361432]
Well is a large-scale collection of numerical simulations of a wide variety of physical systems.
These datasets can be used individually or as part of a broader benchmark suite.
We provide a unified PyTorch interface for training and evaluating models.
arXiv Detail & Related papers (2024-11-30T19:42:14Z) - FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data [64.50893177169996]
Fine-tuning Multimodal Large Language Models (MLLMs) with Federated Learning (FL) allows for expanding the training data scope by including private data sources.
We introduce a benchmark for evaluating various downstream tasks in the federated fine-tuning of MLLMs within multimodal heterogeneous scenarios.
We develop a general FedMLLM framework that integrates four representative FL methods alongside two modality-agnostic strategies.
arXiv Detail & Related papers (2024-11-22T04:09:23Z) - Learning Pore-scale Multi-phase Flow from Experimental Data with Graph Neural Network [2.2101344151283944]
Current numerical models are often incapable of accurately capturing the complex pore-scale physics observed in experiments.
We propose a graph neural network-based approach and directly learn pore-scale fluid flow using micro-CT experimental data.
arXiv Detail & Related papers (2024-11-21T15:01:17Z) - The Sound of Water: Inferring Physical Properties from Pouring Liquids [85.30865788636386]
We study the connection between audio-visual observations and the underlying physics of pouring liquids.
Our objective is to automatically infer physical properties such as the liquid level, the shape and size of the container, the pouring rate and the time to fill.
arXiv Detail & Related papers (2024-11-18T01:19:37Z) - Recent Advances on Machine Learning for Computational Fluid Dynamics: A Survey [51.87875066383221]
This paper introduces fundamental concepts, traditional methods, and benchmark datasets, then examine the various roles Machine Learning plays in improving CFD.
We highlight real-world applications of ML for CFD in critical scientific and engineering disciplines, including aerodynamics, combustion, atmosphere & ocean science, biology fluid, plasma, symbolic regression, and reduced order modeling.
We draw the conclusion that ML is poised to significantly transform CFD research by enhancing simulation accuracy, reducing computational time, and enabling more complex analyses of fluid dynamics.
arXiv Detail & Related papers (2024-08-22T07:33:11Z) - MMSci: A Dataset for Graduate-Level Multi-Discipline Multimodal Scientific Understanding [59.41495657570397]
This dataset includes figures such as schematic diagrams, simulated images, macroscopic/microscopic photos, and experimental visualizations.
We developed benchmarks for scientific figure captioning and multiple-choice questions, evaluating six proprietary and over ten open-source models.
The dataset and benchmarks will be released to support further research.
arXiv Detail & Related papers (2024-07-06T00:40:53Z) - Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - Accurate machine learning force fields via experimental and simulation
data fusion [0.0]
Machine Learning (ML)-based force fields are attracting ever-increasing interest due to their capacity to span scales of classical interatomic potentials at quantum-level accuracy.
Here we leverage both Density Functional Theory (DFT) calculations and experimentally measured mechanical properties and lattice parameters to train an ML potential of titanium.
We demonstrate that the fused data learning strategy can concurrently satisfy all target objectives, thus resulting in a molecular model of higher accuracy compared to the models trained with a single source data.
arXiv Detail & Related papers (2023-08-17T18:22:19Z) - Data-Efficient Learning via Minimizing Hyperspherical Energy [48.47217827782576]
This paper considers the problem of data-efficient learning from scratch using a small amount of representative data.
We propose a MHE-based active learning (MHEAL) algorithm, and provide comprehensive theoretical guarantees for MHEAL.
arXiv Detail & Related papers (2022-06-30T11:39:12Z) - Physics-informed machine learning with differentiable programming for
heterogeneous underground reservoir pressure management [64.17887333976593]
Avoiding over-pressurization in subsurface reservoirs is critical for applications like CO2 sequestration and wastewater injection.
Managing the pressures by controlling injection/extraction are challenging because of complex heterogeneity in the subsurface.
We use differentiable programming with a full-physics model and machine learning to determine the fluid extraction rates that prevent over-pressurization.
arXiv Detail & Related papers (2022-06-21T20:38:13Z) - Prediction of liquid fuel properties using machine learning models with
Gaussian processes and probabilistic conditional generative learning [56.67751936864119]
The present work aims to construct cheap-to-compute machine learning (ML) models to act as closure equations for predicting the physical properties of alternative fuels.
Those models can be trained using the database from MD simulations and/or experimental measurements in a data-fusion-fidelity approach.
The results show that ML models can predict accurately the fuel properties of a wide range of pressure and temperature conditions.
arXiv Detail & Related papers (2021-10-18T14:43:50Z) - Inferring micro-bubble dynamics with physics-informed deep learning [0.0]
Multiphase flow simulation requires high accuracy due to possible component losses that may be caused by sparse meshing during the computation.
We propose a novel deep learning framework BubbleNet, which entails three main parts: deep neural networks (DNN) with sub nets for predicting different physics fields.
Results indicate our framework can predict the physics fields more accurately, estimating the absolute prediction errors.
arXiv Detail & Related papers (2021-05-15T09:17:56Z) - Deep Learning Interfacial Momentum Closures in Coarse-Mesh CFD Two-Phase
Flow Simulation Using Validation Data [5.099083753474628]
Feature-similarity measurement (FSM) is developed and applied to improve the simulation capability of two-phase flow with coarse-mesh CFD approach.
FSM can substantially improve the prediction of the coarse-mesh CFD model, regardless of the choice of interfacial closures.
arXiv Detail & Related papers (2020-05-07T21:25:22Z) - Using Deep Learning to Explore Local Physical Similarity for
Global-scale Bridging in Thermal-hydraulic Simulation [4.350727579753697]
Current system thermal-hydraulic codes have limited credibility in simulating real plant conditions.
This paper proposes a data-driven approach, Feature Similarity Measurement FFSM, to overcome these difficulties.
Deep learning is applied to construct and explore the relationship between the local physical features and simulation errors.
arXiv Detail & Related papers (2020-01-06T20:14:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.