Achieving 100X faster simulations of complex biological phenomena by
coupling ML to HPC ensembles
- URL: http://arxiv.org/abs/2104.04797v1
- Date: Sat, 10 Apr 2021 15:52:39 GMT
- Title: Achieving 100X faster simulations of complex biological phenomena by
coupling ML to HPC ensembles
- Authors: Alexander Brace, Hyungro Lee, Heng Ma, Anda Trifan, Matteo Turilli,
Igor Yaskushin, Todd Munson, Ian Foster, Shantenu Jha and Arvind Ramanathan
- Abstract summary: We present DeepDriveMD, a tool for a range of prototypical ML-driven HPC simulation scenarios.
We use it to quantify improvements in the scientific performance of ML-driven ensemble-based applications.
- Score: 47.44377051031385
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The use of ML methods to dynamically steer ensemble-based simulations
promises significant improvements in the performance of scientific
applications. We present DeepDriveMD, a tool for a range of prototypical
ML-driven HPC simulation scenarios, and use it to quantify improvements in the
scientific performance of ML-driven ensemble-based applications. We discuss its
design and characterize its performance. Motivated by the potential for further
scientific improvements and applicability to more sophisticated physical
systems, we extend the design of DeepDriveMD to support stream-based
communication between simulations and learning methods. It demonstrates a 100x
speedup to fold proteins, and performs 1.6x more simulations per unit time,
improving resource utilization compared to the sequential framework.
Experiments are performed on leadership-class platforms, at scales of up to
O(1000) nodes, and for production workloads. We establish DeepDriveMD as a
high-performance framework for ML-driven HPC simulation scenarios, that
supports diverse simulation and ML back-ends, and which enables new scientific
insights by improving length- and time-scale accessed.
Related papers
- A Multi-Grained Symmetric Differential Equation Model for Learning Protein-Ligand Binding Dynamics [73.35846234413611]
In drug discovery, molecular dynamics (MD) simulation provides a powerful tool for predicting binding affinities, estimating transport properties, and exploring pocket sites.
We propose NeuralMD, the first machine learning (ML) surrogate that can facilitate numerical MD and provide accurate simulations in protein-ligand binding dynamics.
We demonstrate the efficiency and effectiveness of NeuralMD, achieving over 1K$times$ speedup compared to standard numerical MD simulations.
arXiv Detail & Related papers (2024-01-26T09:35:17Z) - Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous
Driving Research [76.93956925360638]
Waymax is a new data-driven simulator for autonomous driving in multi-agent scenes.
It runs entirely on hardware accelerators such as TPUs/GPUs and supports in-graph simulation for training.
We benchmark a suite of popular imitation and reinforcement learning algorithms with ablation studies on different design decisions.
arXiv Detail & Related papers (2023-10-12T20:49:15Z) - Fully Convolutional Generative Machine Learning Method for Accelerating
Non-Equilibrium Greens Function Simulations [0.0879626117219674]
This work describes a novel simulation approach that combines machine learning and device modelling simulations.
We have named our new simulation approach ML-NEGF and we have implemented it in our in-house simulator called NESS.
arXiv Detail & Related papers (2023-09-17T20:43:54Z) - In Situ Framework for Coupling Simulation and Machine Learning with
Application to CFD [51.04126395480625]
Recent years have seen many successful applications of machine learning (ML) to facilitate fluid dynamic computations.
As simulations grow, generating new training datasets for traditional offline learning creates I/O and storage bottlenecks.
This work offers a solution by simplifying this coupling and enabling in situ training and inference on heterogeneous clusters.
arXiv Detail & Related papers (2023-06-22T14:07:54Z) - Forces are not Enough: Benchmark and Critical Evaluation for Machine
Learning Force Fields with Molecular Simulations [5.138982355658199]
Molecular dynamics (MD) simulation techniques are widely used for various natural science applications.
We benchmark a collection of state-of-the-art (SOTA) ML FF models and illustrate, in particular, how the commonly benchmarked force accuracy is not well aligned with relevant simulation metrics.
arXiv Detail & Related papers (2022-10-13T17:59:03Z) - Deep Bayesian Active Learning for Accelerating Stochastic Simulation [74.58219903138301]
Interactive Neural Process (INP) is a deep active learning framework for simulations and with active learning approaches.
For active learning, we propose a novel acquisition function, Latent Information Gain (LIG), calculated in the latent space of NP based models.
The results demonstrate STNP outperforms the baselines in the learning setting and LIG achieves the state-of-the-art for active learning.
arXiv Detail & Related papers (2021-06-05T01:31:51Z) - Using Machine Learning at Scale in HPC Simulations with SmartSim: An
Application to Ocean Climate Modeling [52.77024349608834]
We demonstrate the first climate-scale, numerical ocean simulations improved through distributed, online inference of Deep Neural Networks (DNN) using SmartSim.
SmartSim is a library dedicated to enabling online analysis and Machine Learning (ML) for traditional HPC simulations.
arXiv Detail & Related papers (2021-04-13T19:27:28Z) - Integrating Machine Learning with HPC-driven Simulations for Enhanced
Student Learning [0.0]
We develop a web application that supports both HPC-driven simulation and the ML surrogate methods to produce simulation outputs.
The evaluation of the tool via in-classroom student feedback and surveys shows that the ML-enhanced tool provides a dynamic and responsive simulation environment.
arXiv Detail & Related papers (2020-08-24T22:48:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.