Multitask machine learning of collective variables for enhanced sampling
of rare events
- URL: http://arxiv.org/abs/2012.03909v1
- Date: Mon, 7 Dec 2020 18:40:18 GMT
- Title: Multitask machine learning of collective variables for enhanced sampling
of rare events
- Authors: Lixin Sun, Jonathan Vandermause, Simon Batzner, Yu Xie, David Clark,
Wei Chen, Boris Kozinsky
- Abstract summary: A data-driven machine learning algorithm is devised to learn collective variables with a neural network.
The resulting latent space is shown to be an effective low-dimensional representation.
This approach is successfully applied to model systems including a 5D M"uller Brown model, a 5D three-well model, and alanine dipeptide in vacuum.
- Score: 9.632096602077919
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Computing accurate reaction rates is a central challenge in computational
chemistry and biology because of the high cost of free energy estimation with
unbiased molecular dynamics. In this work, a data-driven machine learning
algorithm is devised to learn collective variables with a multitask neural
network, where a common upstream part reduces the high dimensionality of atomic
configurations to a low dimensional latent space, and separate downstream parts
map the latent space to predictions of basin class labels and potential
energies. The resulting latent space is shown to be an effective
low-dimensional representation, capturing the reaction progress and guiding
effective umbrella sampling to obtain accurate free energy landscapes. This
approach is successfully applied to model systems including a 5D M\"uller Brown
model, a 5D three-well model, and alanine dipeptide in vacuum. This approach
enables automated dimensionality reduction for energy controlled reactions in
complex systems, offers a unified framework that can be trained with limited
data, and outperforms single-task learning approaches, including autoencoders.
Related papers
- WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks [7.775894876221921]
We introduce a data augmentation method based on a novel GAN architecture called wasteGAN.
The proposed method allows to increase the performance of semantic segmentation models, starting from a very limited bunch of labeled examples.
We then leverage the higher-quality segmentation masks predicted from models trained on the wasteGAN synthetic data to compute semantic-aware grasp poses.
arXiv Detail & Related papers (2024-09-25T15:04:21Z) - Quanv4EO: Empowering Earth Observation by means of Quanvolutional Neural Networks [62.12107686529827]
This article highlights a significant shift towards leveraging quantum computing techniques in processing large volumes of remote sensing data.
The proposed Quanv4EO model introduces a quanvolution method for preprocessing multi-dimensional EO data.
Key findings suggest that the proposed model not only maintains high precision in image classification but also shows improvements of around 5% in EO use cases.
arXiv Detail & Related papers (2024-07-24T09:11:34Z) - Clustering Molecular Energy Landscapes by Adaptive Network Embedding [2.676713226382288]
We present a data driven approach for clustering potential energy landscapes of molecular structures.
We also incorporate an entropy sensitive adaptive scheme for hierarchical sampling of the energy landscape.
We demonstrate the framework through Lennard-Jones clusters and a human DNA sequence.
arXiv Detail & Related papers (2024-01-19T17:12:07Z) - Determination of droplet size from wide-angle light scattering image
data using convolutional neural networks [0.0]
We introduce a fully automatic machine learning-based approach that employs convolutional neural networks (CNNs) in order to streamline the droplet sizing process.
We consider WALS data from an ethanol spray flame process at various heights above the burner surface (HABs)
The models are trained and cross-validated on a large dataset comprising nearly 35000 WALS images.
arXiv Detail & Related papers (2023-11-03T18:05:47Z) - Value function estimation using conditional diffusion models for control [62.27184818047923]
We propose a simple algorithm called Diffused Value Function (DVF)
It learns a joint multi-step model of the environment-robot interaction dynamics using a diffusion model.
We show how DVF can be used to efficiently capture the state visitation measure for multiple controllers.
arXiv Detail & Related papers (2023-06-09T18:40:55Z) - GANs and Closures: Micro-Macro Consistency in Multiscale Modeling [0.0]
We present an approach that couples physics-based simulations and biasing methods for sampling conditional distributions with Machine Learning-based conditional generative adversarial networks.
We show that this framework can improve multiscale SDE dynamical systems sampling, and even shows promise for systems of increasing complexity.
arXiv Detail & Related papers (2022-08-23T03:45:39Z) - Improved Speech Emotion Recognition using Transfer Learning and
Spectrogram Augmentation [56.264157127549446]
Speech emotion recognition (SER) is a challenging task that plays a crucial role in natural human-computer interaction.
One of the main challenges in SER is data scarcity.
We propose a transfer learning strategy combined with spectrogram augmentation.
arXiv Detail & Related papers (2021-08-05T10:39:39Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z) - Evidential Sparsification of Multimodal Latent Spaces in Conditional
Variational Autoencoders [63.46738617561255]
We consider the problem of sparsifying the discrete latent space of a trained conditional variational autoencoder.
We use evidential theory to identify the latent classes that receive direct evidence from a particular input condition and filter out those that do not.
Experiments on diverse tasks, such as image generation and human behavior prediction, demonstrate the effectiveness of our proposed technique.
arXiv Detail & Related papers (2020-10-19T01:27:21Z) - Ensemble Learning of Coarse-Grained Molecular Dynamics Force Fields with
a Kernel Approach [2.562811344441631]
Gradient-domain machine learning (GDML) is an accurate and efficient approach to learn a molecular potential and associated force field.
We demonstrate its application to learn an effective coarse-grained (CG) model from all-atom simulation data.
Using ensemble learning and stratified sampling, we propose a data-efficient and memory-saving alternative.
arXiv Detail & Related papers (2020-05-04T21:20:01Z) - SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier
Detection [63.253850875265115]
Outlier detection (OD) is a key machine learning (ML) task for identifying abnormal objects from general samples.
We propose a modular acceleration system, called SUOD, to address it.
arXiv Detail & Related papers (2020-03-11T00:22:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.