Related papers: Automating MD simulations for Proteins using Large language Models: NAMD-Agent

Automating MD simulations for Proteins using Large language Models: NAMD-Agent

URL: http://arxiv.org/abs/2507.07887v1
Date: Thu, 10 Jul 2025 16:17:40 GMT
Title: Automating MD simulations for Proteins using Large language Models: NAMD-Agent
Authors: Achuth Chandrasekhar, Amir Barati Farimani,
Abstract summary: We introduce an automated pipeline that leverages Large Language Models (LLMs), specifically Gemini 2.0 Flash, in conjunction with python scripting and Selenium based web automation.<n>The pipeline exploits CHARMM GUI's comprehensive web-based interface for preparing simulation-ready inputs for NAMD.<n>Results demonstrate that this approach reduces setup time, minimizes manual errors, and offers a scalable solution for handling multiple protein systems in parallel.
Score: 9.339909188265333
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Molecular dynamics simulations are an essential tool in understanding protein structure, dynamics, and function at the atomic level. However, preparing high quality input files for MD simulations can be a time consuming and error prone process. In this work, we introduce an automated pipeline that leverages Large Language Models (LLMs), specifically Gemini 2.0 Flash, in conjunction with python scripting and Selenium based web automation to streamline the generation of MD input files. The pipeline exploits CHARMM GUI's comprehensive web-based interface for preparing simulation-ready inputs for NAMD. By integrating Gemini's code generation and iterative refinement capabilities, simulation scripts are automatically written, executed, and revised to navigate CHARMM GUI, extract appropriate parameters, and produce the required NAMD input files. Post processing is performed using additional software to further refine the simulation outputs, thereby enabling a complete and largely hands free workflow. Our results demonstrate that this approach reduces setup time, minimizes manual errors, and offers a scalable solution for handling multiple protein systems in parallel. This automated framework paves the way for broader application of LLMs in computational structural biology, offering a robust and adaptable platform for future developments in simulation automation.

Related papers

chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations [0.6240840318920522]
We present chemtrain-deploy, a framework that enables model-agnostic deployment of LAMMPS in MD simulations.<n>Chemtrain-deploy supports any JAX-defined semi-local potential, allowing users to exploit the functionality of LAMMPS.<n>It achieves state-of-the-art efficiency and scales to systems containing millions of atoms.
arXiv Detail & Related papers (2025-06-04T15:19:26Z)
MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation [1.729730091778761]
This paper proposes an automated solution framework, MooseAgent, for the multi-physics simulation framework MOOSE.<n>MooseAgent combines large-scale pre-trained language models (LLMs) with a multi-agent system.<n>Results show that MooseAgent can automate the MOOSE simulation process to a certain extent.
arXiv Detail & Related papers (2025-04-11T15:25:50Z)
MDCrow: Automating Molecular Dynamics Workflows with Large Language Models [0.6130124744675498]
We introduce MDCrow, an agentic LLM assistant capable of automating Molecular dynamics simulations.<n>We assess MDCrow's performance across 25 tasks of varying required subtasks and difficulty, and we evaluate the agent's robustness to both difficulty and prompt style.
arXiv Detail & Related papers (2025-02-13T18:19:20Z)
AutoFLUKA: A Large Language Model Based Framework for Automating Monte Carlo Simulations in FLUKA [6.571041942559539]
Monte Carlo (MC) simulations are essential for replicating real-world scenarios across scientific and engineering fields. Despite the robustness and versatility, FLUKA faces significant limitations in automation and integration with external post-processing tools. This study explores the potential of Large Language Models (LLMs) and AI agents to address these limitations. We introduce AutoFLUKA, an AI agent application developed using the LangChain Python Framework to automate typical MC simulation in FLUKA.
arXiv Detail & Related papers (2024-10-19T21:50:11Z)
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? [73.81908518992161]
We introduce Spider2-V, the first multimodal agent benchmark focusing on professional data science and engineering. Spider2-V features real-world tasks in authentic computer environments and incorporating 20 enterprise-level professional applications. These tasks evaluate the ability of a multimodal agent to perform data-related tasks by writing code and managing the GUI in enterprise data software systems.
arXiv Detail & Related papers (2024-07-15T17:54:37Z)
A Multi-Grained Symmetric Differential Equation Model for Learning Protein-Ligand Binding Dynamics [73.35846234413611]
In drug discovery, molecular dynamics (MD) simulation provides a powerful tool for predicting binding affinities, estimating transport properties, and exploring pocket sites. We propose NeuralMD, the first machine learning (ML) surrogate that can facilitate numerical MD and provide accurate simulations in protein-ligand binding dynamics. We demonstrate the efficiency and effectiveness of NeuralMD, achieving over 1K$times$ speedup compared to standard numerical MD simulations.
arXiv Detail & Related papers (2024-01-26T09:35:17Z)
In Situ Framework for Coupling Simulation and Machine Learning with Application to CFD [51.04126395480625]
Recent years have seen many successful applications of machine learning (ML) to facilitate fluid dynamic computations. As simulations grow, generating new training datasets for traditional offline learning creates I/O and storage bottlenecks. This work offers a solution by simplifying this coupling and enabling in situ training and inference on heterogeneous clusters.
arXiv Detail & Related papers (2023-06-22T14:07:54Z)
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System [85.8338446357469]
We introduce OmniForce, a human-centered AutoML system that yields both human-assisted ML and ML-assisted human techniques. We show how OmniForce can put an AutoML system into practice and build adaptive AI in open-environment scenarios.
arXiv Detail & Related papers (2023-03-01T13:35:22Z)
Continual learning autoencoder training for a particle-in-cell simulation via streaming [52.77024349608834]
upcoming exascale era will provide a new generation of physics simulations with high resolution. These simulations will have a high resolution, which will impact the training of machine learning models since storing a high amount of simulation data on disk is nearly impossible. This work presents an approach that trains a neural network concurrently to a running simulation without data on a disk.
arXiv Detail & Related papers (2022-11-09T09:55:14Z)
Learning Large-scale Subsurface Simulations with a Hybrid Graph Network Simulator [57.57321628587564]
We introduce Hybrid Graph Network Simulator (HGNS) for learning reservoir simulations of 3D subsurface fluid flows. HGNS consists of a subsurface graph neural network (SGNN) to model the evolution of fluid flows, and a 3D-U-Net to model the evolution of pressure. Using an industry-standard subsurface flow dataset (SPE-10) with 1.1 million cells, we demonstrate that HGNS is able to reduce the inference time up to 18 times compared to standard subsurface simulators.
arXiv Detail & Related papers (2022-06-15T17:29:57Z)
Software tool-set for automated quantum system identification and device bring up [0.0]
We present a software tool-set which combines the theoretical, optimal control view of quantum devices with the practical operation and characterization tasks.<n>We perform model-based simulations to create control schemes, calibrate these controls in a closed-loop with the device.<n>Finally, we improve the system model through minimization of the mismatch between simulation and experiment, resulting in a digital twin of the device.
arXiv Detail & Related papers (2022-05-10T12:06:53Z)
MLMOD: Machine Learning Methods for Data-Driven Modeling in LAMMPS [0.0]
We present a prototype C++/Python package for characterizing microscale mechanics and molecular dynamics. The package is integrated currently with the mesomod and molecular dynamics simulation package LAMMPS and PyTorch.
arXiv Detail & Related papers (2021-07-29T22:55:26Z)

This list is automatically generated from the titles and abstracts of the papers in this site.