Hierarchical Multi-agent Large Language Model Reasoning for Autonomous Functional Materials Discovery
- URL: http://arxiv.org/abs/2512.13930v1
- Date: Mon, 15 Dec 2025 22:08:18 GMT
- Title: Hierarchical Multi-agent Large Language Model Reasoning for Autonomous Functional Materials Discovery
- Authors: Samuel Rothfarb, Megan C. Davis, Ivana Matanovic, Baikun Li, Edward F. Holby, Wilton J. M. Kort-Kamp,
- Abstract summary: We introduce Materials Agents for Simulation and Theory in Electronic-structure Reasoning (Master)<n>In MASTER, a multimodal system translates natural language into density functional theory, while higher-level reasoning agents guide discovery through a hierarchy of strategies.<n> Reasoning trajectories reveal chemically grounded decisions that cannot be explained by sampling or semantic bias.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Artificial intelligence is reshaping scientific exploration, but most methods automate procedural tasks without engaging in scientific reasoning, limiting autonomy in discovery. We introduce Materials Agents for Simulation and Theory in Electronic-structure Reasoning (MASTER), an active learning framework where large language models autonomously design, execute, and interpret atomistic simulations. In MASTER, a multimodal system translates natural language into density functional theory workflows, while higher-level reasoning agents guide discovery through a hierarchy of strategies, including a single agent baseline and three multi-agent approaches: peer review, triage-ranking, and triage-forms. Across two chemical applications, CO adsorption on Cu-surface transition metal (M) adatoms and on M-N-C catalysts, reasoning-driven exploration reduces required atomistic simulations by up to 90% relative to trial-and-error selection. Reasoning trajectories reveal chemically grounded decisions that cannot be explained by stochastic sampling or semantic bias. Altogether, multi-agent collaboration accelerates materials discovery and marks a new paradigm for autonomous scientific exploration.
Related papers
- Reasoning-Driven Design of Single Atom Catalysts via a Multi-Agent Large Language Model Framework [1.4582793306013617]
Large language models (LLMs) are becoming increasingly applied beyond natural language processing.<n>Here, we present a Multi-Agent-based Electrocatalyst Search Through Reasoning and Optimization framework.<n>In this framework, multiple LLMs with specialized roles collaboratively discover high-performance single atom catalysts.
arXiv Detail & Related papers (2026-02-25T03:43:24Z) - AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent [57.10083973844841]
AgentArk is a novel framework to distill multi-agent dynamics into the weights of a single model.<n>We investigate three hierarchical distillation strategies across various models, tasks, scaling, and scenarios.<n>By shifting the burden of computation from inference to training, the distilled models preserve the efficiency of one agent while exhibiting strong reasoning and self-correction performance of multiple agents.
arXiv Detail & Related papers (2026-02-03T19:18:28Z) - QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities [0.7519872646378835]
QUASAR is a universal autonomous system for atomistic simulation designed to facilitate production-grade scientific discovery.<n>We benchmark QUASAR against a series of three-tiered tasks, progressing from routine tasks to frontier research challenges such as photocatalyst screening and novel material assessment.<n>Results suggest that QUASAR can function as a general atomistic reasoning system rather than a task-specific automation framework.
arXiv Detail & Related papers (2026-01-30T05:29:44Z) - Agentic reinforcement learning empowers next-generation chemical language models for molecular design and synthesis [51.83339196548892]
ChemCraft is a novel framework that decouples chemical reasoning from knowledge storage.<n>ChemCraft achieves superior performance with minimal inference costs.<n>This work establishes a cost-effective and privacy-preserving paradigm for AI-aided chemistry.
arXiv Detail & Related papers (2026-01-25T04:23:34Z) - Autonomous Quantum Simulation through Large Language Model Agents [0.29165586612027233]
Large language model (LLM) agents can autonomously perform tensor network simulations of quantum many-body systems.<n>We create autonomous AI agents that can be trained in specialized computational domains within minutes.
arXiv Detail & Related papers (2026-01-15T08:50:57Z) - An Agentic Framework for Autonomous Materials Computation [70.24472585135929]
Large Language Models (LLMs) have emerged as powerful tools for accelerating scientific discovery.<n>Recent advances integrate LLMs into agentic frameworks, enabling retrieval, reasoning, and tool use for complex scientific experiments.<n>Here, we present a domain-specialized agent designed for reliable automation of first-principles materials computations.
arXiv Detail & Related papers (2025-12-22T15:03:57Z) - SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning [54.186990494217916]
SciAgent is a unified multi-agent system designed for generalistic scientific reasoning.<n>A Coordinator Agent interprets each problem's domain and complexity, dynamically orchestrating specialized Worker Systems.<n>These Worker Systems are composed of interacting reasoning Sub-agents for symbolic deduction, conceptual modeling, numerical computation, and verification.
arXiv Detail & Related papers (2025-11-11T12:00:34Z) - Prospects for Using Artificial Intelligence to Understand Intrinsic Kinetics of Heterogeneous Catalytic Reactions [2.0971479389679337]
Key frontier is integrating AI with multiscale models and multimodal experiments.<n> inconsistent data quality and model complexity limit mechanistic discovery.<n>Generative and agentic AI can automate model generation, quantify uncertainty, and couple theory with experiment.
arXiv Detail & Related papers (2025-10-21T04:35:26Z) - NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents [65.85967483058705]
Large language models are emerging as powerful tools for scientific law discovery.<n>Existing benchmarks for this task suffer from a fundamental methodological trilemma.<n>We introduce NewtonBench, a benchmark comprising 324 scientific law discovery tasks across 12 physics domains.
arXiv Detail & Related papers (2025-10-08T16:12:11Z) - ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows [82.07367406991678]
Large Language Models (LLMs) have extended their impact beyond Natural Language Processing.<n>Among these, computer-using agents are capable of interacting with operating systems as humans do.<n>We introduce ScienceBoard, which encompasses a realistic, multi-domain environment featuring dynamic and visually rich scientific software.
arXiv Detail & Related papers (2025-05-26T12:27:27Z) - Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems [0.0]
A multi-agent AI model is used to automate the discovery of new metallic alloys.
We focus on the NbMoTa family of body-centered cubic (bcc) alloys, modeled using an ML-based interatomic potential.
By synergizing the predictive power of GNNs with the dynamic collaboration of LLM-based agents, the system autonomously navigates vast alloy design spaces.
arXiv Detail & Related papers (2024-10-17T17:06:26Z) - SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning [0.0]
A key challenge in artificial intelligence is the creation of systems capable of autonomously advancing scientific understanding.
We present SciAgents, an approach that leverages three core concepts.
The framework autonomously generates and refines research hypotheses, elucidating underlying mechanisms, design principles, and unexpected material properties.
Our case studies demonstrate scalable capabilities to combine generative AI, ontological representations, and multi-agent modeling, harnessing a swarm of intelligence' similar to biological systems.
arXiv Detail & Related papers (2024-09-09T12:25:10Z) - Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models [4.9108308035618515]
Multi-agent reinforcement learning (MARL) methods struggle with the non-stationarity of multi-agent systems.<n>Here, we leverage large language models (LLMs) to create an autonomous agent that can handle these challenges.<n>Our agent, Hypothetical Minds, consists of a cognitively-inspired architecture, featuring modular components for perception, memory, and hierarchical planning over two levels of abstraction.
arXiv Detail & Related papers (2024-07-09T17:57:15Z) - Simulation Intelligence: Towards a New Generation of Scientific Methods [81.75565391122751]
"Nine Motifs of Simulation Intelligence" is a roadmap for the development and integration of the essential algorithms necessary for a merger of scientific computing, scientific simulation, and artificial intelligence.
We argue the motifs of simulation intelligence are interconnected and interdependent, much like the components within the layers of an operating system.
We believe coordinated efforts between motifs offers immense opportunity to accelerate scientific discovery.
arXiv Detail & Related papers (2021-12-06T18:45:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.