Related papers: Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge

URL: http://arxiv.org/abs/2510.02557v1
Date: Thu, 02 Oct 2025 20:51:39 GMT
Title: Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge
Authors: Charlie Masters, Advaith Vellanki, Jiangbo Shangguan, Bart Kultys, Jonathan Gilmore, Alastair Moore, Stefano V. Albrecht
Abstract summary: This paper presents a research vision for autonomous agentic systems that orchestrate collaboration within dynamic human-AI teams. We propose the Autonomous Manager Agent as a core challenge: an agent that decomposes complex goals into task graphs, allocates tasks to human and AI workers, monitors progress, and maintains transparent stakeholder communication. We release MA-Gym, an open-source simulation and evaluation framework for multi-agent workflow orchestration.
Score: 9.36518257854918
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While agentic AI has advanced in automating individual tasks, managing complex multi-agent workflows remains a challenging problem. This paper presents a research vision for autonomous agentic systems that orchestrate collaboration within dynamic human-AI teams. We propose the Autonomous Manager Agent as a core challenge: an agent that decomposes complex goals into task graphs, allocates tasks to human and AI workers, monitors progress, adapts to changing conditions, and maintains transparent stakeholder communication. We formalize workflow management as a Partially Observable Stochastic Game and identify four foundational challenges: (1) compositional reasoning for hierarchical decomposition, (2) multi-objective optimization under shifting preferences, (3) coordination and planning in ad hoc teams, and (4) governance and compliance by design. To advance this agenda, we release MA-Gym, an open-source simulation and evaluation framework for multi-agent workflow orchestration. Evaluating GPT-5-based Manager Agents across 20 workflows, we find they struggle to jointly optimize for goal completion, constraint adherence, and workflow runtime, underscoring workflow management as a difficult open problem. We conclude with organizational and ethical implications of autonomous management systems.

Related papers

A Practical Guide to Agentic AI Transition in Organizations [4.085087405595323]
Agentic AI represents a significant shift in how intelligence is applied within organizations. This paper proposes a pragmatic framework for transitioning organizational functions from manual processes to automated agentic AI systems.
arXiv Detail & Related papers (2026-01-27T10:49:59Z)
Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge [170.47383225329915]
Multi-agent system frameworks are becoming essential for achieving scalable, efficient, and collaborative solutions. This shift is fueled by three primary factors: increasing agent capabilities, enhancing system efficiency through task delegation, and enabling advanced human-agent interactions. We propose the Multi-Agent Robotic System (MARS) Challenge, held at the NeurIPS 2025 Workshop on SpaVLE.
arXiv Detail & Related papers (2026-01-26T17:56:19Z)
Agentic AI for Mobile Network RAN Management and Optimization [0.0]
Agentic AI represents a new paradigm for automating complex systems by using Large AI Models (LAMs). This paper contributes to ongoing research on Agentic AI in 5G and 6G networks by tracing its evolution from classical agents to Agentic AI. Core design patterns (reflection, planning, tool use, and multi-agent collaboration) are then described to illustrate how intelligent behaviors are orchestrated.
arXiv Detail & Related papers (2025-11-04T12:34:57Z)
Agentic Lybic: Multi-Agent Execution System with Tiered Reasoning and Orchestration [21.929452003961927]
Agentic Lybic is a novel multi-agent system where the entire architecture operates as a finite-state machine (FSM). We show that Agentic Lybic achieves a state-of-the-art 57.07% success rate in 50 steps, substantially outperforming existing methods.
arXiv Detail & Related papers (2025-09-14T03:22:27Z)
Allen: Rethinking MAS Design through Step-Level Policy Autonomy [0.0]
We introduce a new Multi-Agent System (MAS), Allen, designed to address two core challenges in current MAS design. We have constructed a four-tier state architecture to constrain system behavior from both task-oriented and execution-oriented perspectives. Allen grants unprecedented Policy Autonomy, while making a trade-off for the controllability of the collaborative structure.
arXiv Detail & Related papers (2025-08-15T08:02:34Z)
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience [71.82719117238307]
We propose SEAgent, an agentic self-evolving framework enabling computer-use agents to evolve through interactions with unfamiliar software. We validate the effectiveness of SEAgent across five novel software environments within OS-World. Our approach achieves a significant improvement of 23.2% in success rate, from 11.3% to 34.5%, over a competitive open-source CUA.
arXiv Detail & Related papers (2025-08-06T17:58:46Z)
Multi-Agent Collaboration via Evolving Orchestration [61.93162413517026]
Large language models (LLMs) have achieved remarkable results across diverse downstream tasks, but their monolithic nature restricts scalability and efficiency in complex problem-solving. We propose a puppeteer-style paradigm for LLM-based multi-agent collaboration, where a central orchestrator dynamically directs agents in response to evolving task states. Experiments on closed- and open-domain scenarios show that this method achieves superior performance with reduced computational costs.
arXiv Detail & Related papers (2025-05-26T07:02:17Z)
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges [3.7414278978078204]
This review critically distinguishes between AI Agents and Agentic AI, offering a structured, conceptual taxonomy, application mapping, and analysis of opportunities and challenges to clarify their divergent design philosophies and capabilities.
arXiv Detail & Related papers (2025-05-15T16:21:33Z)
Agent-Oriented Planning in Multi-Agent Systems [54.429028104022066]
We propose AOP, a novel framework for agent-oriented planning in multi-agent systems. In this study, we identify three critical design principles of agent-oriented planning, including solvability, completeness, and non-redundancy. Extensive experiments demonstrate the advancement of AOP in solving real-world problems compared to both single-agent systems and existing planning strategies for multi-agent systems.
arXiv Detail & Related papers (2024-10-03T04:07:51Z)
The Foundations of Computational Management: A Systematic Approach to Task Automation for the Integration of Artificial Intelligence into Existing Workflows [55.2480439325792]
This article introduces Computational Management, a systematic approach to task automation. The article offers three easy step-by-step procedures to begin the process of implementing AI within a workflow.
arXiv Detail & Related papers (2024-02-07T01:45:14Z)
Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution [92.84441068115517]
Investigate-Consolidate-Exploit (ICE) is a novel strategy for enhancing the adaptability and flexibility of AI agents. ICE promotes the transfer of knowledge between tasks for genuine self-evolution. Our experiments on the XAgent framework demonstrate ICE's effectiveness, reducing API calls by as much as 80%.
arXiv Detail & Related papers (2024-01-25T07:47:49Z)
A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration [55.35849138235116]
We propose automatically selecting a team of agents from candidates to collaborate in a dynamic communication structure toward different tasks and domains. Specifically, we build a framework named Dynamic LLM-Powered Agent Network (DyLAN) for LLM-powered agent collaboration. We demonstrate that DyLAN outperforms strong baselines in code generation, decision-making, general reasoning, and arithmetic reasoning tasks with moderate computational cost.
arXiv Detail & Related papers (2023-10-03T16:05:48Z)

This list is automatically generated from the titles and abstracts of the papers on this site.