Engineering AI Agents for Clinical Workflows: A Case Study in Architecture,MLOps, and Governance
- URL: http://arxiv.org/abs/2602.00751v1
- Date: Sat, 31 Jan 2026 14:33:57 GMT
- Title: Engineering AI Agents for Clinical Workflows: A Case Study in Architecture,MLOps, and Governance
- Authors: Cláudio Lúcio do Val Lopes, João Marcus Pitta, Fabiano Belém, Gildson Alves, Flávio Vinícius Cruzeiro Martins,
- Abstract summary: We show how a Human-in-the-Loop governance model is technically integrated not merely as a safety check, but as a critical, event-driven data source for continuous improvement.<n>We present the platform as a reference architecture, offering practical lessons for engineers building maintainable, scalable, and accountable AI-enabled systems in high-stakes domains.
- Score: 0.21748200848556345
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The integration of Artificial Intelligence (AI) into clinical settings presents a software engineering challenge, demanding a shift from isolated models to robust, governable, and reliable systems. However, brittle, prototype-derived architectures often plague industrial applications and a lack of systemic oversight, creating a ``responsibility vacuum'' where safety and accountability are compromised. This paper presents an industry case study of the ``Maria'' platform, a production-grade AI system in primary healthcare that addresses this gap. Our central hypothesis is that trustworthy clinical AI is achieved through the holistic integration of four foundational engineering pillars. We present a synergistic architecture that combines Clean Architecture for maintainability with an Event-driven architecture for resilience and auditability. We introduce the Agent as the primary unit of modularity, each possessing its own autonomous MLOps lifecycle. Finally, we show how a Human-in-the-Loop governance model is technically integrated not merely as a safety check, but as a critical, event-driven data source for continuous improvement. We present the platform as a reference architecture, offering practical lessons for engineers building maintainable, scalable, and accountable AI-enabled systems in high-stakes domains.
Related papers
- Human-Certified Module Repositories for the AI Age [0.0]
Human-Certified Module Repositories (HCMRs) are introduced in this work as a new architectural model for constructing trustworthy software.<n>HCMRs blend human oversight with automated analysis to certify modules and support safe, predictable assembly by both humans and AI agents.
arXiv Detail & Related papers (2026-03-03T01:46:41Z) - Sovereign-by-Design A Reference Architecture for AI and Blockchain Enabled Systems [2.9407799864652837]
We argue that sovereignty must be treated as a first-class architectural property rather than a purely regulatory objective.<n>We introduce a Sovereign Reference Architecture that integrates self-sovereign identity, blockchain-based trust and auditability, sovereign data governance, and Generative AI deployed under explicit architectural control.
arXiv Detail & Related papers (2026-02-05T09:49:04Z) - EmboCoach-Bench: Benchmarking AI Agents on Developing Embodied Robots [68.29056647487519]
Embodied AI is fueled by high-fidelity simulation and large-scale data collection.<n>However, this scaling capability remains bottlenecked by a reliance on labor-intensive manual oversight.<n>We introduce textscEmboCoach-Bench, a benchmark evaluating the capacity of LLM agents to autonomously engineer embodied policies.
arXiv Detail & Related papers (2026-01-29T11:33:49Z) - Agentic Design Patterns: A System-Theoretic Framework [8.108572809924956]
Existing efforts to agentic design patterns often lack a rigorous systems-theoretic foundation.<n>We propose a novel system-theoretic framework that deconstructs an agentic AI system into five core, interacting functional subsystems.<n>We present a collection of 12 agentic design patterns that offer reusable, structural solutions to recurring problems in agent design.
arXiv Detail & Related papers (2026-01-27T16:14:08Z) - Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey [59.3507264893654]
Issue resolution is a complex Software Engineering task integral to real-world development.<n> benchmarks like SWE-bench revealed this task as profoundly difficult for large language models.<n>This paper presents a systematic survey of this emerging domain.
arXiv Detail & Related papers (2026-01-15T18:55:03Z) - AI-NativeBench: An Open-Source White-Box Agentic Benchmark Suite for AI-Native Systems [52.65695508605237]
We introduce AI-NativeBench, the first application-centric and white-box AI-Native benchmark suite grounded in Model Context Protocol (MCP) and Agent-to-Agent (A2A) standards.<n>By treating agentic spans as first-class citizens within distributed traces, our methodology enables granular analysis of engineering characteristics beyond simple capabilities.<n>This work provides the first systematic evidence to guide the transition from measuring model capability to engineering reliable AI-Native systems.
arXiv Detail & Related papers (2026-01-14T11:32:07Z) - Agentic AI for Cyber Resilience: A New Security Paradigm and Its System-Theoretic Foundations [20.297163645916157]
Cybersecurity is being fundamentally reshaped by foundation-model-based artificial intelligence.<n>This chapter argues for a shift from prevention-centric security toward agentic cyber resilience.
arXiv Detail & Related papers (2025-12-28T11:17:36Z) - An Agentic Framework for Autonomous Materials Computation [70.24472585135929]
Large Language Models (LLMs) have emerged as powerful tools for accelerating scientific discovery.<n>Recent advances integrate LLMs into agentic frameworks, enabling retrieval, reasoning, and tool use for complex scientific experiments.<n>Here, we present a domain-specialized agent designed for reliable automation of first-principles materials computations.
arXiv Detail & Related papers (2025-12-22T15:03:57Z) - From Accuracy to Impact: The Impact-Driven AI Framework (IDAIF) for Aligning Engineering Architecture with Theory of Change [0.40611352512781873]
Impact-Driven AI Framework (IDAIF) is a novel architectural methodology that integrates Theory of Change (ToC) principles with modern artificial intelligence system design.<n>IDAIF addresses the gap by establishing a systematic mapping between ToC's five-stage model and corresponding AI architectural layers.<n>Three case studies demonstrate IDAIF application across healthcare, cybersecurity, and software engineering domains.
arXiv Detail & Related papers (2025-12-09T10:21:02Z) - A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems [53.37728204835912]
Most existing AI systems rely on manually crafted configurations that remain static after deployment.<n>Recent research has explored agent evolution techniques that aim to automatically enhance agent systems based on interaction data and environmental feedback.<n>This survey aims to provide researchers and practitioners with a systematic understanding of self-evolving AI agents.
arXiv Detail & Related papers (2025-08-10T16:07:32Z) - Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems [132.77459963706437]
This book provides a comprehensive overview, framing intelligent agents within modular, brain-inspired architectures.<n>It explores self-enhancement and adaptive evolution mechanisms, exploring how agents autonomously refine their capabilities.<n>It also examines the collective intelligence emerging from agent interactions, cooperation, and societal structures.
arXiv Detail & Related papers (2025-03-31T18:00:29Z) - Proceedings of the Robust Artificial Intelligence System Assurance
(RAISA) Workshop 2022 [0.0]
The RAISA workshop will focus on research, development and application of robust artificial intelligence (AI) and machine learning (ML) systems.
Rather than studying robustness with respect to particular ML algorithms, our approach will be to explore robustness assurance at the system architecture level.
arXiv Detail & Related papers (2022-02-10T01:15:50Z) - STARdom: an architecture for trusted and secure human-centered
manufacturing systems [4.093985503448998]
We propose an architecture that integrates forecasts, Explainable Artificial Intelligence, supports collecting users' feedback, and uses Active Learning and Simulated Reality to enhance forecasts.
We tailor it for the domain of demand forecasting and validate it on a real-world case study.
arXiv Detail & Related papers (2021-04-02T11:00:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.