AI-Augmented CI/CD Pipelines: From Code Commit to Production with Autonomous Decisions
- URL: http://arxiv.org/abs/2508.11867v1
- Date: Sat, 16 Aug 2025 01:51:59 GMT
- Title: AI-Augmented CI/CD Pipelines: From Code Commit to Production with Autonomous Decisions
- Authors: Mohammad Baqar, Saba Naqvi, Rajat Khanda,
- Abstract summary: We propose AI-augmented CI/CD Pipelines, where large language models and autonomous agents act as policy-bounded co-pilots.<n>We discuss ethics, verification, auditability, and threats to validity, and chart a roadmap for verifiable autonomy in production delivery systems.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Modern software delivery has accelerated from quarterly releases to multiple deployments per day. While CI/CD tooling has matured, human decision points interpreting flaky tests, choosing rollback strategies, tuning feature flags, and deciding when to promote a canary remain major sources of latency and operational toil. We propose AI-Augmented CI/CD Pipelines, where large language models (LLMs) and autonomous agents act as policy-bounded co-pilots and progressively as decision makers. We contribute: (1) a reference architecture for embedding agentic decision points into CI/CD, (2) a decision taxonomy and policy-as-code guardrail pattern, (3) a trust-tier framework for staged autonomy, (4) an evaluation methodology using DevOps Research and Assessment ( DORA) metrics and AI-specific indicators, and (5) a detailed industrial-style case study migrating a React 19 microservice to an AI-augmented pipeline. We discuss ethics, verification, auditability, and threats to validity, and chart a roadmap for verifiable autonomy in production delivery systems.
Related papers
- From Prompt-Response to Goal-Directed Systems: The Evolution of Agentic AI Software Architecture [0.0]
Agentic AI denotes an architectural transition from stateless, prompt-driven generative models toward goal-directed systems.<n>This paper examines this transition by connecting intelligent agent theories, with contemporary LLM-centric approaches.<n>The study identifies a convergence toward standardized agent loops, registries, and auditable control mechanisms.
arXiv Detail & Related papers (2026-02-11T03:34:48Z) - Let the Barbarians In: How AI Can Accelerate Systems Performance Research [80.43506848683633]
We term this iterative cycle of generation, evaluation, and refinement AI-Driven Research for Systems.<n>We demonstrate that ADRS-generated solutions can match or even outperform human state-of-the-art designs.
arXiv Detail & Related papers (2025-12-16T18:51:23Z) - QOC DAO - Stepwise Development Towards an AI Driven Decentralized Autonomous Organization [0.0]
This paper introduces a structured approach to improving decision making in Decentralized Autonomous Organizations (DAO)<n>We outline a stepwise governance framework that evolves from human led evaluations to fully autonomous, AI-driven processes.
arXiv Detail & Related papers (2025-11-10T12:55:23Z) - DAO-AI: Evaluating Collective Decision-Making through Agentic AI in Decentralized Governance [7.230919380272301]
This paper presents a first empirical study of agentic AI as autonomous decision-makers in decentralized governance.<n>We build an agentic AI voter that interprets proposal contexts, retrieves historical deliberation data, and independently determines its voting position.<n>The study contributes to the design of explainable and economically rigorous AI agents for decentralized financial systems.
arXiv Detail & Related papers (2025-10-24T03:13:14Z) - AI/ML Life Cycle Management for Interoperable AI Native RAN [50.61227317567369]
Artificial intelligence (AI) and machine learning (ML) models are rapidly permeating the 5G Radio Access Network (RAN)<n>These developments lay the foundation for AI-native transceivers as a key enabler for 6G.
arXiv Detail & Related papers (2025-07-24T16:04:59Z) - HADA: Human-AI Agent Decision Alignment Architecture [0.0]
HADA is a protocol- and framework reference architecture that keeps both large language model (LLM) agents and legacy algorithms aligned with organizational targets and values.<n>Technical and non-technical actors can query, steer, audit, or contest every decision across strategic, tactical, and real-time horizons.
arXiv Detail & Related papers (2025-06-01T14:04:52Z) - Generative AI for Autonomous Driving: Frontiers and Opportunities [145.6465312554513]
This survey delivers a comprehensive synthesis of the emerging role of GenAI across the autonomous driving stack.<n>We begin by distilling the principles and trade-offs of modern generative modeling, encompassing VAEs, GANs, Diffusion Models, and Large Language Models.<n>We categorize practical applications, such as synthetic data generalization, end-to-end driving strategies, high-fidelity digital twin systems, smart transportation networks, and cross-domain transfer to embodied AI.
arXiv Detail & Related papers (2025-05-13T17:59:20Z) - DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal [55.13854171147104]
Large Language Models (LLMs) have revolutionized various domains, including natural language processing, data analysis, and software development.<n>We present Dynamic Action Re-Sampling (DARS), a novel inference time compute scaling approach for coding agents.<n>We evaluate our approach on SWE-Bench Lite benchmark, demonstrating that this scaling strategy achieves a pass@k score of 55% with Claude 3.5 Sonnet V2.
arXiv Detail & Related papers (2025-03-18T14:02:59Z) - Measuring AI agent autonomy: Towards a scalable approach with code inspection [8.344207672507334]
We introduce a code-based assessment of autonomy that eliminates the need to run an AI agent to perform specific tasks.<n>We demonstrate this approach with the AutoGen framework and select applications.
arXiv Detail & Related papers (2025-02-21T04:58:40Z) - Combining AI Control Systems and Human Decision Support via Robustness and Criticality [53.10194953873209]
We extend a methodology for adversarial explanations (AE) to state-of-the-art reinforcement learning frameworks.
We show that the learned AI control system demonstrates robustness against adversarial tampering.
In a training / learning framework, this technology can improve both the AI's decisions and explanations through human interaction.
arXiv Detail & Related papers (2024-07-03T15:38:57Z) - RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models [46.476439550746136]
Large language model (LLM) applications in cloud root cause analysis (RCA) have been actively explored recently.
We present RCAgent, a tool-augmented LLM autonomous agent framework for practical and privacy-aware industrial RCA usage.
Running on an internally deployed model rather than GPT families, RCAgent is capable of free-form data collection and comprehensive analysis with tools.
arXiv Detail & Related papers (2023-10-25T03:53:31Z) - The Pipeline for the Continuous Development of Artificial Intelligence
Models -- Current State of Research and Practice [3.793596705511303]
This paper includes a Multivocal Literature Review where we consolidated 151 relevant formal and informal sources.
We map challenges regarding pipeline implementation, adaption, and usage for the continuous development of AI to these four stages.
arXiv Detail & Related papers (2023-01-21T20:04:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.