Related papers: Agentic AI for Software: thoughts from Software Engineering community

Agentic AI for Software: thoughts from Software Engineering community

URL: http://arxiv.org/abs/2508.17343v4
Date: Mon, 22 Sep 2025 10:56:53 GMT
Title: Agentic AI for Software: thoughts from Software Engineering community
Authors: Abhik Roychoudhury,
Abstract summary: At the code level, common software tasks include code generation, testing, and program repair.<n>Key to successfully developing agentic AI-based software will be to resolve the core difficulty in software engineering - the deciphering and clarification of developer intent.<n>A successful deployment of agentic technology into software engineering would involve making conceptual progress in such intent inference via agents.
Score: 9.966138715949205
License: http://creativecommons.org/licenses/by/4.0/
Abstract: AI agents have recently shown significant promise in software engineering. Much public attention has been transfixed on the topic of code generation from Large Language Models (LLMs) via a prompt. However, software engineering is much more than programming, and AI agents go far beyond instructions given by a prompt. At the code level, common software tasks include code generation, testing, and program repair. Design level software tasks may include architecture exploration, requirements understanding, and requirements enforcement at the code level. Each of these software tasks involves micro-decisions which can be taken autonomously by an AI agent, aided by program analysis tools. This creates the vision of an AI software engineer, where the AI agent can be seen as a member of a development team. Conceptually, the key to successfully developing trustworthy agentic AI-based software workflows will be to resolve the core difficulty in software engineering - the deciphering and clarification of developer intent. Specification inference, or deciphering the intent, thus lies at the heart of many software tasks, including software maintenance and program repair. A successful deployment of agentic technology into software engineering would involve making conceptual progress in such intent inference via agents. Trusting the AI agent becomes a key aspect, as software engineering becomes more automated. Higher automation also leads to higher volume of code being automatically generated, and then integrated into code-bases. Thus to deal with this explosion, an emerging direction is AI-based verification and validation (V & V) of AI generated code. We posit that agentic software workflows in future will include such AIbased V&V.

Related papers

A Viable Paradigm of Software Automation: Iterative End-to-End Automated Software Development [41.295627885484855]
We present a vision of an iterative end-to-end automated software development paradigm AutoSW.<n>It operates in an analyze-plan-implement-deliver loop, where AI systems as human partners become first-class actors.<n>The results indicate that AutoSW can successfully deliver executable software.
arXiv Detail & Related papers (2025-11-19T09:57:49Z)
AI-Driven Self-Evolving Software: A Promising Path Toward Software Automation [6.38492008798679]
Current AI functions primarily as assistants to human developers.<n>Can AI move beyond its role as an assistant to become a core component of software?<n>We introduce AI-Driven Self-Evolving Software, a new form of software that evolves continuously through direct interaction with users.
arXiv Detail & Related papers (2025-10-01T07:17:51Z)
Code with Me or for Me? How Increasing AI Automation Transforms Developer Workflows [66.1850490474361]
We conduct the first academic study to explore developer interactions with coding agents.<n>We evaluate two leading copilot and agentic coding assistants, GitHub Copilot and OpenHands.<n>Our results show agents have the potential to assist developers in ways that surpass copilots.
arXiv Detail & Related papers (2025-07-10T20:12:54Z)
Unified Software Engineering agent as AI Software Engineer [14.733475669942276]
Large Language Model (LLM) technology has raised expectations for automated coding.<n>In this paper, we seek to understand this question by developing a Unified Software Engineering agent or USEagent.<n>We envision USEagent as the first draft of a future AI Software Engineer which can be a team member in future software development teams involving both AI and humans.
arXiv Detail & Related papers (2025-06-17T16:19:13Z)
Explainability for Embedding AI: Aspirations and Actuality [1.8130068086063336]
Explainable AI (XAI) may allow developers to understand better the systems they build.<n>Existing XAI systems still fall short of this aspiration.<n>We see an unmet need to provide developers with adequate support mechanisms to cope with this complexity.
arXiv Detail & Related papers (2025-04-20T14:20:01Z)
Challenges and Paths Towards AI for Software Engineering [55.95365538122656]
We discuss progress in AI for software engineering in threefold manner.<n>First, we provide a structured taxonomy of concrete tasks in AI for software engineering.<n>Second, we outline several key bottlenecks that limit current approaches.
arXiv Detail & Related papers (2025-03-28T17:17:57Z)
Agentic AI Software Engineers: Programming with Trust [33.88230182444934]
Large Language Models (LLMs) have shown surprising proficiency in generating code snippets.<n>We argue that successfully deploying AI software engineers requires a level of trust equal to or even greater than the trust established by human-driven software engineering practices.
arXiv Detail & Related papers (2025-02-19T14:28:42Z)
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks [55.03911355902567]
We introduce TheAgentCompany, a benchmark for evaluating AI agents that interact with the world in similar ways to those of a digital worker.<n>We find that the most competitive agent can complete 30% of tasks autonomously.<n>This paints a nuanced picture on task automation with simulating LM agents in a setting a real workplace.
arXiv Detail & Related papers (2024-12-18T18:55:40Z)
OpenHands: An Open Platform for AI Software Developers as Generalist Agents [109.8507367518992]
We introduce OpenHands, a platform for the development of AI agents that interact with the world in similar ways to a human developer.<n>We describe how the platform allows for the implementation of new agents, safe interaction with sandboxed environments for code execution, and incorporation of evaluation benchmarks.
arXiv Detail & Related papers (2024-07-23T17:50:43Z)
ChatDev: Communicative Agents for Software Development [84.90400377131962]
ChatDev is a chat-powered software development framework in which specialized agents are guided in what to communicate. These agents actively contribute to the design, coding, and testing phases through unified language-based communication.
arXiv Detail & Related papers (2023-07-16T02:11:34Z)
Generation Probabilities Are Not Enough: Uncertainty Highlighting in AI Code Completions [54.55334589363247]
We study whether conveying information about uncertainty enables programmers to more quickly and accurately produce code. We find that highlighting tokens with the highest predicted likelihood of being edited leads to faster task completion and more targeted edits.
arXiv Detail & Related papers (2023-02-14T18:43:34Z)
Automated Machine Learning: A Case Study on Non-Intrusive Appliance Load Monitoring [81.06807079998117]
We propose a novel approach to enable Automated Machine Learning (AutoML) for Non-Intrusive Appliance Load Monitoring (NIALM)<n>NIALM offers a cost-effective alternative to smart meters for measuring the energy consumption of electric devices and appliances.
arXiv Detail & Related papers (2022-03-06T10:12:56Z)

This list is automatically generated from the titles and abstracts of the papers in this site.

This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.