Sharpen the Spec, Cut the Code: A Case for Generative File System with SYSSPEC
- URL: http://arxiv.org/abs/2512.13047v1
- Date: Mon, 15 Dec 2025 07:15:01 GMT
- Title: Sharpen the Spec, Cut the Code: A Case for Generative File System with SYSSPEC
- Authors: Qingyuan Liu, Zou Mo, Hengbin Zhang, Dong Du, Yubin Xia, Haibo Chen,
- Abstract summary: This paper introduces SYSSPEC, a framework for developing generative file systems.<n>Instead of imprecise prompts, SYSSPEC employs a multi-part specification that accurately describes a file system's functionality.<n>To manage evolution, we develop a DAG-structured patch that operates on the specification itself.
- Score: 6.536447932095715
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: File systems are critical OS components that require constant evolution to support new hardware and emerging applica- tion needs. However, the traditional paradigm of developing features, fixing bugs, and maintaining the system incurs significant overhead, especially as systems grow in complexity. This paper proposes a new paradigm, generative file systems, which leverages Large Language Models (LLMs) to generate and evolve a file system from prompts, effectively addressing the need for robust evolution. Despite the widespread success of LLMs in code generation, attempts to create a functional file system have thus far been unsuccessful, mainly due to the ambiguity of natural language prompts. This paper introduces SYSSPEC, a framework for developing generative file systems. Its key insight is to replace ambiguous natural language with principles adapted from formal methods. Instead of imprecise prompts, SYSSPEC employs a multi-part specification that accurately describes a file system's functionality, modularity, and concurrency. The specification acts as an unambiguous blueprint, guiding LLMs to generate expected code flexibly. To manage evolution, we develop a DAG-structured patch that operates on the specification itself, enabling new features to be added without violating existing invariants. Moreover, the SYSSPEC toolchain features a set of LLM-based agents with mechanisms to mitigate hallucination during construction and evolution. We demonstrate our approach by generating SPECFS, a concurrent file system. SPECFS passes hundreds of regression tests, matching a manually-coded baseline. We further confirm its evolvability by seamlessly integrating 10 real-world features from Ext4. Our work shows that a specification-guided approach makes generating and evolving complex systems not only feasible but also highly effective.
Related papers
- Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition [53.50448142467294]
RAIM is a multi-design and architecture-aware framework for repository-level feature addition.<n>It shifts away from linear patching by generating multiple diverse implementation designs.<n>Experiments on the NoCode-bench Verified dataset demonstrate that RAIM establishes a new state-of-the-art performance.
arXiv Detail & Related papers (2026-03-02T12:50:40Z) - Evolutionary Generation of Multi-Agent Systems [49.47969796873096]
Large language model (LLM)-based multi-agent systems (MAS) show strong promise for complex reasoning, planning, and tool-augmented tasks.<n>EvoMAS formulates MAS generation as structured configuration generation.<n>EvoMAS consistently improves task performance over both human-designed MAS and prior automatic MAS generation methods.
arXiv Detail & Related papers (2026-02-06T09:01:35Z) - ThinkGen: Generalized Thinking for Visual Generation [97.19923474851987]
ThinkGen is a think-driven visual generation framework that explicitly leverages Chain-of-Thought (CoT) reasoning in various generation scenarios.<n>We propose a separable GRPO-based training paradigm, alternating reinforcement learning between the MLLM and DiT modules.<n>Experiments demonstrate that ThinkGen achieves robust, state-of-the-art performance across multiple generation benchmarks.
arXiv Detail & Related papers (2025-12-29T16:08:50Z) - SysMoBench: Evaluating AI on Formally Modeling Complex Real-World Systems [12.181911851729614]
We present SysMoBench, a benchmark that evaluates AI's ability to formally model large, complex systems.<n>We focus on concurrent and distributed systems, which are keystones of today's critical computing infrastructures.
arXiv Detail & Related papers (2025-09-27T05:24:54Z) - Requirements Development and Formalization for Reliable Code Generation: A Multi-Agent Vision [45.59678433715798]
We envision the first multi-agent framework for reliable code generation based on textscrequirements textscdevelopment and textscformalization, named textscReDeFo.<n>The core of textscReDeFo is the use of formal specifications to bridge the gap between potentially ambiguous natural language requirements and precise executable code.
arXiv Detail & Related papers (2025-08-26T04:45:04Z) - Specifications: The missing link to making the development of LLM systems an engineering discipline [65.10077876035417]
We discuss the progress the field has made so far-through advances like structured outputs, process supervision, and test-time compute.<n>We outline several future directions for research to enable the development of modular and reliable LLM-based systems.
arXiv Detail & Related papers (2024-11-25T07:48:31Z) - From Commands to Prompts: LLM-based Semantic File System for AIOS [46.29019415676847]
We propose an LLM-based semantic file system ( LSFS) for prompt-driven file management.<n>Unlike conventional approaches, LSFS incorporates LLMs to enable users or agents to interact with files through natural language prompts.<n>Our experiments show that LSFS offers significant improvements over traditional file systems in terms of user convenience, the diversity of supported functions, and the accuracy and efficiency of file operations.
arXiv Detail & Related papers (2024-09-23T08:39:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.