Distributed Architecture Reconstruction of Polyglot and Multi-Repository Microservice Projects
- URL: http://arxiv.org/abs/2602.08166v1
- Date: Sun, 08 Feb 2026 23:59:19 GMT
- Title: Distributed Architecture Reconstruction of Polyglot and Multi-Repository Microservice Projects
- Authors: Oscar Manglaras, Alex Farkas, Thomas Woolford, Christoph Treude, Markus Wagner,
- Abstract summary: This paper presents a novel framework for static architecture reconstruction that supports technology-specific analysis modules.<n>We describe the core design concepts and algorithms that govern how extractors are executed, how data is passed between them, and how their outputs are unified.
- Score: 7.448085632032854
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Microservice architectures encourage the use of small, independently developed services; however, this can lead to increased architectural complexity. Accurate documentation is crucial, but is challenging to maintain due to the rapid, independent evolution of services. While static architecture reconstruction provides a way to maintain up-to-date documentation, existing approaches suffer from technology limitations, mono-repo constraints, or high implementation barriers. This paper presents a novel framework for static architecture reconstruction that supports technology-specific analysis modules, called \emph{extractors}, and supports \emph{distributed architecture reconstruction} in multi-repo environments. We describe the core design concepts and algorithms that govern how extractors are executed, how data is passed between them, and how their outputs are unified. Furthermore, the framework is interoperable with existing static analysis tools and algorithms, allowing them to be invoked from or embedded within extractors.
Related papers
- Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition [53.50448142467294]
RAIM is a multi-design and architecture-aware framework for repository-level feature addition.<n>It shifts away from linear patching by generating multiple diverse implementation designs.<n>Experiments on the NoCode-bench Verified dataset demonstrate that RAIM establishes a new state-of-the-art performance.
arXiv Detail & Related papers (2026-03-02T12:50:40Z) - ModARO: A Modular Approach to Architecture Reconstruction of Distributed Microservice Codebases [7.448085632032854]
ModARO is an approach to microservice architecture reconstruction.<n>It allows writing modular code ('extractors') for any technologies and reusing them across different projects.
arXiv Detail & Related papers (2026-02-09T00:46:35Z) - Iterative Structured Pruning for Large Language Models with Multi-Domain Calibration [73.40887151631088]
Large Language Models (LLMs) have achieved remarkable success across a wide spectrum of natural language processing tasks.<n>Their ever-growing scale introduces significant barriers to real-world deployment, including substantial computational overhead, memory footprint, and inference latency.<n>In this work, we explore structured pruning, which eliminates entire architectural components and maintains compatibility with standard hardware accelerators.
arXiv Detail & Related papers (2026-01-06T03:09:31Z) - Everything is Context: Agentic File System Abstraction for Context Engineering [11.63011212134865]
This paper proposes a file-system abstraction for context engineering.<n>The abstraction offers a persistent, governed infrastructure for managing heterogeneous context artefacts.<n>As GenAI becomes an active collaborator in decision support, humans play a central role as curators, verifiers, and co-reasoners.
arXiv Detail & Related papers (2025-12-05T06:56:45Z) - Generating Software Architecture Description from Source Code using Reverse Engineering and Large Language Model [2.6126272668390373]
Software Architecture Descriptions (SADs) are essential for managing the inherent complexity of modern software systems.<n>SADs are often missing, outdated, or poorly aligned with the system's actual implementation.<n>We propose a semi-automated generation of SADs from source code by integrating reverse engineering (RE) techniques with a Large Language Model (LLM)
arXiv Detail & Related papers (2025-11-07T11:35:46Z) - AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation [48.82264764771652]
We introduce AsCAN -- a hybrid architecture, combining both convolutional and transformer blocks.
AsCAN supports a variety of tasks: recognition, segmentation, class-conditional image generation.
We then scale the same architecture to solve a large-scale text-to-image task and show state-of-the-art performance.
arXiv Detail & Related papers (2024-11-07T18:43:17Z) - An Infrastructure Cost Optimised Algorithm for Partitioning of Microservices [20.638612359627952]
As migrating applications into the cloud is universally adopted by the software industry, have proven to be the most suitable and widely accepted architecture pattern for applications deployed on distributed cloud.
Their efficacy is enabled by both technical benefits like reliability, fault isolation, scalability and productivity benefits like ease of asset maintenance and clear ownership boundaries.
In some cases, the complexity of migrating an existing application into the architecture becomes overwhelmingly complex and expensive.
arXiv Detail & Related papers (2024-08-13T02:08:59Z) - Designing and Implementing a Generator Framework for a SIMD Abstraction Library [53.84310825081338]
We present TSLGen, a novel end-to-end framework for generating an SIMD abstraction library.
We show that our framework is comparable to existing libraries, and we achieve the same performance results.
arXiv Detail & Related papers (2024-07-26T13:25:38Z) - Mechanistic Design and Scaling of Hybrid Architectures [114.3129802943915]
We identify and test new hybrid architectures constructed from a variety of computational primitives.
We experimentally validate the resulting architectures via an extensive compute-optimal and a new state-optimal scaling law analysis.
We find MAD synthetics to correlate with compute-optimal perplexity, enabling accurate evaluation of new architectures.
arXiv Detail & Related papers (2024-03-26T16:33:12Z) - Software Architecture Recovery with Information Fusion [14.537490019685384]
We propose SARIF, a fully automated architecture recovery technique.
It incorporates three types of comprehensive information, including dependencies, code text and folder structure.
SARIF is 36.1% more accurate than the best of the previous techniques on average.
arXiv Detail & Related papers (2023-11-08T12:35:37Z) - Serving Deep Learning Model in Relational Databases [70.53282490832189]
Serving deep learning (DL) models on relational data has become a critical requirement across diverse commercial and scientific domains.
We highlight three pivotal paradigms: The state-of-the-art DL-centric architecture offloads DL computations to dedicated DL frameworks.
The potential UDF-centric architecture encapsulates one or more tensor computations into User Defined Functions (UDFs) within the relational database management system (RDBMS)
arXiv Detail & Related papers (2023-10-07T06:01:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.