Evaluation of Architectural Synthesis Using Generative AI
- URL: http://arxiv.org/abs/2503.02861v1
- Date: Tue, 04 Mar 2025 18:39:28 GMT
- Title: Evaluation of Architectural Synthesis Using Generative AI
- Authors: Jingfei Huang, Alexandros Haridis,
- Abstract summary: This paper presents a comparative evaluation of two systems: GPT-4o and Claude 3.5, in the task of architectural 3D synthesis.<n>We conduct a case study on two buildings from Palladio's Four Books of Architecture (1965): Villa Rotonda and Palazzo Porto.<n>We assess the systems' abilities in (1) interpreting 2D and 3D representations of buildings from drawings, (2) encoding the buildings into a CAD software script, and (3) self-improving based on outputs.
- Score: 49.1574468325115
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Recent advancements in multimodal Generative AI have the potential to democratize specialized architectural tasks, such as interpreting technical drawings and creating 3D CAD models, which traditionally require expert knowledge. This paper presents a comparative evaluation of two systems: GPT-4o and Claude 3.5, in the task of architectural 3D synthesis. We conduct a case study on two buildings from Palladio's Four Books of Architecture (1965): Villa Rotonda and Palazzo Porto. High-level architectural models and drawings of these buildings were prepared, inspired by Palladio's original texts and drawings. Through sequential text and image prompting, we assess the systems' abilities in (1) interpreting 2D and 3D representations of buildings from drawings, (2) encoding the buildings into a CAD software script, and (3) self-improving based on outputs. While both systems successfully generate individual parts, they struggle to accurately assemble these parts into the desired spatial relationships, with Claude 3.5 demonstrating better performance, particularly in self-correcting its output. This study contributes to ongoing research on benchmarking the strengths and weaknesses of off-the-shelf AI systems in performing intelligent human tasks that require discipline-specific knowledge. The findings highlight the potential of language-enabled AI systems to act as collaborative technical assistants in the architectural design process.
Related papers
- TwinArch: A Digital Twin Reference Architecture [83.71071897785683]
Digital Twins (DTs) are dynamic virtual representations of physical systems, enabled by seamless, bidirectional communication between the physical and digital realms.
The proposed Digital Twin Reference Architecture is named TwinArch. It is documented using the Views and Beyond methodology by the Software Engineering Institute.
arXiv Detail & Related papers (2025-04-10T07:53:11Z) - PHT-CAD: Efficient CAD Parametric Primitive Analysis with Progressive Hierarchical Tuning [52.681829043446044]
ParaCAD comprises over 10 million annotated drawings for training and 3,000 real-world industrial drawings with complex topological structures and physical constraints for test.
PHT-CAD is a novel 2D PPA framework that harnesses the modality alignment and reasoning capabilities of Vision-Language Models.
arXiv Detail & Related papers (2025-03-23T17:24:32Z) - Proc-GS: Procedural Building Generation for City Assembly with 3D Gaussians [65.09942210464747]
Building asset creation is labor-intensive and requires specialized skills to develop design rules.<n>Recent generative models for building creation often overlook these patterns, leading to low visual fidelity and limited scalability.<n>By manipulating procedural code, we can streamline this process and generate an infinite variety of buildings.
arXiv Detail & Related papers (2024-12-10T16:45:32Z) - SUGAR: Pre-training 3D Visual Representations for Robotics [85.55534363501131]
We introduce a novel 3D pre-training framework for robotics named SUGAR.
SUGAR captures semantic, geometric and affordance properties of objects through 3D point clouds.
We show that SUGAR's 3D representation outperforms state-of-the-art 2D and 3D representations.
arXiv Detail & Related papers (2024-04-01T21:23:03Z) - Generative AI Models for Different Steps in Architectural Design: A Literature Review [14.910709576423576]
It is essential to comprehend the principles and advancements of generative AI models and analyze their relevance in architecture applications.
This paper first provides an overview of generative AI technologies, with a focus on probabilistic diffusion models (DDPMs), 3D generative models, and foundation models.
We subdivide the architectural design process into six steps and review related research projects in each step from 2020 to the present.
arXiv Detail & Related papers (2024-03-30T13:25:11Z) - Sketch-to-Architecture: Generative AI-aided Architectural Design [20.42779592734634]
We present a novel workflow that utilizes AI models to generate conceptual floorplans and 3D models from simple sketches.
Our work demonstrates the potential of generative AI in the architectural design process, pointing towards a new direction of computer-aided architectural design.
arXiv Detail & Related papers (2024-03-29T14:04:45Z) - Multi-task Learning with 3D-Aware Regularization [55.97507478913053]
We propose a structured 3D-aware regularizer which interfaces multiple tasks through the projection of features extracted from an image encoder to a shared 3D feature space.
We show that the proposed method is architecture agnostic and can be plugged into various prior multi-task backbones to improve their performance.
arXiv Detail & Related papers (2023-10-02T08:49:56Z) - Multi-CLIP: Contrastive Vision-Language Pre-training for Question
Answering tasks in 3D Scenes [68.61199623705096]
Training models to apply common-sense linguistic knowledge and visual concepts from 2D images to 3D scene understanding is a promising direction that researchers have only recently started to explore.
We propose a novel 3D pre-training Vision-Language method, namely Multi-CLIP, that enables a model to learn language-grounded and transferable 3D scene point cloud representations.
arXiv Detail & Related papers (2023-06-04T11:08:53Z) - Towards AI-Architecture Liberty: A Comprehensive Survey on Design and Generation of Virtual Architecture by Deep Learning [23.58793497403681]
3D shape generation techniques leveraging deep learning have garnered significant interest from both the computer vision and architectural design communities.
We review 149 related articles covering architectural design, 3D shape techniques, and virtual environments.
We highlight four important enablers of ubiquitous interaction with immersive systems in deep learning-assisted architectural generation.
arXiv Detail & Related papers (2023-04-30T15:38:36Z) - Unfinished Architectures: A Perspective from Artificial Intelligence [73.52315464582637]
Development of Artificial Intelligence (AI) opens new avenues for the proposal of possibilities for the completion of unfinished architectures.
Recent appearance of tools such as DALL-E, capable of completing images guided by a textual description.
In this article we explore the use of these new AI tools for the completion of unfinished facades of historical temples and analyse the still germinal stadium in the field of architectural graphic composition.
arXiv Detail & Related papers (2023-03-03T13:05:10Z) - A Compositional Approach to Creating Architecture Frameworks with an
Application to Distributed AI Systems [16.690434072032176]
We show how compositional thinking can provide rules for the creation and management of architectural frameworks for complex systems.
The aim of the paper is not to provide viewpoints or architecture models specific to AI systems, but instead to provide guidelines on how a consistent framework can be built up with existing, or newly created, viewpoints.
arXiv Detail & Related papers (2022-12-27T18:05:02Z) - Vitruvio: 3D Building Meshes via Single Perspective Sketches [0.8001739956625484]
We introduce the first deep learning method focused only on buildings that aim to convert a single sketch to a 3D building mesh: Vitruvio.
First, it accelerates the inference process by more than 26% (from 0.5s to 0.37s)
Second, it increases the reconstruction accuracy (measured by the Chamfer Distance) by 18%.
arXiv Detail & Related papers (2022-10-24T22:24:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.