Using Text-to-Image Generation for Architectural Design Ideation
- URL: http://arxiv.org/abs/2304.10182v1
- Date: Thu, 20 Apr 2023 09:46:27 GMT
- Title: Using Text-to-Image Generation for Architectural Design Ideation
- Authors: Ville Paananen, Jonas Oppenlaender, Aku Visuri
- Abstract summary: This study is the first to investigate the potential of text-to-image generators in supporting creativity during the early stages of the architectural design process.
We conducted a laboratory study with 17 architecture students, who developed a concept for a culture center using three popular text-to-image generators.
- Score: 10.938191897918474
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The recent progress of text-to-image generation has been recognized in
architectural design. Our study is the first to investigate the potential of
text-to-image generators in supporting creativity during the early stages of
the architectural design process. We conducted a laboratory study with 17
architecture students, who developed a concept for a culture center using three
popular text-to-image generators: Midjourney, Stable Diffusion, and DALL-E.
Through standardized questionnaires and group interviews, we found that image
generation could be a meaningful part of the design process when design
constraints are carefully considered. Generative tools support serendipitous
discovery of ideas and an imaginative mindset, enriching the design process. We
identified several challenges of image generators and provided considerations
for software development and educators to support creativity and emphasize
designers' imaginative mindset. By understanding the limitations and potential
of text-to-image generators, architects and designers can leverage this
technology in their design process and education, facilitating innovation and
effective communication of concepts.
Related papers
- A Framework for Collaborating a Large Language Model Tool in Brainstorming for Triggering Creative Thoughts [2.709166684084394]
This study proposes a framework called GPS, which employs goals, prompts, and strategies to guide designers to systematically work with an LLM tool for improving the creativity of ideas generated during brainstorming.
Our framework, tested through a design example and a case study, demonstrates its effectiveness in stimulating creativity and its seamless LLM tool integration into design practices.
arXiv Detail & Related papers (2024-10-10T13:39:27Z) - Diffusion-Based Visual Art Creation: A Survey and New Perspectives [51.522935314070416]
This survey explores the emerging realm of diffusion-based visual art creation, examining its development from both artistic and technical perspectives.
Our findings reveal how artistic requirements are transformed into technical challenges and highlight the design and application of diffusion-based methods within visual art creation.
We aim to shed light on the mechanisms through which AI systems emulate and possibly, enhance human capacities in artistic perception and creativity.
arXiv Detail & Related papers (2024-08-22T04:49:50Z) - Inspired by AI? A Novel Generative AI System To Assist Conceptual Automotive Design [6.001793288867721]
Design inspiration is crucial for establishing the direction of a design as well as evoking feelings and conveying meanings during the conceptual design process.
Many practice designers use text-based searches on platforms like Pinterest to gather image ideas, followed by sketching on paper or using digital tools to develop concepts.
Emerging generative AI techniques, such as diffusion models, offer a promising avenue to streamline these processes by swiftly generating design concepts based on text and image inspiration inputs.
arXiv Detail & Related papers (2024-06-06T17:04:14Z) - I-Design: Personalized LLM Interior Designer [57.00412237555167]
I-Design is a personalized interior designer that allows users to generate and visualize their design goals through natural language communication.
I-Design starts with a team of large language model agents that engage in dialogues and logical reasoning with one another.
The final design is then constructed in 3D by retrieving and integrating assets from an existing object database.
arXiv Detail & Related papers (2024-04-03T16:17:53Z) - Geometric Deep Learning for Computer-Aided Design: A Survey [85.79012726689511]
This survey offers a comprehensive overview of learning-based methods in computer-aided design.
It includes similarity analysis and retrieval, 2D and 3D CAD model synthesis, and CAD generation from point clouds.
It provides a complete list of benchmark datasets and their characteristics, along with open-source codes that have propelled research in this domain.
arXiv Detail & Related papers (2024-02-27T17:11:35Z) - ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior
Constraints [56.824187892204314]
We present the task of creative text-to-image generation, where we seek to generate new members of a broad category.
We show that the creative generation problem can be formulated as an optimization process over the output space of the diffusion prior.
We incorporate a question-answering Vision-Language Model (VLM) that adaptively adds new constraints to the optimization problem, encouraging the model to discover increasingly more unique creations.
arXiv Detail & Related papers (2023-08-03T17:04:41Z) - Experiments on Generative AI-Powered Parametric Modeling and BIM for
Architectural Design [4.710049212041078]
The study experiments with the potential of ChatGPT and generative AI in 3D architectural design.
The framework provides architects with an intuitive and powerful method to convey design intent.
arXiv Detail & Related papers (2023-08-01T01:51:59Z) - Unsupervised Compositional Concepts Discovery with Text-to-Image
Generative Models [80.75258849913574]
In this paper, we consider the inverse problem -- given a collection of different images, can we discover the generative concepts that represent each image?
We present an unsupervised approach to discover generative concepts from a collection of images, disentangling different art styles in paintings, objects, and lighting from kitchen scenes, and discovering image classes given ImageNet images.
arXiv Detail & Related papers (2023-06-08T17:02:15Z) - Generative Transformers for Design Concept Generation [7.807713821263175]
This study explores the recent advance of the natural language generation (NLG) technique in the artificial intelligence (AI) field.
A novel approach utilizing the generative pre-trained transformer (GPT) is proposed to leverage the knowledge and reasoning from textual data.
Three concept generation tasks are defined to leverage different knowledge and reasoning: domain knowledge synthesis, problem-driven synthesis, and analogy-driven synthesis.
arXiv Detail & Related papers (2022-11-07T11:29:10Z) - Pathway to Future Symbiotic Creativity [76.20798455931603]
We propose a classification of the creative system with a hierarchy of 5 classes, showing the pathway of creativity evolving from a mimic-human artist to a Machine artist in its own right.
In art creation, it is necessary for machines to understand humans' mental states, including desires, appreciation, and emotions, humans also need to understand machines' creative capabilities and limitations.
We propose a novel framework for building future Machine artists, which comes with the philosophy that a human-compatible AI system should be based on the "human-in-the-loop" principle.
arXiv Detail & Related papers (2022-08-18T15:12:02Z) - Generative Pre-Trained Transformer for Design Concept Generation: An
Exploration [6.233117407988574]
This paper explores the uses of generative pre-trained transformers (GPT) for natural language design concept generation.
Our experiments involve the use of GPT-2 and GPT-3 for different creative reasonings in design tasks.
arXiv Detail & Related papers (2021-11-16T14:12:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.