Trash to Treasure: Using text-to-image models to inform the design of
physical artefacts
- URL: http://arxiv.org/abs/2302.00561v1
- Date: Wed, 1 Feb 2023 16:26:34 GMT
- Title: Trash to Treasure: Using text-to-image models to inform the design of
physical artefacts
- Authors: Amy Smith, Hope Schroeder, Ziv Epstein, Michael Cook, Simon Colton,
Andrew Lippman
- Abstract summary: We conduct a pilot study to investigate if and how text-to-image models can be used to assist in upstream tasks within the creative process.
Thirty participants selected sculpture-making materials and generated three images using the Stable Diffusion text-to-image generator.
The majority of participants reported that the generated images informed their sculptures, and 28/30 reported interest in using text-to-image models to help them in a creative task in the future.
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Text-to-image generative models have recently exploded in popularity and
accessibility. Yet so far, use of these models in creative tasks that bridge
the 2D digital world and the creation of physical artefacts has been
understudied. We conduct a pilot study to investigate if and how text-to-image
models can be used to assist in upstream tasks within the creative process,
such as ideation and visualization, prior to a sculpture-making activity.
Thirty participants selected sculpture-making materials and generated three
images using the Stable Diffusion text-to-image generator, each with text
prompts of their choice, with the aim of informing and then creating a physical
sculpture. The majority of participants (23/30) reported that the generated
images informed their sculptures, and 28/30 reported interest in using
text-to-image models to help them in a creative task in the future. We identify
several prompt engineering strategies and find that a participant's prompting
strategy relates to their stage in the creative process. We discuss how our
findings can inform support for users at different stages of the design process
and for using text-to-image models for physical artefact design.
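
For context, the image-generation step participants performed can be approximated with the open-source Stable Diffusion pipeline. The sketch below is a minimal example using the Hugging Face diffusers library; the checkpoint, sampler settings, and example prompts are assumptions for illustration, not the exact interface or configuration used in the study.

```python
# Minimal sketch of the participant workflow: three text-to-image generations
# from free-form prompts. Assumes the `diffusers` package and a CUDA GPU;
# the checkpoint and settings are illustrative, not the study's setup.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # assumed checkpoint
    torch_dtype=torch.float16,
)
pipe = pipe.to("cuda")

# Hypothetical participant prompts describing a sculpture made from chosen materials.
prompts = [
    "a small sculpture made of driftwood and copper wire, studio photo",
    "an abstract cardboard bird, soft lighting",
    "a bottle-cap mosaic of a fish, top-down view",
]

for i, prompt in enumerate(prompts):
    image = pipe(prompt, num_inference_steps=30, guidance_scale=7.5).images[0]
    image.save(f"idea_{i}.png")  # reference image to inform the physical sculpture
```
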
Related papers
- KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities
We conduct a systematic study on the fidelity of entities in text-to-image generation models.
We focus on their ability to generate a wide range of real-world visual entities, such as landmark buildings, aircraft, plants, and animals.
Our findings reveal that even the most advanced text-to-image models often fail to generate entities with accurate visual details.
arXiv Detail & Related papers (2024-10-15T17:50:37Z)
- Artworks Reimagined: Exploring Human-AI Co-Creation through Body Prompting
This article explores body prompting as input for image generation using generative artificial intelligence.
We implement this concept in an interactive art installation, Artworks Reimagined, designed to transform artworks via body prompting.
We identify three distinct patterns of embodied interaction with the generative AI and present insights into participants' experience of body prompting and AI co-creation.
arXiv Detail & Related papers (2024-08-10T08:05:59Z)
- ORACLE: Leveraging Mutual Information for Consistent Character Generation with LoRAs in Diffusion Models
We introduce a novel framework designed to produce consistent character representations from a single text prompt.
Our framework outperforms existing methods in generating characters with consistent visual identities.
arXiv Detail & Related papers (2024-06-04T23:39:08Z)
- The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
We propose a fully automated solution for consistent character generation with the sole input being a text prompt.
Our method strikes a better balance between prompt alignment and identity consistency compared to the baseline methods.
arXiv Detail & Related papers (2023-11-16T18:59:51Z)
- Impressions: Understanding Visual Semiotics and Aesthetic Impact
We present Impressions, a novel dataset through which to investigate the semiotics of images.
We show that existing multimodal image captioning and conditional generation models struggle to simulate plausible human responses to images.
This dataset significantly improves their ability to model impressions and aesthetic evaluations of images through fine-tuning and few-shot adaptation.
arXiv Detail & Related papers (2023-10-27T04:30:18Z)
- InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists
We develop a unified language interface for computer vision tasks that abstracts away task-specific design choices.
Our model, dubbed InstructCV, performs competitively compared to other generalist and task-specific vision models.
arXiv Detail & Related papers (2023-09-30T14:26:43Z)
- Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models
Text-to-image diffusion models can generate high-quality pictures from textual input prompts.
These models have been trained using text data collected from content-based labelling protocols.
We characterise the sentimentality, objectiveness and degree of abstraction of publicly available text data used to train current text-to-image diffusion models.
arXiv Detail & Related papers (2022-10-19T14:20:05Z)
- A Taxonomy of Prompt Modifiers for Text-To-Image Generation
This paper identifies six types of prompt modifiers used by practitioners in the online community, based on a three-month ethnographic study.
The novel taxonomy of prompt modifiers provides researchers with a conceptual starting point for investigating the practice of text-to-image generation.
We discuss research opportunities for this novel creative practice in the field of Human-Computer Interaction.
arXiv Detail & Related papers (2022-04-20T06:15:50Z)
- A Framework and Dataset for Abstract Art Generation via CalligraphyGAN
We present a creative framework based on Conditional Generative Adversarial Networks and a Contextual Neural Language Model to generate abstract artworks.
Our work is inspired by Chinese calligraphy, which is a unique form of visual art where the character itself is an aesthetic painting.
arXiv Detail & Related papers (2020-12-02T16:24:20Z)
- Modeling Artistic Workflows for Image Generation and Editing
We propose a generative model that follows a given artistic workflow.
It enables both multi-stage image generation and multi-stage editing of an existing piece of art.
arXiv Detail & Related papers (2020-07-14T17:54:26Z)
- Words as Art Materials: Generating Paintings with Sequential GANs
We investigate the generation of artistic images on a high-variance dataset.
This dataset includes images with variations, for example, in shape, color, and content.
We propose a sequential Generative Adversarial Network model.
arXiv Detail & Related papers (2020-07-08T19:17:14Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of the information it provides and is not responsible for any consequences arising from its use.