References & Citations
Computer Science > Human-Computer Interaction
Title: The Creativity of Text-to-Image Generation
(Submitted on 13 May 2022 (v1), revised 31 Aug 2022 (this version, v2), latest version 31 Oct 2022 (v4))
Abstract: Text-to-image synthesis has made a giant leap towards becoming a mainstream phenomenon since 2021. With text-to-image systems, anybody can create digital images and artworks. This provokes the question of whether text-to-image art is creative. This paper expounds on the nature of human creativity involved in text-to-image art with a specific focus on the practice of "prompt engineering". The paper argues that the current product-centered view of creativity may fall short in the context of text-to-image generation. A case exemplifying this shortcoming is provided and the importance of online communities for the creative ecosystem of text-to-image art is highlighted. We provide a high-level summary of this online ecosystem drawing on Rhodes's conceptual model of creativity. We provide a discussion on the challenges for evaluating the creativity of text-to-image generation and discuss opportunities for research on text-to-image art in the field of Human-Computer Interaction (HCI).
Submission history
From: Jonas Oppenlaender [view email][v1] Fri, 13 May 2022 05:59:02 GMT (34787kb,D)
[v2] Wed, 31 Aug 2022 07:45:43 GMT (69554kb,D)
[v3] Fri, 28 Oct 2022 11:19:54 GMT (69154kb,D)
[v4] Mon, 31 Oct 2022 09:56:44 GMT (69061kb,D)
Link back to: arXiv, form interface, contact.