A taxonomy of prompt modifiers for text-to-image generation
Oppenländer, J. (2023). A taxonomy of prompt modifiers for text-to-image generation. Behaviour and Information Technology, Early online. https://doi.org/10.1080/0144929X.2023.2286532
Julkaistu sarjassa
Behaviour and Information TechnologyTekijät
Päivämäärä
2023Tekijänoikeudet
© 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group
Text-guided synthesis of images has become enormously popular and online communities dedicated to text-to-image generation and art generated with Artificial Intelligence (AI) have emerged. While deep generative models can synthesise high-quality images and artworks from simple descriptive text prompts, practitioners of text-to-image generation typically seek to control the generative model’s output by adding short key phrases (‘modifiers’) to the prompt. This paper identifies six types of prompt modifiers used by practitioners in the online text-to-image community based on a 3-month ethnographic study. The novel taxonomy of prompt modifiers provides researchers a conceptual starting point for investigating the practice of text-to-image generation, but may also help practitioners of AI generated art improve their images. We further outline how prompt modifiers are applied in the practice of ‘prompt engineering.’ and discuss research opportunities of this novel creative practice in the field of Human–Computer Interaction (HCI). The paper concludes with a discussion of broader implications of prompt engineering from the perspective of Human-AI Interaction (HAI) in future applications beyond the use case of text-to-image generation and AI generated art.
...
Julkaisija
Taylor & FrancisISSN Hae Julkaisufoorumista
0144-929XAsiasanat
Julkaisu tutkimustietojärjestelmässä
https://converis.jyu.fi/converis/portal/detail/Publication/197187107
Metadata
Näytä kaikki kuvailutiedotKokoelmat
Lisenssi
Samankaltainen aineisto
Näytetään aineistoja, joilla on samankaltainen nimeke tai asiasanat.
-
On the Human-AI Metaphorical Interplay for Culturally Sensitive Generative AI Design in Music Co-Creation
Correia, António (RWTH Aachen, 2024) -
And Justice for Art(ists) : Metaphorical Design as a Method for Creating Culturally Diverse Human-AI Music Composition Experiences
Correia, António; Schneider, Daniel; Fonseca, Benjamim; Mohseni, Hesam; Kujala, Tuomo; Kärkkäinen, Tommi (IEEE, 2024)This study discusses the intricate relations between generative artificial intelligence (AI) and music composers. Based on a previous rapid review of recent literature, it reinforces a gap and suggests the need to develop ... -
Human-AI collaboration and the future of education
Häkkinen, Päivi; Näykki, Piia; Pijeira-Díaz, Héctor; Channa, Faisal (University of Cambridge, 2024)Artificial Intelligence (AI), including generative AI (GenAI), is rapidly transforming educational settings in many ways. To succeed in our today’s society, learners need to combine expertise and ideas, solve problems and ... -
Reflections on the human role in AI policy formulations : how do national AI strategies view people?
Salo-Pöntinen, Henrikki; Saariluoma, Pertti (Springer Science and Business Media LLC, 2022)Purpose There is no artificial intelligence (AI) without people. People design and develop AI; they modify and use it and they have to reorganize the ways they have carried out tasks in their work and everyday life. ... -
Computational Rationality as a Theory of Interaction
Oulasvirta, Antti; Jokinen, Jussi P. P.; Howes, Andrew (ACM, 2022)How do people interact with computers? This fundamental question was asked by Card, Moran, and Newell in 1983 with a proposition to frame it as a question about human cognition – in other words, as a matter of how information ...
Ellei toisin mainittu, julkisesti saatavilla olevia JYX-metatietoja (poislukien tiivistelmät) saa vapaasti uudelleenkäyttää CC0-lisenssillä.