A taxonomy of prompt modifiers for text-to-image generation
Oppenländer, J. (2023). A taxonomy of prompt modifiers for text-to-image generation. Behaviour and Information Technology, Early online. https://doi.org/10.1080/0144929X.2023.2286532
Published in
Behaviour and Information Technology
Authors
Oppenländer, J.
Date
2023
Copyright
© 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group
Text-guided synthesis of images has become enormously popular, and online communities dedicated to text-to-image generation and art generated with Artificial Intelligence (AI) have emerged. While deep generative models can synthesise high-quality images and artworks from simple descriptive text prompts, practitioners of text-to-image generation typically seek to control the generative model’s output by adding short key phrases (‘modifiers’) to the prompt. This paper identifies six types of prompt modifiers used by practitioners in the online text-to-image community based on a 3-month ethnographic study. The novel taxonomy of prompt modifiers provides researchers with a conceptual starting point for investigating the practice of text-to-image generation, but may also help practitioners of AI-generated art improve their images. We further outline how prompt modifiers are applied in the practice of ‘prompt engineering’ and discuss research opportunities of this novel creative practice in the field of Human–Computer Interaction (HCI). The paper concludes with a discussion of broader implications of prompt engineering from the perspective of Human-AI Interaction (HAI) in future applications beyond the use case of text-to-image generation and AI-generated art.
...
Publisher
Taylor & Francis
ISSN
0144-929X
Publication in research information system
https://converis.jyu.fi/converis/portal/detail/Publication/197187107
Related items
Showing items with a similar title or keywords.
- Reflections on the human role in AI policy formulations : how do national AI strategies view people?
  Salo-Pöntinen, Henrikki; Saariluoma, Pertti (Springer Science and Business Media LLC, 2022). Purpose: There is no artificial intelligence (AI) without people. People design and develop AI; they modify and use it and they have to reorganize the ways they have carried out tasks in their work and everyday life. ...
- AnatomySketch : An Extensible Open-Source Software Platform for Medical Image Analysis Algorithm Development
  Zhuang, Mingrui; Chen, Zhonghua; Wang, Hongkai; Tang, Hong; He, Jiang; Qin, Bobo; Yang, Yuxin; Jin, Xiaoxian; Yu, Mengzhu; Jin, Baitao; Li, Taijing; Kettunen, Lauri (Springer, 2022). The development of medical image analysis algorithm is a complex process including the multiple sub-steps of model training, data visualization, human–computer interaction and graphical user interface (GUI) construction. ...
- Computational Rationality as a Theory of Interaction
  Oulasvirta, Antti; Jokinen, Jussi P. P.; Howes, Andrew (ACM, 2022). How do people interact with computers? This fundamental question was asked by Card, Moran, and Newell in 1983 with a proposition to frame it as a question about human cognition – in other words, as a matter of how information ...
- Perceptions and Realities of Text-to-Image Generation
  Oppenlaender, Jonas; Silvennoinen, Johanna; Paananen, Ville; Visuri, Aku (ACM, 2023). Generative artificial intelligence (AI) is a widely popular technology that will have a profound impact on society and individuals. Less than a decade ago, it was thought that creative work would be among the last to be ...
- Puhekaverina botti : viestivä tekoäly inhimillistettynä vuorovaikutuskumppanina [A bot as a conversation partner: communicative AI as a humanized interaction partner]
  Laaksonen, Salla-Maaria; Laitinen, Kaisa; Koivula, Minna; Sihvonen, Tanja (Lähikuva-yhdistys, 2020). Communicative AIs, that is, algorithms capable of conversation in natural language, are increasingly common interaction partners on various technological platforms. One typical form of communicative AI ...
Unless otherwise stated, publicly available JYX metadata (excluding abstracts) may be freely reused under the CC0 license.