dc.contributor.author | Oppenländer, Jonas | |
dc.date.accessioned | 2023-12-19T09:25:30Z | |
dc.date.available | 2023-12-19T09:25:30Z | |
dc.date.issued | 2023 | |
dc.identifier.citation | Oppenländer, J. (2023). A taxonomy of prompt modifiers for text-to-image generation. <i>Behaviour and Information Technology</i>, <i>Early online</i>. <a href="https://doi.org/10.1080/0144929X.2023.2286532" target="_blank">https://doi.org/10.1080/0144929X.2023.2286532</a> | |
dc.identifier.other | CONVID_197187107 | |
dc.identifier.uri | https://jyx.jyu.fi/handle/123456789/92405 | |
dc.description.abstract | Text-guided synthesis of images has become enormously popular and online communities dedicated to text-to-image generation and art generated with Artificial Intelligence (AI) have emerged. While deep generative models can synthesise high-quality images and artworks from simple descriptive text prompts, practitioners of text-to-image generation typically seek to control the generative model’s output by adding short key phrases (‘modifiers’) to the prompt. This paper identifies six types of prompt modifiers used by practitioners in the online text-to-image community based on a 3-month ethnographic study. The novel taxonomy of prompt modifiers provides researchers a conceptual starting point for investigating the practice of text-to-image generation, but may also help practitioners of AI generated art improve their images. We further outline how prompt modifiers are applied in the practice of ‘prompt engineering.’ and discuss research opportunities of this novel creative practice in the field of Human–Computer Interaction (HCI). The paper concludes with a discussion of broader implications of prompt engineering from the perspective of Human-AI Interaction (HAI) in future applications beyond the use case of text-to-image generation and AI generated art. | en |
dc.format.mimetype | application/pdf | |
dc.language.iso | eng | |
dc.publisher | Taylor & Francis | |
dc.relation.ispartofseries | Behaviour and Information Technology | |
dc.rights | CC BY 4.0 | |
dc.subject.other | prompt engineering | |
dc.subject.other | text-to-image generation | |
dc.subject.other | human-AI interaction | |
dc.subject.other | AI generated art | |
dc.title | A taxonomy of prompt modifiers for text-to-image generation | |
dc.type | research article | |
dc.identifier.urn | URN:NBN:fi:jyu-202312198400 | |
dc.contributor.laitos | Informaatioteknologian tiedekunta | fi |
dc.contributor.laitos | Faculty of Information Technology | en |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | |
dc.type.coar | http://purl.org/coar/resource_type/c_2df8fbb1 | |
dc.description.reviewstatus | peerReviewed | |
dc.relation.issn | 0144-929X | |
dc.relation.volume | Early online | |
dc.type.version | publishedVersion | |
dc.rights.copyright | © 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group | |
dc.rights.accesslevel | openAccess | fi |
dc.type.publication | article | |
dc.subject.yso | ihmisen ja tietokoneen vuorovaikutus | |
dc.subject.yso | tekoäly | |
dc.format.content | fulltext | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p38007 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p2616 | |
dc.rights.url | https://creativecommons.org/licenses/by/4.0/ | |
dc.relation.doi | 10.1080/0144929X.2023.2286532 | |
dc.type.okm | A1 | |