Show simple item record

dc.contributor.authorOppenländer, Jonas
dc.date.accessioned2023-12-19T09:25:30Z
dc.date.available2023-12-19T09:25:30Z
dc.date.issued2023
dc.identifier.citationOppenländer, J. (2023). A taxonomy of prompt modifiers for text-to-image generation. <i>Behaviour and Information Technology</i>, <i>Early online</i>. <a href="https://doi.org/10.1080/0144929X.2023.2286532" target="_blank">https://doi.org/10.1080/0144929X.2023.2286532</a>
dc.identifier.otherCONVID_197187107
dc.identifier.urihttps://jyx.jyu.fi/handle/123456789/92405
dc.description.abstractText-guided synthesis of images has become enormously popular and online communities dedicated to text-to-image generation and art generated with Artificial Intelligence (AI) have emerged. While deep generative models can synthesise high-quality images and artworks from simple descriptive text prompts, practitioners of text-to-image generation typically seek to control the generative model’s output by adding short key phrases (‘modifiers’) to the prompt. This paper identifies six types of prompt modifiers used by practitioners in the online text-to-image community based on a 3-month ethnographic study. The novel taxonomy of prompt modifiers provides researchers a conceptual starting point for investigating the practice of text-to-image generation, but may also help practitioners of AI generated art improve their images. We further outline how prompt modifiers are applied in the practice of ‘prompt engineering.’ and discuss research opportunities of this novel creative practice in the field of Human–Computer Interaction (HCI). The paper concludes with a discussion of broader implications of prompt engineering from the perspective of Human-AI Interaction (HAI) in future applications beyond the use case of text-to-image generation and AI generated art.en
dc.format.mimetypeapplication/pdf
dc.language.isoeng
dc.publisherTaylor & Francis
dc.relation.ispartofseriesBehaviour and Information Technology
dc.rightsCC BY 4.0
dc.subject.otherprompt engineering
dc.subject.othertext-to-image generation
dc.subject.otherhuman-AI interaction
dc.subject.otherAI generated art
dc.titleA taxonomy of prompt modifiers for text-to-image generation
dc.typeresearch article
dc.identifier.urnURN:NBN:fi:jyu-202312198400
dc.contributor.laitosInformaatioteknologian tiedekuntafi
dc.contributor.laitosFaculty of Information Technologyen
dc.type.urihttp://purl.org/eprint/type/JournalArticle
dc.type.coarhttp://purl.org/coar/resource_type/c_2df8fbb1
dc.description.reviewstatuspeerReviewed
dc.relation.issn0144-929X
dc.relation.volumeEarly online
dc.type.versionpublishedVersion
dc.rights.copyright© 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group
dc.rights.accesslevelopenAccessfi
dc.type.publicationarticle
dc.subject.ysoihmisen ja tietokoneen vuorovaikutus
dc.subject.ysotekoäly
dc.format.contentfulltext
jyx.subject.urihttp://www.yso.fi/onto/yso/p38007
jyx.subject.urihttp://www.yso.fi/onto/yso/p2616
dc.rights.urlhttps://creativecommons.org/licenses/by/4.0/
dc.relation.doi10.1080/0144929X.2023.2286532
dc.type.okmA1


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

CC BY 4.0
Except where otherwise noted, this item's license is described as CC BY 4.0