A taxonomy of prompt modifiers for text-to-image generation

Oppenländer, Jonas

doi:10.1080/0144929X.2023.2286532

dc.contributor.author	Oppenländer, Jonas
dc.date.accessioned	2023-12-19T09:25:30Z
dc.date.available	2023-12-19T09:25:30Z
dc.date.issued	2023
dc.identifier.citation	Oppenländer, J. (2023). A taxonomy of prompt modifiers for text-to-image generation. <i>Behaviour and Information Technology</i>, <i>Early online</i>. <a href="https://doi.org/10.1080/0144929X.2023.2286532" target="_blank">https://doi.org/10.1080/0144929X.2023.2286532</a>
dc.identifier.other	CONVID_197187107
dc.identifier.uri	https://jyx.jyu.fi/handle/123456789/92405
dc.description.abstract	Text-guided synthesis of images has become enormously popular and online communities dedicated to text-to-image generation and art generated with Artificial Intelligence (AI) have emerged. While deep generative models can synthesise high-quality images and artworks from simple descriptive text prompts, practitioners of text-to-image generation typically seek to control the generative model’s output by adding short key phrases (‘modifiers’) to the prompt. This paper identifies six types of prompt modifiers used by practitioners in the online text-to-image community based on a 3-month ethnographic study. The novel taxonomy of prompt modifiers provides researchers a conceptual starting point for investigating the practice of text-to-image generation, but may also help practitioners of AI generated art improve their images. We further outline how prompt modifiers are applied in the practice of ‘prompt engineering.’ and discuss research opportunities of this novel creative practice in the field of Human–Computer Interaction (HCI). The paper concludes with a discussion of broader implications of prompt engineering from the perspective of Human-AI Interaction (HAI) in future applications beyond the use case of text-to-image generation and AI generated art.	en
dc.format.mimetype	application/pdf
dc.language.iso	eng
dc.publisher	Taylor & Francis
dc.relation.ispartofseries	Behaviour and Information Technology
dc.rights	CC BY 4.0
dc.subject.other	prompt engineering
dc.subject.other	text-to-image generation
dc.subject.other	human-AI interaction
dc.subject.other	AI generated art
dc.title	A taxonomy of prompt modifiers for text-to-image generation
dc.type	research article
dc.identifier.urn	URN:NBN:fi:jyu-202312198400
dc.contributor.laitos	Informaatioteknologian tiedekunta	fi
dc.contributor.laitos	Faculty of Information Technology	en
dc.type.uri	http://purl.org/eprint/type/JournalArticle
dc.type.coar	http://purl.org/coar/resource_type/c_2df8fbb1
dc.description.reviewstatus	peerReviewed
dc.relation.issn	0144-929X
dc.relation.volume	Early online
dc.type.version	publishedVersion
dc.rights.copyright	© 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group
dc.rights.accesslevel	openAccess	fi
dc.type.publication	article
dc.subject.yso	ihmisen ja tietokoneen vuorovaikutus
dc.subject.yso	tekoäly
dc.format.content	fulltext
jyx.subject.uri	http://www.yso.fi/onto/yso/p38007
jyx.subject.uri	http://www.yso.fi/onto/yso/p2616
dc.rights.url	https://creativecommons.org/licenses/by/4.0/
dc.relation.doi	10.1080/0144929X.2023.2286532
dc.type.okm	A1

Aineistoon kuuluvat tiedostot

Nimi:: A taxonomy of prompt modifiers ...
Koko:: 2.548Mb
Tiedostomuoto:: PDF
Kuvaus:: publishedVersion

Katso/Avaa

Aineisto kuuluu seuraaviin kokoelmiin

Informaatioteknologian tiedekunta [2330]

Näytä suppeat kuvailutiedot

Ellei muuten mainita, aineiston lisenssi on CC BY 4.0

A taxonomy of prompt modifiers for text-to-image generation

Aineistoon kuuluvat tiedostot

Aineisto kuuluu seuraaviin kokoelmiin

Samankaltainen aineisto

On the Human-AI Metaphorical Interplay for Culturally Sensitive Generative AI Design in Music Co-Creation ﻿

And Justice for Art(ists) : Metaphorical Design as a Method for Creating Culturally Diverse Human-AI Music Composition Experiences ﻿

Human-AI collaboration and the future of education ﻿

Reflections on the human role in AI policy formulations : how do national AI strategies view people? ﻿

Computational Rationality as a Theory of Interaction ﻿

On the Human-AI Metaphorical Interplay for Culturally Sensitive Generative AI Design in Music Co-Creation

And Justice for Art(ists) : Metaphorical Design as a Method for Creating Culturally Diverse Human-AI Music Composition Experiences

Human-AI collaboration and the future of education

Reflections on the human role in AI policy formulations : how do national AI strategies view people?

Computational Rationality as a Theory of Interaction