Zero-shot Semantic Segmentation using Relation Network

Zhang, Yindong; Khriyenko, Oleksiy

doi:10.23919/FRUCT50888.2021.9347619

dc.contributor.author	Zhang, Yindong
dc.contributor.author	Khriyenko, Oleksiy
dc.contributor.editor	Balandin, Sergey
dc.contributor.editor	Deart, Vladimir
dc.contributor.editor	Tyutina, Tatiana
dc.date.accessioned	2021-02-16T12:51:28Z
dc.date.available	2021-02-16T12:51:28Z
dc.date.issued	2021
dc.identifier.citation	Zhang, Y., & Khriyenko, O. (2021). Zero-shot Semantic Segmentation using Relation Network. In S. Balandin, V. Deart, & T. Tyutina (Eds.), <i>FRUCT '28 : Proceedings of the 28th Conference of Open Innovations Association FRUCT</i> (pp. 516-527). FRUCT Oy. Proceedings of Conference of Open Innovations Association FRUCT. <a href="https://doi.org/10.23919/FRUCT50888.2021.9347619" target="_blank">https://doi.org/10.23919/FRUCT50888.2021.9347619</a>
dc.identifier.other	CONVID_51498043
dc.identifier.uri	https://jyx.jyu.fi/handle/123456789/74258
dc.description.abstract	Zero-shot learning (ZSL) is widely studied in recent years to solve the problem of lacking annotations. Currently, most studies on ZSL are for image classification and object detection. But, zero-shot semantic segmentation, pixel level classification, is still at its early stage. Therefore, this work proposes to extend a zero-shot image classification model, Relation Network (RN), to semantic segmentation tasks. We modified the structure of RN based on other state-of-the-arts semantic segmentation models (i.e. U-Net and DeepLab) and utilizes word embeddings from Caltech-UCSD Birds 200-2011 attributes and natural language processing models (i.e. word2vec and fastText). Because meta-learning is limited to binary tasks, this work proposes to join multiple binary semantic segmentation pipelines for multi-class semantic segmentation. It is proved by experiments that RN could improve accuracy of U-Net with the help of semantic side information on binary semantic segmentation and it could also be applied on multi-class semantic segmentation with simpler structure than the baseline model, SPNet, but higher accuracy under ZSL setting. However, the capability of RN under generalized zero-shot learning (GZSL) setting still needs improvement. We also studied on how different word embeddings, network structures and data affect RN and what could be done to improve its results.	en
dc.format.mimetype	application/pdf
dc.language.iso	eng
dc.publisher	FRUCT Oy
dc.relation.ispartof	FRUCT '28 : Proceedings of the 28th Conference of Open Innovations Association FRUCT
dc.relation.ispartofseries	Proceedings of Conference of Open Innovations Association FRUCT
dc.relation.uri	https://fruct.org/publications/fruct28/files/Zha.pdf
dc.rights	CC BY-ND 4.0
dc.subject.other	deep learning
dc.subject.other	image segmentation
dc.subject.other	zero-shot semantic segmentation
dc.title	Zero-shot Semantic Segmentation using Relation Network
dc.type	conference paper
dc.identifier.urn	URN:NBN:fi:jyu-202102161673
dc.contributor.laitos	Informaatioteknologian tiedekunta	fi
dc.contributor.laitos	Faculty of Information Technology	en
dc.type.uri	http://purl.org/eprint/type/ConferencePaper
dc.relation.isbn	978-952-69244-4-1
dc.type.coar	http://purl.org/coar/resource_type/c_5794
dc.description.reviewstatus	peerReviewed
dc.format.pagerange	516-527
dc.relation.issn	2305-7254
dc.type.version	publishedVersion
dc.rights.copyright	© 2021 the Authors
dc.rights.accesslevel	openAccess	fi
dc.type.publication	conferenceObject
dc.relation.conference	Conference of Open Innovations Association
dc.subject.yso	konenäkö
dc.subject.yso	koneoppiminen
dc.subject.yso	hahmontunnistus (tietotekniikka)
dc.format.content	fulltext
jyx.subject.uri	http://www.yso.fi/onto/yso/p2618
jyx.subject.uri	http://www.yso.fi/onto/yso/p21846
jyx.subject.uri	http://www.yso.fi/onto/yso/p8266
dc.rights.url	https://creativecommons.org/licenses/by-nd/4.0/
dc.relation.doi	10.23919/FRUCT50888.2021.9347619
dc.type.okm	A4

Aineistoon kuuluvat tiedostot

Nimi:: Zha.pdf
Koko:: 1.463Mb
Tiedostomuoto:: PDF
Kuvaus:: Publisher's PDF

Katso/Avaa

Aineisto kuuluu seuraaviin kokoelmiin

Informaatioteknologian tiedekunta [2328]

Näytä suppeat kuvailutiedot

Ellei muuten mainita, aineiston lisenssi on CC BY-ND 4.0

Zero-shot Semantic Segmentation using Relation Network

Aineistoon kuuluvat tiedostot

Aineisto kuuluu seuraaviin kokoelmiin

Samankaltainen aineisto

Zero-shot semantic segmentation using relation network ﻿

Emotions and Activity Recognition System Using Wearable Device Sensors ﻿

Automatic social distance estimation for photographic studies : Performance evaluation, test benchmark, and algorithm ﻿

BRIMA : Low-Overhead Browser-Only Image Annotation Tool ﻿

Mäkihypyn ponnistusvaiheen biomekaniikka hahmon asennon tunnistamiseen perustuvalla liikeanalyysillä ﻿

Zero-shot semantic segmentation using relation network

Emotions and Activity Recognition System Using Wearable Device Sensors

Automatic social distance estimation for photographic studies : Performance evaluation, test benchmark, and algorithm

BRIMA : Low-Overhead Browser-Only Image Annotation Tool

Mäkihypyn ponnistusvaiheen biomekaniikka hahmon asennon tunnistamiseen perustuvalla liikeanalyysillä