dc.contributor.author | Zhang, Yindong | |
dc.contributor.author | Khriyenko, Oleksiy | |
dc.contributor.editor | Balandin, Sergey | |
dc.contributor.editor | Deart, Vladimir | |
dc.contributor.editor | Tyutina, Tatiana | |
dc.date.accessioned | 2021-02-16T12:51:28Z | |
dc.date.available | 2021-02-16T12:51:28Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Zhang, Y., & Khriyenko, O. (2021). Zero-shot Semantic Segmentation using Relation Network. In S. Balandin, V. Deart, & T. Tyutina (Eds.), <i>FRUCT '28 : Proceedings of the 28th Conference of Open Innovations Association FRUCT</i> (pp. 516-527). FRUCT Oy. Proceedings of Conference of Open Innovations Association FRUCT. <a href="https://doi.org/10.23919/FRUCT50888.2021.9347619" target="_blank">https://doi.org/10.23919/FRUCT50888.2021.9347619</a> | |
dc.identifier.other | CONVID_51498043 | |
dc.identifier.uri | https://jyx.jyu.fi/handle/123456789/74258 | |
dc.description.abstract | Zero-shot learning (ZSL) is widely studied in recent years to solve the problem of lacking annotations. Currently, most studies on ZSL are for image classification and object detection. But, zero-shot semantic segmentation, pixel level classification, is still at its early stage. Therefore, this work proposes to extend a zero-shot image classification model, Relation Network (RN), to semantic segmentation tasks. We modified the structure of RN based on other state-of-the-arts semantic segmentation models (i.e. U-Net and DeepLab) and utilizes word embeddings from Caltech-UCSD Birds 200-2011 attributes and natural language processing models (i.e. word2vec and fastText). Because meta-learning is limited to binary tasks, this work proposes to join multiple binary semantic segmentation pipelines for multi-class semantic segmentation. It is proved by experiments that RN could improve accuracy of U-Net with the help of semantic side information on binary semantic segmentation and it could also be applied on multi-class semantic segmentation with simpler structure than the baseline model, SPNet, but higher accuracy under ZSL setting. However, the capability of RN under generalized zero-shot learning (GZSL) setting still needs improvement. We also studied on how different word embeddings, network structures and data affect RN and what could be done to improve its results. | en |
dc.format.mimetype | application/pdf | |
dc.language.iso | eng | |
dc.publisher | FRUCT Oy | |
dc.relation.ispartof | FRUCT '28 : Proceedings of the 28th Conference of Open Innovations Association FRUCT | |
dc.relation.ispartofseries | Proceedings of Conference of Open Innovations Association FRUCT | |
dc.relation.uri | https://fruct.org/publications/fruct28/files/Zha.pdf | |
dc.rights | CC BY-ND 4.0 | |
dc.subject.other | deep learning | |
dc.subject.other | image segmentation | |
dc.subject.other | zero-shot semantic segmentation | |
dc.title | Zero-shot Semantic Segmentation using Relation Network | |
dc.type | conference paper | |
dc.identifier.urn | URN:NBN:fi:jyu-202102161673 | |
dc.contributor.laitos | Informaatioteknologian tiedekunta | fi |
dc.contributor.laitos | Faculty of Information Technology | en |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | |
dc.relation.isbn | 978-952-69244-4-1 | |
dc.type.coar | http://purl.org/coar/resource_type/c_5794 | |
dc.description.reviewstatus | peerReviewed | |
dc.format.pagerange | 516-527 | |
dc.relation.issn | 2305-7254 | |
dc.type.version | publishedVersion | |
dc.rights.copyright | © 2021 the Authors | |
dc.rights.accesslevel | openAccess | fi |
dc.type.publication | conferenceObject | |
dc.relation.conference | Conference of Open Innovations Association | |
dc.subject.yso | konenäkö | |
dc.subject.yso | koneoppiminen | |
dc.subject.yso | hahmontunnistus (tietotekniikka) | |
dc.format.content | fulltext | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p2618 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p21846 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p8266 | |
dc.rights.url | https://creativecommons.org/licenses/by-nd/4.0/ | |
dc.relation.doi | 10.23919/FRUCT50888.2021.9347619 | |
dc.type.okm | A4 | |