dc.contributor.author | Terziyan, Vagan | |
dc.contributor.author | Vitko, Oleksandra | |
dc.contributor.editor | Longo, Francesco | |
dc.contributor.editor | Affenzeller, Michael | |
dc.contributor.editor | Padovano, Antonio | |
dc.contributor.editor | Weiming, Shen | |
dc.date.accessioned | 2023-01-19T12:31:29Z | |
dc.date.available | 2023-01-19T12:31:29Z | |
dc.date.issued | 2023 | |
dc.identifier.citation | Terziyan, V., & Vitko, O. (2023). Causality-Aware Convolutional Neural Networks for Advanced Image Classification and Generation. In F. Longo, M. Affenzeller, A. Padovano, & S. Weiming (Eds.), 4th International Conference on Industry 4.0 and Smart Manufacturing (pp. 495-506). Elsevier. Procedia Computer Science, 217. https://doi.org/10.1016/j.procs.2022.12.245 | |
dc.identifier.other | CONVID_172578730 | |
dc.identifier.uri | https://jyx.jyu.fi/handle/123456789/85106 | |
dc.description.abstract | Smart manufacturing uses emerging deep learning models, particularly Convolutional Neural Networks (CNNs) and Generative Adversarial Networks (GANs), for a range of image-based industrial diagnostics tasks, e.g., classification, detection, recognition, prediction, synthetic data generation, and security. Although they are efficient at these tasks, the majority of current deep learning models lack interpretability and explainability. They can discover features hidden within the input data, together with the co-occurrence of those features. However, they are weak at discovering and making explicit the hidden causalities between the features, which could be the reason behind a particular diagnosis. In this paper, we suggest Causality-Aware CNNs (CA-CNNs) and Causality-Aware GANs (CA-GANs) to address the issue of learning hidden causalities within images. The core architecture includes an additional layer of neurons (after the last convolution-pooling layer and just before the dense layers), which learns pairwise conditional probabilities (aka causality estimates) for the features. Computations for these neurons are driven by the adaptive Lehmer mean function. Learned causalities are merged with the features during flattening and (via fully connected layers) influence the classification outcomes. Such causality estimates can also be computed for mixed inputs in which images are combined with other data. We argue that CA-CNNs not only improve the classification performance of conventional CNNs but also open additional opportunities for the explainability of the models’ outcomes. An additional advantage of CA-CNNs, when used as the discriminator within CA-GANs, is the possibility of generating realistic-looking images that respect the causalities.
See presentation slides: https://ai.it.jyu.fi/ISM-2022-Causality.pptx | en |
dc.format.extent | 1954 | |
dc.format.mimetype | application/pdf | |
dc.language.iso | eng | |
dc.publisher | Elsevier | |
dc.relation.ispartof | 4th International Conference on Industry 4.0 and Smart Manufacturing | |
dc.relation.ispartofseries | Procedia Computer Science | |
dc.rights | CC BY-NC-ND 4.0 | |
dc.subject.other | causal discovery | |
dc.subject.other | causal inference | |
dc.subject.other | image processing | |
dc.subject.other | Convolutional Neural Network | |
dc.subject.other | Generative Adversarial Network | |
dc.title | Causality-Aware Convolutional Neural Networks for Advanced Image Classification and Generation | |
dc.type | conferenceObject | |
dc.identifier.urn | URN:NBN:fi:jyu-202301191406 | |
dc.contributor.laitos | Informaatioteknologian tiedekunta | fi |
dc.contributor.laitos | Faculty of Information Technology | en |
dc.contributor.oppiaine | Collective Intelligence | fi |
dc.contributor.oppiaine | Tekniikka | fi |
dc.contributor.oppiaine | Collective Intelligence | en |
dc.contributor.oppiaine | Engineering | en |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | |
dc.type.coar | http://purl.org/coar/resource_type/c_5794 | |
dc.description.reviewstatus | peerReviewed | |
dc.format.pagerange | 495-506 | |
dc.relation.issn | 1877-0509 | |
dc.type.version | publishedVersion | |
dc.rights.copyright | © 2022 The Authors. Published by Elsevier B.V. | |
dc.rights.accesslevel | openAccess | fi |
dc.relation.conference | International Conference on Industry 4.0 and Smart Manufacturing | |
dc.subject.yso | neural networks | |
dc.subject.yso | deep learning | |
dc.subject.yso | classification (activity) | |
dc.subject.yso | manufacturing engineering | |
dc.subject.yso | machine learning | |
dc.subject.yso | computer vision | |
dc.subject.yso | causality | |
dc.subject.yso | inference | |
dc.format.content | fulltext | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p7292 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p39324 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p12668 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p22012 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p21846 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p2618 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p333 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p5902 | |
dc.rights.url | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
dc.relation.doi | 10.1016/j.procs.2022.12.245 | |
dc.type.okm | A4 | |
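
The abstract states that the causality-estimating neurons are driven by the adaptive Lehmer mean function. As a minimal sketch of the standard Lehmer mean only, L_p(x) = sum(x_i^p) / sum(x_i^(p-1)) for positive x_i, the block below shows how the single parameter p moves the result between the minimum, harmonic mean, arithmetic mean, contraharmonic mean and maximum. The function name `lehmer_mean` and the sample activations are illustrative assumptions. How the paper makes p adaptive, and how the mean is wired into the CA-CNN layer, is not described in this record and is not reproduced here.

```python
def lehmer_mean(values, p):
    """Standard Lehmer mean of positive numbers: sum(x**p) / sum(x**(p - 1)).

    Illustrative helper only (hypothetical name); it is not the layer
    defined in the paper, just the textbook formula the abstract refers to.
    """
    values = list(values)
    if not values:
        raise ValueError("values must be non-empty")
    if any(v <= 0 for v in values):
        raise ValueError("the Lehmer mean is defined here for positive values only")
    numerator = sum(v ** p for v in values)
    denominator = sum(v ** (p - 1) for v in values)
    return numerator / denominator


# Hypothetical positive feature activations, chosen for easy arithmetic.
activations = [1.0, 2.0, 3.0, 4.0]

harmonic = lehmer_mean(activations, 0)        # p = 0: harmonic mean
arithmetic = lehmer_mean(activations, 1)      # p = 1: arithmetic mean
contraharmonic = lehmer_mean(activations, 2)  # p = 2: contraharmonic mean
near_max = lehmer_mean(activations, 50)       # large p approaches max(values)
near_min = lehmer_mean(activations, -50)      # very negative p approaches min(values)
```

Because p is a single continuous parameter, treating it as trainable is one plausible reading of "adaptive": the same neuron could then behave more like a minimum, an average or a maximum depending on the value it settles on. That reading is an assumption about the abstract, not a statement of the authors' method.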