Hyper-flexible Convolutional Neural Networks based on Generalized Lehmer and Power Means
Terziyan, V., Malyk, D., Golovianko, M., & Branytskyi, V. (2022). Hyper-flexible Convolutional Neural Networks based on Generalized Lehmer and Power Means. Neural Networks, 155, 177-203. https://doi.org/10.1016/j.neunet.2022.08.017
Julkaistu sarjassa
Neural NetworksPäivämäärä
2022Tekijänoikeudet
© 2022 The Author(s). Published by Elsevier Ltd.
Convolutional Neural Network is one of the famous members of the deep learning family of neural network architectures, which is used for many purposes, including image classification. In spite of the wide adoption, such networks are known to be highly tuned to the training data (samples representing a particular problem), and they are poorly reusable to address new problems. One way to change this would be, in addition to trainable weights, to apply trainable parameters of the mathematical functions, which simulate various neural computations within such networks. In this way, we may distinguish between the narrowly focused task-specific parameters (weights) and more generic capability-specific parameters. In this paper, we suggest a couple of flexible mathematical functions (Generalized Lehmer Mean and Generalized Power Mean) with trainable parameters to replace some fixed operations (such as ordinary arithmetic mean or simple weighted aggregation), which are traditionally used within various components of a convolutional neural network architecture. We named the overall architecture with such an update as a hyper-flexible convolutional neural network. We provide mathematical justification of various components of such architecture and experimentally show that it performs better than the traditional one, including better robustness regarding the adversarial perturbations of testing data.
...
Julkaisija
ElsevierISSN Hae Julkaisufoorumista
0893-6080Asiasanat
Julkaisuun liittyvä(t) tutkimusaineisto(t)
https://github.com/Adversarial-Intelligence-Group/flexnetsJulkaisu tutkimustietojärjestelmässä
https://converis.jyu.fi/converis/portal/detail/Publication/151802598
Metadata
Näytä kaikki kuvailutiedotKokoelmat
Lisenssi
Samankaltainen aineisto
Näytetään aineistoja, joilla on samankaltainen nimeke tai asiasanat.
-
Causality-Aware Convolutional Neural Networks for Advanced Image Classification and Generation
Terziyan, Vagan; Vitko, Oleksandra (Elsevier, 2023)Smart manufacturing uses emerging deep learning models, and particularly Convolutional Neural Networks (CNNs) and Generative Adversarial Networks (GANs), for different industrial diagnostics tasks, e.g., classification, ... -
Domain‐specific neural networks improve automated bird sound recognition already with small amount of local data
Lauha, Patrik; Somervuo, Panu; Lehikoinen, Petteri; Geres, Lisa; Richter, Tobias; Seibold, Sebastian; Ovaskainen, Otso (Wiley-Blackwell, 2022)An automatic bird sound recognition system is a useful tool for collecting data of different bird species for ecological analysis. Together with autonomous recording units (ARUs), such a system provides a possibility to ... -
Convolutional Neural Network Based Sleep Stage Classification with Class Imbalance
Xu, Qi; Zhou, Dongdong; Wang, Jian; Shen, Jiangrong; Kettunen, Lauri; Cong, Fengyu (IEEE, 2022)Accurate sleep stage classification is vital to assess sleep quality and diagnose sleep disorders. Numerous deep learning based models have been designed for accomplishing this labor automatically. However, the class ... -
Chlorophyll Concentration Retrieval by Training Convolutional Neural Network for Stochastic Model of Leaf Optical Properties (SLOP) Inversion
Annala, Leevi; Honkavaara, Eija; Tuominen, Sakari; Pölönen, Ilkka (MDPI AG, 2020)Miniaturized hyperspectral imaging techniques have developed rapidly in recent years and have become widely available for different applications. Combining calibrated hyperspectral imagery with inverse physically based ... -
Differentiating Malignant from Benign Pigmented or Non-Pigmented Skin Tumours : A Pilot Study on 3D Hyperspectral Imaging of Complex Skin Surfaces and Convolutional Neural Networks
Lindholm, Vivian; Raita-Hakola, Anna-Maria; Annala, Leevi; Salmivuori, Mari; Jeskanen, Leila; Saari, Heikki; Koskenmies, Sari; Pitkänen, Sari; Pölönen, Ilkka; Isoherranen, Kirsi; Ranki, Annamari (MDPI AG, 2022)Several optical imaging techniques have been developed to ease the burden of skin cancer disease on our health care system. Hyperspectral images can be used to identify biological tissues by their diffuse reflected spectra. ...
Ellei toisin mainittu, julkisesti saatavilla olevia JYX-metatietoja (poislukien tiivistelmät) saa vapaasti uudelleenkäyttää CC0-lisenssillä.