Do Randomized Algorithms Improve the Efficiency of Minimal Learning Machine?
Linja, J., Hämäläinen, J., Nieminen, P., & Kärkkäinen, T. (2020). Do Randomized Algorithms Improve the Efficiency of Minimal Learning Machine?. Machine Learning and Knowledge Extraction, 2(4), 533-557. https://doi.org/10.3390/make2040029
Julkaistu sarjassa
Machine Learning and Knowledge ExtractionPäivämäärä
2020Tekijänoikeudet
© 2020 by the authors. Licensee MDPI, Basel, Switzerland.
Minimal Learning Machine (MLM) is a recently popularized supervised learning method, which is composed of distance-regression and multilateration steps. The computational complexity of MLM is dominated by the solution of an ordinary least-squares problem. Several different solvers can be applied to the resulting linear problem. In this paper, a thorough comparison of possible and recently proposed, especially randomized, algorithms is carried out for this problem with a representative set of regression datasets. In addition, we compare MLM with shallow and deep feedforward neural network models and study the effects of the number of observations and the number of features with a special dataset. To our knowledge, this is the first time that both scalability and accuracy of such a distance-regression model are being compared to this extent. We expect our results to be useful on shedding light on the capabilities of MLM and in assessing what solution algorithms can improve the efficiency of MLM. We conclude that (i) randomized solvers are an attractive option when the computing time or resources are limited and (ii) MLM can be used as an out-of-the-box tool especially for high-dimensional problems.
...
Julkaisija
MDPI AGISSN Hae Julkaisufoorumista
2504-4990Asiasanat
Julkaisuun liittyvä(t) tutkimusaineisto(t)
Linja, Joakim; Hämäläinen, Joonas; Kärkkäinen, Tommi; Nieminen, Paavo. (2020). Au38Q MBTR-K3. V. 11.11.2020. Zenodo. https://doi.org/10.5281/zenodo.4268064.Julkaisu tutkimustietojärjestelmässä
https://converis.jyu.fi/converis/portal/detail/Publication/47038065
Metadata
Näytä kaikki kuvailutiedotKokoelmat
Rahoittaja(t)
Suomen AkatemiaRahoitusohjelmat(t)
Akatemiaohjelma, SA; Profilointi, SALisätietoja rahoituksesta
This work was supported by the Academy of Finland from the projects 311877 (Demo) and 315550 (HNP-AI).Lisenssi
Samankaltainen aineisto
Näytetään aineistoja, joilla on samankaltainen nimeke tai asiasanat.
-
Problem Transformation Methods with Distance-Based Learning for Multi-Target Regression
Hämäläinen, Joonas; Kärkkäinen, Tommi (ESANN, 2020)Multi-target regression is a special subset of supervised machine learning problems. Problem transformation methods are used in the field to improve the performance of basic methods. The purpose of this article is to test ... -
Comparing the forecasting performance of logistic regression and random forest models in criminal recidivism
Aaltonen, Olli-Pekka (2016)Rikosseuraamusalalla on viime vuosina kehitetty uusintarikollisuutta ennustavia malleja (Tyni, 2015), jotka perustuvat tyypillisesti rekisteripohjaisiin mittareihin, jotka mittaavat mm. tuomitun sukupuolta, ikää, rikostaustaa ... -
Extreme minimal learning machine : Ridge regression with distance-based basis
Kärkkäinen, Tommi (Elsevier BV, 2019)The extreme learning machine (ELM) and the minimal learning machine (MLM) are nonlinear and scalable machine learning techniques with a randomly generated basis. Both techniques start with a step in which a matrix of weights ... -
Feature selection for distance-based regression : An umbrella review and a one-shot wrapper
Linja, Joakim; Hämäläinen, Joonas; Nieminen, Paavo; Kärkkäinen, Tommi (Elsevier, 2023)Feature selection (FS) may improve the performance, cost-efficiency, and understandability of supervised machine learning models. In this paper, FS for the recently introduced distance-based supervised machine learning ... -
Intelligent solutions for real-life data-driven applications
Ivannikova, Elena (University of Jyväskylä, 2017)The subject of this thesis belongs to the topic of machine learning or, specifically, to the development of advanced methods for regression analysis, clustering, and anomaly detection. Industry is constantly seeking ...
Ellei toisin mainittu, julkisesti saatavilla olevia JYX-metatietoja (poislukien tiivistelmät) saa vapaasti uudelleenkäyttää CC0-lisenssillä.