Do Randomized Algorithms Improve the Efficiency of Minimal Learning Machine?
Linja, J., Hämäläinen, J., Nieminen, P., & Kärkkäinen, T. (2020). Do Randomized Algorithms Improve the Efficiency of Minimal Learning Machine?. Machine Learning and Knowledge Extraction, 2(4), 533-557. https://doi.org/10.3390/make2040029
Published in
Machine Learning and Knowledge ExtractionDate
2020Copyright
© 2020 by the authors. Licensee MDPI, Basel, Switzerland.
Minimal Learning Machine (MLM) is a recently popularized supervised learning method, which is composed of distance-regression and multilateration steps. The computational complexity of MLM is dominated by the solution of an ordinary least-squares problem. Several different solvers can be applied to the resulting linear problem. In this paper, a thorough comparison of possible and recently proposed, especially randomized, algorithms is carried out for this problem with a representative set of regression datasets. In addition, we compare MLM with shallow and deep feedforward neural network models and study the effects of the number of observations and the number of features with a special dataset. To our knowledge, this is the first time that both scalability and accuracy of such a distance-regression model are being compared to this extent. We expect our results to be useful on shedding light on the capabilities of MLM and in assessing what solution algorithms can improve the efficiency of MLM. We conclude that (i) randomized solvers are an attractive option when the computing time or resources are limited and (ii) MLM can be used as an out-of-the-box tool especially for high-dimensional problems.
...
Publisher
MDPI AGISSN Search the Publication Forum
2504-4990Keywords
Dataset(s) related to the publication
Linja, Joakim; Hämäläinen, Joonas; Kärkkäinen, Tommi; Nieminen, Paavo. (2020). Au38Q MBTR-K3. V. 11.11.2020. Zenodo. https://doi.org/10.5281/zenodo.4268064.Publication in research information system
https://converis.jyu.fi/converis/portal/detail/Publication/47038065
Metadata
Show full item recordCollections
Related funder(s)
Research Council of FinlandFunding program(s)
Academy Programme, AoF; Research profiles, AoFAdditional information about funding
This work was supported by the Academy of Finland from the projects 311877 (Demo) and 315550 (HNP-AI).License
Related items
Showing items with similar title or keywords.
-
Problem Transformation Methods with Distance-Based Learning for Multi-Target Regression
Hämäläinen, Joonas; Kärkkäinen, Tommi (ESANN, 2020)Multi-target regression is a special subset of supervised machine learning problems. Problem transformation methods are used in the field to improve the performance of basic methods. The purpose of this article is to test ... -
Comparing the forecasting performance of logistic regression and random forest models in criminal recidivism
Aaltonen, Olli-Pekka (2016)Rikosseuraamusalalla on viime vuosina kehitetty uusintarikollisuutta ennustavia malleja (Tyni, 2015), jotka perustuvat tyypillisesti rekisteripohjaisiin mittareihin, jotka mittaavat mm. tuomitun sukupuolta, ikää, rikostaustaa ... -
Extreme minimal learning machine : Ridge regression with distance-based basis
Kärkkäinen, Tommi (Elsevier BV, 2019)The extreme learning machine (ELM) and the minimal learning machine (MLM) are nonlinear and scalable machine learning techniques with a randomly generated basis. Both techniques start with a step in which a matrix of weights ... -
Feature selection for distance-based regression : An umbrella review and a one-shot wrapper
Linja, Joakim; Hämäläinen, Joonas; Nieminen, Paavo; Kärkkäinen, Tommi (Elsevier, 2023)Feature selection (FS) may improve the performance, cost-efficiency, and understandability of supervised machine learning models. In this paper, FS for the recently introduced distance-based supervised machine learning ... -
Intelligent solutions for real-life data-driven applications
Ivannikova, Elena (University of Jyväskylä, 2017)The subject of this thesis belongs to the topic of machine learning or, specifically, to the development of advanced methods for regression analysis, clustering, and anomaly detection. Industry is constantly seeking ...