dc.contributor.author | Linja, Joakim | |
dc.date.accessioned | 2023-04-03T08:10:50Z | |
dc.date.available | 2023-04-03T08:10:50Z | |
dc.date.issued | 2023 | |
dc.identifier.isbn | 978-951-39-9517-1 | |
dc.identifier.uri | https://jyx.jyu.fi/handle/123456789/86238 | |
dc.description.abstract | The rise of machine learning (ML) has revolutionized the usage of data. Researchers continue to develop new ways to use ML and find new targets to apply ML on. One of these areas of application is found in nanoscience. Nanoscience is a constantly expanding field with applications in almost every part of life, such as medicine, materials design, and consumer products. The experimental research of nanoscience is expensive, augmented by computational research. Computational research is, however, also resource-intensive and time-consuming due to the complexity of the simulation models. Machine learning promises to alleviate that strain. This work and the articles presented focus on a family of distance-based machine learning algorithms, Minimal Learning Machine (MLM), and Extreme Minimal Learning Machine (EMLM), in the context of computational nanoscience. Specifically in the context of monolayer protected nanoclusters (MPC).
The distance-based ML methods are studied as surrogates in feature selection and knowledge discovery. A set of benchmark, generated, and molecular dynamics-based datasets were used in the included articles. The performance of MLM was studied by using it as a surrogate, comparing it to other methods, and inspecting the effect of a solver on its function. EMLM was used as the ML model in feature selection and knowledge discovery. A set of scaling-focused benchmark datasets were developed based on the simulation data of Au<sub>38</sub>(SCH<sub>3</sub>)<sub>24</sub> MPC and a set of synthetic benchmark & development datasets were created to test the performance of a feature selection algorithm. A Mean Absolute Sensitivity (MAS) utilizing distance-based feature selection algorithm, Distance-based one-shot wrapper, was developed and then extended to Feature Importance Detector. An umbrella review was made to contextualize the one-shot wrapper to feature selection literature. The results prove the viability of distance-based ML methods in the context of computational nanoscience.
Keywords: Machine Learning, Distance–Based Regression, Nanoscience, MLM, EMLM, Hybrid Nanoparticles, Feature Selection, Knowledge discovery | en |
dc.format.mimetype | application/pdf | |
dc.language.iso | eng | |
dc.publisher | Jyväskylän yliopisto | |
dc.relation.ispartofseries | JYU dissertations | |
dc.relation.haspart | <b>Artikkeli I:</b> Pihlajamäki, A., Hämäläinen, J., Linja, J., Nieminen, P., Malola, S., Kärkkäinen, T., & Häkkinen, H. (2020). Monte Carlo Simulations of Au38(SCH3)24 Nanocluster Using Distance-Based Machine Learning Methods. <i>Journal of Physical Chemistry A, 124(23), 4827-4836.</i> DOI: <a href="https://doi.org/10.1021/acs.jpca.0c01512"target="_blank"> 10.1021/acs.jpca.0c01512</a>. JYX: <a href="https://jyx.jyu.fi/handle/123456789/69062"target="_blank"> jyx.jyu.fi/handle/123456789/69062</a> | |
dc.relation.haspart | <b>Artikkeli II:</b> Linja, J., Hämäläinen, J., Nieminen, P., & Kärkkäinen, T. (2020). Do Randomized Algorithms Improve the Efficiency of Minimal Learning Machine?. <i>Machine Learning and Knowledge Extraction, 2(4), 533-557.</i> DOI: <a href="https://doi.org/10.3390/make2040029"target="_blank"> 10.3390/make2040029</a> | |
dc.relation.haspart | <b>Artikkeli III:</b> Linja, J., Hämäläinen, J., Nieminen, P., & Kärkkäinen, T. (2023). Feature selection for distance-based regression : An umbrella review and a one-shot wrapper. <i>Neurocomputing, 518, 344-359. </i> DOI: <a href="https://doi.org/10.1016/j.neucom.2022.11.023"target="_blank"> 10.1016/j.neucom.2022.11.023</a> | |
dc.relation.haspart | <b>Artikkeli IV:</b> Linja, J., Hämäläinen, J., Pihlajamäki, A., Nieminen, P., Malola, S., Häkkinen, H., Kärkkäinen, T. Knowledge Discovery from
Atomic Structures using Feature Importances. <i>Manuscript.</i> | |
dc.rights | In Copyright | |
dc.title | Advancing nanomaterials design using novel machine learning methods | |
dc.type | Diss. | |
dc.identifier.urn | URN:ISBN:978-951-39-9517-1 | |
dc.contributor.tiedekunta | Faculty of Information Technology | en |
dc.contributor.tiedekunta | Informaatioteknologian tiedekunta | fi |
dc.contributor.yliopisto | University of Jyväskylä | en |
dc.contributor.yliopisto | Jyväskylän yliopisto | fi |
dc.relation.issn | 2489-9003 | |
dc.rights.copyright | © The Author & University of Jyväskylä | |
dc.rights.accesslevel | openAccess | |
dc.type.publication | doctoralThesis | |
dc.format.content | fulltext | |
dc.rights.url | https://rightsstatements.org/page/InC/1.0/ | |