Koneoppimistyökalujen toimintaperiaatteet

Lappalainen, Markus

dc.contributor.advisor	Taipalus, Toni
dc.contributor.author	Lappalainen, Markus
dc.date.accessioned	2023-06-01T06:53:26Z
dc.date.available	2023-06-01T06:53:26Z
dc.date.issued	2023
dc.identifier.uri	https://jyx.jyu.fi/handle/123456789/87381
dc.description.abstract	Koneoppimisalgoritmit ja neuroverkot ovat nykyään osa jokapäiväistä elämäämme, ja niiden tuoma kehitys on mullistanut yhteiskunnan montaa osa-aluetta. Nopean kehityksen vuoksi asiaan perehtymätön henkilö tuskin yleensä edes huomaa käyttävänsä koneoppimiseen perustuvia teknologioita. Koneoppiminen, ja varsinkin neuroverkot, ovat nykyään niin monimutkaisia, että niiden toimintaa voi olla vaikeaa, tai jopa mahdotonta ymmärtää. Tutkimus toteutettiin kuvailevana kirjallisuuskatsauksena. Tutkimuksessa annetaan ensin lyhyt yleiskatsaus koneoppimisesta lukijan ymmärryksen tueksi. Tutkimuksen päätarkoitus, sekä tutkimusongelma, oli selvittää koneoppimistyökalujen toimintaperiaatteet. Tutkimusongelmaa selvitettiin tutustumalla koneoppimistyö-kalujen toimintaan, etenkin niiden opetusvaiheeseen, jonka aikana luodaan edellytykset niiden päätöksenteolle. Tutkimuksessa selvitettiin joidenkin suosittujen koneoppimistyökalujen toimintaperiaatteita, sekä esitellään ne helppolukuisessa ja helposti ymmärrettävässä muodossa, ilman matemaattisia kaavoja. Koneoppimistyökalujen perustoimintaperiaatteeksi löydettiin virhefunktiot, ja niiden minimoiminen. Virhefunktiot esittävät koneoppimistyökalun ennusteen ja toteutuneen tapahtuman välistä eroa, joten virhefunktion minimoiminen on koneoppimisen ydintavoite. Keinot virhefunktioiden minimoimiseksi riippuu käsiteltävän koneoppimistyökalun piirteistä. Tarkasteltaessa valittujen koneoppimistyökalujen optimointiongelmia, paljastui yleisimmäksi keinoksi gradienttimenetelmään perustuvat iteratiiviset optimointialgoritmit. Tutkimuksen aikana löytyi myös muita optimointiongelmia, joita ei pystytä ratkaisemaan gradienttimenetelmällä. Tutkimuksen aikana selvisi myös koneoppimisen perustuvan vahvasti matematiikkaan, erityisesti lineaarialgebraan sekä derivointiin.	fi
dc.description.abstract	Machine learning algorithms and neural networks are a ubiquitous part of our everyday lives, and their recent development has revolutionized many areas of society. Due to the rapid pace of development, a person who is not familiar with the subject may not even realize that they are using technologies based on machine learning. Machine learning, especially neural networks, are now so complex that understanding the logic behind their decision can sometimes be impossible. The study was conducted as a descriptive literature review. The study begins by providing a brief overview of machine learning to support the reader's understanding. The main objective of the study, and the research question, was to clarify the basic principles of machine learning tools. This was done by examining the operation of machine learning tools, particularly their training phase, during which the conditions for their decision-making are established. The study examined the operating principles of some popular machine learning tools and presented them in an easy-to-understand form, without mathematical formulas. The basic operating principle of machine learning tools was found to be cost functions and their minimization. Cost functions measure the difference between the prediction of the machine learning tool and the actual outcome, so minimizing the cost function can be seen as the primary goal of machine learning. The method for minimizing a cost function depends on the characteristics of the machine learning tool being used. When examining the optimization methods of the studied machine learning tools, iterative optimization algorithms based on the gradient descent algorithm were found to be the most common approach. The study also identified other optimization problems that cannot be solved by the gradient descent algorithm. At the time of writing the study, it became clear that machine learning is heavily based on mathematics, especially linear algebra, and differentiation.	en
dc.format.extent	28
dc.language.iso	fi
dc.rights	In Copyright	en
dc.title	Koneoppimistyökalujen toimintaperiaatteet
dc.type	bachelor thesis
dc.identifier.urn	URN:NBN:fi:jyu-202306013436
dc.type.ontasot	Bachelor's thesis	en
dc.type.ontasot	Kandidaatintyö	fi
dc.contributor.tiedekunta	Informaatioteknologian tiedekunta	fi
dc.contributor.tiedekunta	Faculty of Information Technology	en
dc.contributor.laitos	Informaatioteknologia	fi
dc.contributor.laitos	Information Technology	en
dc.contributor.yliopisto	Jyväskylän yliopisto	fi
dc.contributor.yliopisto	University of Jyväskylä	en
dc.contributor.oppiaine	Tietojärjestelmätiede	fi
dc.contributor.oppiaine	Information Systems Science	en
dc.type.coar	http://purl.org/coar/resource_type/c_7a1f
dc.type.publication	bachelorThesis
dc.contributor.oppiainekoodi	601
dc.subject.yso	tekoäly
dc.subject.yso	neuroverkot
dc.subject.yso	koneoppiminen
dc.subject.yso	algoritmit
dc.rights.url	https://rightsstatements.org/page/InC/1.0/