Curiosity-driven algorithm for reinforcement learning

Tsybulko, Vitalii

dc.contributor.advisor	Terziyan, Vagan
dc.contributor.author	Tsybulko, Vitalii
dc.date.accessioned	2019-05-29T06:28:43Z
dc.date.available	2019-05-29T06:28:43Z
dc.date.issued	2019
dc.identifier.uri	https://jyx.jyu.fi/handle/123456789/64268
dc.description.abstract	One problem of current Reinforcement Learning algorithms is finding a balance between exploitation of existing knowledge and exploration for a new experience. Curiosity exploration bonus has been proposed to address this problem, but current implementations are vulnerable to stochastic noise inside the environment. The new approach presented in this thesis utilises exploration bonus based on the predicted novelty of the next state. That protects exploration from noise issues during training. This work also introduces a new way of combining extrinsic and intrinsic rewards. Both improvements help to overcome a number of problems that Reinforcement Learning had until now.	en
dc.format.extent	63
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject.other	reinforcement learning
dc.subject.other	proximal policy optimisation
dc.subject.other	curiosity-driven exploration bonus
dc.title	Curiosity-driven algorithm for reinforcement learning
dc.identifier.urn	URN:NBN:fi:jyu-201905292863
dc.type.ontasot	Pro gradu -tutkielma	fi
dc.type.ontasot	Master’s thesis	en
dc.contributor.tiedekunta	Informaatioteknologian tiedekunta	fi
dc.contributor.tiedekunta	Faculty of Information Technology	en
dc.contributor.laitos	Informaatioteknologia	fi
dc.contributor.laitos	Information Technology	en
dc.contributor.yliopisto	Jyväskylän yliopisto	fi
dc.contributor.yliopisto	University of Jyväskylä	en
dc.contributor.oppiaine	Tietotekniikka	fi
dc.contributor.oppiaine	Mathematical Information Technology	en
dc.rights.copyright	Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.	fi
dc.rights.copyright	This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.	en
dc.type.publication	masterThesis
dc.contributor.oppiainekoodi	602
dc.subject.yso	tekoäly
dc.subject.yso	koneoppiminen
dc.subject.yso	palkitseminen
dc.subject.yso	artificial intelligence
dc.subject.yso	machine learning
dc.subject.yso	rewarding
dc.format.content	fulltext
dc.type.okm	G2

Aineistoon kuuluvat tiedostot

Nimi:: URN:NBN:fi:jyu-201905292863.pdf
Koko:: 4.506Mb
Tiedostomuoto:: PDF

Katso/Avaa

Aineisto kuuluu seuraaviin kokoelmiin

Pro gradu -tutkielmat [29589]

Näytä suppeat kuvailutiedot

Curiosity-driven algorithm for reinforcement learning

Aineistoon kuuluvat tiedostot

Aineisto kuuluu seuraaviin kokoelmiin

Samankaltainen aineisto

Evolutionary Algorithms and Metaheuristics : Applications in Engineering Design and Optimization ﻿

Strategic cyber threat intelligence : Building the situational picture with emerging technologies ﻿

On Attacking Future 5G Networks with Adversarial Examples : Survey ﻿

Practices and Infrastructures for Machine Learning Systems : An Interview Study in Finnish Organizations ﻿

On Assessing Vulnerabilities of the 5G Networks to Adversarial Examples ﻿

Evolutionary Algorithms and Metaheuristics : Applications in Engineering Design and Optimization

Strategic cyber threat intelligence : Building the situational picture with emerging technologies

On Attacking Future 5G Networks with Adversarial Examples : Survey

Practices and Infrastructures for Machine Learning Systems : An Interview Study in Finnish Organizations

On Assessing Vulnerabilities of the 5G Networks to Adversarial Examples