Study of various machine learning approaches to predict default behavior of a borrower based on transactional dataset
dc.contributor.advisor | Khriyenko, Oleksiy | |
dc.contributor.advisor | Karimova, Rahima | |
dc.contributor.advisor | Fredström, Ashkan | |
dc.contributor.author | Hossain, Mohammad Farhad | |
dc.date.accessioned | 2021-04-16T05:25:03Z | |
dc.date.available | 2021-04-16T05:25:03Z | |
dc.date.issued | 2021 | |
dc.identifier.uri | https://jyx.jyu.fi/handle/123456789/75073 | |
dc.description.abstract | Predicting ‘default’ behavior of borrowers is quite challenging and time consuming, although financial institutions require faster and more reliable decision on loan applications to survive in the competitive market. Availability of huge amount of data makes the work of current credit scoring system harder. To deal with such situation machine learning engineers are trying to build a system that can predict default behavior of a borrower by analyzing application and transaction data. In our current study we applied different machine learning models such as decision tree, logistic regression, gradient boosting, XGBoosting, support vector machine and KNeighbors on transactional dataset to find which model performed better. We also applied deep neural network on the datasets. To further extend the study, we created new features by using manual process and unsupervised machine learning to observe whether they boost the performance or not. In addition to that, we used feature selection to see how it affected the prediction. Due to small dataset, we achieved 70% ac-curacy with 72% AUC on aggregated dataset from Random Forest. The dataset created by using unsupervised machine learning showed 62% accuracy with 68% AUC value. Manually created ratio-based features and feature selection could not yield any significant difference in results. Deep learning also per-formed lower than others probably due to small dataset. | en |
dc.format.extent | 56 | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | |
dc.rights | In Copyright | en |
dc.subject.other | deep learning | |
dc.subject.other | credit scoring | |
dc.subject.other | transaction data | |
dc.subject.other | default behavior | |
dc.subject.other | loan application | |
dc.title | Study of various machine learning approaches to predict default behavior of a borrower based on transactional dataset | |
dc.type | master thesis | |
dc.identifier.urn | URN:NBN:fi:jyu-202104162383 | |
dc.type.ontasot | Pro gradu -tutkielma | fi |
dc.type.ontasot | Master’s thesis | en |
dc.contributor.tiedekunta | Informaatioteknologian tiedekunta | fi |
dc.contributor.tiedekunta | Faculty of Information Technology | en |
dc.contributor.laitos | Informaatioteknologia | fi |
dc.contributor.laitos | Information Technology | en |
dc.contributor.yliopisto | Jyväskylän yliopisto | fi |
dc.contributor.yliopisto | University of Jyväskylä | en |
dc.contributor.oppiaine | Tietotekniikka | fi |
dc.contributor.oppiaine | Mathematical Information Technology | en |
dc.type.coar | http://purl.org/coar/resource_type/c_bdcc | |
dc.rights.accesslevel | openAccess | |
dc.type.publication | masterThesis | |
dc.contributor.oppiainekoodi | 602 | |
dc.subject.yso | koneoppiminen | |
dc.subject.yso | rahoituslaitokset | |
dc.subject.yso | machine learning | |
dc.subject.yso | financial institutions | |
dc.format.content | fulltext | |
dc.rights.url | https://rightsstatements.org/page/InC/1.0/ | |
dc.type.okm | G2 |
Aineistoon kuuluvat tiedostot
Aineisto kuuluu seuraaviin kokoelmiin
-
Pro gradu -tutkielmat [29743]