Intelligent solutions for real-life data-driven applications
Published inJyväskylä studies in computing
The subject of this thesis belongs to the topic of machine learning or, speciﬁcally, to the development of advanced methods for regression analysis, clustering, and anomaly detection. Industry is constantly seeking improved production practices and minimized production time and costs. In connection to this, several industrial case studies are presented in which mathematical models for predicting paper quality were proposed. The most important variables for the prediction models are selected based on information-theoretic measures and regression trees approach. The rest of the original papers are devoted to unsupervised machine learning. The main focus is developing advanced spectral clustering techniques for community detection and anomaly detection. As part of these efforts, a number of enhancements for the dependence clustering algorithm have been proposed. These enhancements include adding regularization for controlling the size of clusters, extension to the ensemble version for improving model stability, handling overlapping clusters, and adaptation to solving anomaly detection problems and handling big datasets. Another focus of the thesis is on developing anomaly detection algorithms for network security data. In connection to this, a probabilistic transition-based approach is proposed for detecting application-layer distributed denial-of-service attacks. The developed approaches are tested on real datasets and are capable of efﬁciently solving the given tasks with high accuracy and good performance. They are shown to be applicable to solving variable selection, graph segmentation, and anomaly detection tasks in different applications. ...
PublisherUniversity of Jyväskylä
clustering community detection anomaly detection paper machine regression analysis regression trees mutual information graph segmentation spectral clustering variable selection network security big data koneoppiminen regressioanalyysi klusterianalyysi paperikoneet laadunvalvonta tiedonlouhinta tietoturva
MetadataShow full item record
- Väitöskirjat 
Showing items with similar title or keywords.
Zolotukhin, Mikhail (University of Jyväskylä, 2014)
Hämäläinen, Joonas (Jyväskylän yliopisto, 2018)Clustering or cluster analysis is an essential part of data mining, machine learning, and pattern recognition. The most popularly applied clustering methods are partitioning-based or prototype-based methods. Prototype-based ...
Unsupervised network intrusion detection systems for zero-day fast-spreading network attacks and botnets Vahdani Amoli, Payam (University of Jyväskylä, 2015)Today, the occurrence of zero-day and complex attacks in high-speed networks is increasingly common due to the high number vulnerabilities in the cyber world. As a result, intrusions become more sophisticated and fast ...
Anomaly-based online intrusion detection system as a sensor for cyber security situational awareness system Kokkonen, Tero (University of Jyväskylä, 2016)Almost all the organisations and even individuals rely on complex structures of data networks and networked computer systems. That complex data ensemble, the cyber domain, provides great opportunities, but at the same ...
Juvonen, Antti (University of Jyväskylä, 2014)