Smart prototype selection for machine learning based on ignorance zones analysis
The size of databases has been considerably growing over recent decades and Machine Learning algorithms are not ready to process such large volume of information. Being one of the most useful algorithms in Data Mining the Nearest neighbor classifier suffers from high storage requirements and slow response when working with large data sets. Prototype Selection methods help to alleviate this problem by choosing a subset of data with a smaller size. In this thesis, the overview of existing instance selection methods is provided together with the introduction of a new approach. The majority of current methods select a subset experimentally by checking whether certain point affects classification accuracy or not. The new approach, presented in this thesis, is based on analyzing data set instances and choosing prototypes based on discovered ignorance zones. The results obtained from the analysis show that the proposed method can effectively decrease the size of the data set while maintaining the same classification accuracy with the Nearest neighbor classifier. In addition, it allows removing noisy data making the decision boundaries smoother.
...
Asiasanat
Metadata
Näytä kaikki kuvailutiedotKokoelmat
- Pro gradu -tutkielmat [28907]
Samankaltainen aineisto
Näytetään aineistoja, joilla on samankaltainen nimeke tai asiasanat.
-
Radio frequency fingerprinting for outdoor user equipment localization
Mondal, Riaz Uddin (University of Jyväskylä, 2017)The recent advancements in cellular mobile technology and smart phone usage have opened opportunities for researchers and commercial companies to develop ubiquitous low cost localization systems. Radio frequency (RF) ... -
Minimal learning machine in hyperspectral imaging classification
Hakola, Anna-Maria; Pölönen, Ilkka (SPIE, 2020)A hyperspectral (HS) image is typically a stack of frames, where each frame represents the intensity of a different wavelength of light. Each spatial pixel has a spectrum. In the classification of the HS image, each spectrum ... -
Improvements and applications of the elements of prototype-based clustering
Hämäläinen, Joonas (Jyväskylän yliopisto, 2018) -
Updating strategies for distance based classification model with recursive least squares
Raita-Hakola, Anna-Maria; Pölönen, Ilkka (Copernicus Publications, 2022)The idea is to create a self-learning Minimal Learning Machine (MLM) model that is computationally efficient, easy to implement and performs with high accuracy. The study has two hypotheses. Experiment A examines the ... -
Cluster-Based RF Fingerprint Positioning Using LTE and WLAN Outdoor Signals
Mondal, Riaz; Ristaniemi, Tapani; Turkka, Jussi (IEEE, 2015)In this paper we evaluate user-equipment (UE) positioning performance of three cluster-based RF fingerprinting methods using LTE and WLAN signals. Real-life LTE and WLAN data were collected for the evaluation purpose ...
Ellei toisin mainittu, julkisesti saatavilla olevia JYX-metatietoja (poislukien tiivistelmät) saa vapaasti uudelleenkäyttää CC0-lisenssillä.