Developing and testing sub-band spectral features in music genre and music mood machine learning

Prezja, Fabi

dc.contributor.advisor	Toiviainen, Petri
dc.contributor.author	Prezja, Fabi
dc.date.accessioned	2019-01-08T12:49:39Z
dc.date.available	2019-01-08T12:49:39Z
dc.date.issued	2018
dc.identifier.uri	https://jyx.jyu.fi/handle/123456789/60963
dc.description.abstract	In the field of artificial intelligence, supervised machine learning enables us to try to develop automatic recognition systems. In music information retrieval, training and testing such systems is possible with a variety of music datasets. Two key prediction tasks are those of music genre recognition, and of music mood recognition. The focus of this study is to evaluate the classification of music into genres and mood categories from the audio content. To this end, we evaluate five novel spectro-temporal variants of sub-band musical features. These features are, sub-band entropy, sub-band flux, sub-band kurtosis, sub-band skewness and sub-band zero crossing rate. The choice of features is based on previous studies that highlight the potential efficacy of sub-band features. To aid our analysis we include the Mel-Frequency Cepstral Coefficients feature as our baseline approach. The classification performances are obtained with various learning algorithms, distinct datasets and multiple feature selection subsets. In order to create and evaluate models in both tasks, we use two music datasets prelabelled with regards to, music genres (GTZAN) and music mood (PandaMood) respectively. In addition, this study is the first to develop an adaptive window decomposition method for these sub-band features and one of a handful few that uses artist filtering and fault filtering for the GTZAN dataset. Our results show that the vast majority of sub-band features outperformed the MFCCs in the music genre and the music mood tasks. Between individual features, sub-band entropy outperformed and outranked every feature in both tasks and feature selection approaches. Lastly, we find lower overfitting tendencies for sub-band features in comparison to the MFCCs. In summary, this study gives support to the use of these sub-band features for music genre and music mood classification tasks and further suggests uses in other content-based predictive tasks.	en
dc.format.extent	114
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject.other	music information retrieval
dc.subject.other	music genre classification
dc.subject.other	music mood classification
dc.subject.other	sub-band features
dc.subject.other	polyphonic timbre
dc.subject.other	spectral features
dc.subject.other	adaptive spectral window decomposition
dc.title	Developing and testing sub-band spectral features in music genre and music mood machine learning
dc.identifier.urn	URN:NBN:fi:jyu-201901081104
dc.type.ontasot	Pro gradu -tutkielma	fi
dc.type.ontasot	Master’s thesis	en
dc.contributor.tiedekunta	Humanistis-yhteiskuntatieteellinen tiedekunta	fi
dc.contributor.tiedekunta	Faculty of Humanities and Social Sciences	en
dc.contributor.laitos	Musiikin, taiteen ja kulttuurin tutkimuksen laitos	fi
dc.contributor.laitos	Department of Music, Art and Culture Studies	en
dc.contributor.yliopisto	Jyväskylän yliopisto	fi
dc.contributor.yliopisto	University of Jyväskylä	en
dc.contributor.oppiaine	Music, Mind and Technology (maisteriohjelma)	fi
dc.contributor.oppiaine	Master's Degree Programme in Music, Mind and Technology	en
dc.rights.copyright	Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.	fi
dc.rights.copyright	This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.	en
dc.type.publication	masterThesis
dc.contributor.oppiainekoodi	3054
dc.subject.yso	koneoppiminen
dc.subject.yso	genret
dc.subject.yso	luokitus (toiminta)
dc.subject.yso	machine learning
dc.subject.yso	genres
dc.subject.yso	classification
dc.format.content	fulltext
dc.type.okm	G2

Aineistoon kuuluvat tiedostot

Nimi:: URN:NBN:fi:jyu-201901081104.pdf
Koko:: 2.265Mb
Tiedostomuoto:: PDF

Katso/Avaa

Aineisto kuuluu seuraaviin kokoelmiin

Pro gradu -tutkielmat [29564]

Näytä suppeat kuvailutiedot

Developing and testing sub-band spectral features in music genre and music mood machine learning

Aineistoon kuuluvat tiedostot

Aineisto kuuluu seuraaviin kokoelmiin

Samankaltainen aineisto

Testing a spectral-based feature set for audio genre classification ﻿

Updating strategies for distance based classification model with recursive least squares ﻿

Unstable feature relevance in classification tasks ﻿

Comparison of feature importance measures as explanations for classification models ﻿

The Truth is Out There : Focusing on Smaller to Guess Bigger in Image Classification ﻿

Testing a spectral-based feature set for audio genre classification

Updating strategies for distance based classification model with recursive least squares

Unstable feature relevance in classification tasks

Comparison of feature importance measures as explanations for classification models

The Truth is Out There : Focusing on Smaller to Guess Bigger in Image Classification