University of Jyväskylä | JYX Digital Repository


Testing a spectral-based feature set for audio genre classification

Authors
Hartmann, Martín Ariel
Date
2011
Discipline
Master's Degree Programme in Music, Mind and Technology

 
Automatic musical genre classification is an important information retrieval task because it has practical applications such as the organization of data collections in the digital music industry. However, the task remains an open problem, because the current state of the art is far from satisfactory in terms of classification performance. Moreover, the algorithms most commonly used for this task are not designed to model music perception. This study proposes a framework for testing different musical features for use in music genre classification and evaluates classification performance based on two musical descriptors. The focus is on automatic classification of music into genres based on audio content. The performance of two sets of timbral descriptors, namely the sub-band fluxes and the mel-frequency cepstral coefficients, is compared. These descriptors were chosen because they differ in how easily they can be interpreted from a perceptual point of view. Classification performance is determined using a variety of music datasets, learning algorithms, feature selection approaches and combinatorial feature subsets derived from these descriptors. The results were evaluated in terms of overall classification accuracy, generalization capability, and the relevance of the descriptors based on feature ranking. According to the results, the sub-band fluxes, perceptually motivated descriptors of polyphonic timbre, performed better than the widely used mel-frequency cepstral coefficients. The former showed higher classification accuracies and a lower tendency to overfit than the latter. In a nutshell, this study supports the use of perceptually interpretable timbre descriptors for musical genre classification tasks and suggests using the sub-band flux set for further content-based tasks in the field of music information retrieval. ...
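The abstract names the sub-band fluxes without describing how they are computed. As a rough illustration only, the sketch below assumes one common formulation: the magnitude spectrum is split into ten octave-scaled bands, and the flux in each band is the Euclidean distance between the band spectra of successive frames. The function name `sub_band_flux`, the band edges, and the frame parameters are assumptions made for this example and are not taken from the thesis.

```python
import numpy as np

def sub_band_flux(signal, sr, frame_len=2048, hop=1024):
    """Per-band spectral flux; returns an array of shape (n_frames - 1, 10).

    Hypothetical helper: band layout and framing are illustrative assumptions.
    Expects a 1-D float array with len(signal) >= frame_len and sr / 2 > 12800.
    """
    # Assumed band edges in Hz: 0-50, 50-100, ..., 6400-12800, 12800-Nyquist.
    edges = np.concatenate(([0.0], 50.0 * 2.0 ** np.arange(9), [sr / 2.0]))
    n_bands = len(edges) - 1

    # Slice the signal into overlapping Hann-windowed frames.
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])

    # Magnitude spectrum of every frame and the frequency of each FFT bin.
    mag = np.abs(np.fft.rfft(frames, axis=1))
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sr)

    flux = np.empty((n_frames - 1, n_bands))
    for b in range(n_bands):
        # Select the bins of band b, then measure the frame-to-frame change.
        in_band = (freqs >= edges[b]) & (freqs < edges[b + 1])
        diff = np.diff(mag[:, in_band], axis=0)
        flux[:, b] = np.sqrt((diff ** 2).sum(axis=1))
    return flux

# Toy input: half a second of silence followed by a 1000 Hz tone.
sr = 44100
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 1000.0 * t)
tone[: sr // 2] = 0.0
flux = sub_band_flux(tone, sr)
print(flux.shape)
print(flux.sum(axis=0).argmax())
```

In this toy input the tone switches on halfway through, so most of the flux should land in the band that contains 1000 Hz. Summary statistics of each column, such as the mean over frames, could then serve as the features passed to a classifier.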
Keywords
music information retrieval, music genre classification, polyphonic timbre, feature ranking, modelling, music, genres, electronic services, classification
URI

http://urn.fi/URN:NBN:fi:jyu-2011080311207

Collections
  • Pro gradu -tutkielmat (Master's theses) [23352]

Related items

Showing items with similar title or keywords.

  • Developing and testing sub-band spectral features in music genre and music mood machine learning 

    Prezja, Fabi (2018)
    In the field of artificial intelligence, supervised machine learning enables us to try to develop automatic recognition systems. In music information retrieval, training and testing such systems is possible with a variety ...
  • The influence of rhythmic and spectro-timbral musical features on gait-related movement 

    Johnson, Susan (2017)
    Music makes us move, and humans have the universal tendency to synchronise their movements to music. This phenomenon has been used in music therapy to help people with movement disorders regain control over their movements. ...
  • Unstable feature relevance in classification tasks 

    Skrypnyk, Iryna (University of Jyväskylä, 2011)
  • Remote Sensing of 3-D Geometry and Surface Moisture of a Peat Production Area Using Hyperspectral Frame Cameras in Visible to Short-Wave Infrared Spectral Ranges Onboard a Small Unmanned Airborne Vehicle (UAV) 

    Honkavaara, Eija; Eskelinen, Matti; Pölönen, Ilkka; Saari, Heikki; Ojanen, Harri; Mannila, Rami; Holmlund, Christer; Hakala, Teemu; Litkey, Paula; Rosnell, Tomi; Viljanen, Niko; Pulkkanen, Merja (Institute of Electrical and Electronics Engineers, 2016)
    Miniaturized hyperspectral imaging sensors are becoming available to small unmanned airborne vehicle (UAV) platforms. Imaging concepts based on frame format offer an attractive alternative to conventional hyperspectral ...
  • Feature selection for classification of music according to expressed emotion 

    Saari, Pasi (2009)