Dynamic integration of classifiers for handling concept drift
Tsymbal, A., Pechenizkiy, M., Cunningham, P., & Puuronen, S. (2008). Dynamic integration of classifiers for handling concept drift. Information fusion, 9(1), 56-68. https://doi.org/10.1016/j.inffus.2006.11.002
Julkaistu sarjassa
Information fusionPäivämäärä
2008Tekijänoikeudet
© 2006 Elsevier B.V. All rights reserved.
In the real world concepts are often not stable but change with time. A typical example of this in the biomedical context is antibiotic resistance, where pathogen sensitivity may change over time as new pathogen strains develop resistance to antibiotics that were previously effective. This problem, known as concept drift, complicates the task of learning a model from data and requires special approaches, different from commonly used techniques that treat arriving instances as equally important contributors to the final concept. The underlying data distribution may change as well, making previously built models useless. This is known as virtual concept drift. Both types of concept drifts make regular updates of the model necessary. Among the most popular and effective approaches to handle concept drift is ensemble learning, where a set of models built over different time periods is maintained and the best model is selected or the predictions of models are combined, usually according to their expertise level regarding the current concept. In this paper we propose the use of an ensemble integration technique that would help to better handle concept drift at an instance level. In dynamic integration of classifiers, each base classifier is given a weight proportional to its local accuracy with regard to the instance tested, and the best base classifier is selected, or the classifiers are integrated using weighted voting. Our experiments with synthetic data sets simulating abrupt and gradual concept drifts and with a real-world antibiotic resistance data set demonstrate that dynamic integration of classifiers built over small time intervals or fixed-sized data blocks can be significantly better than majority voting and weighted voting, which are currently the most commonly used integration techniques for handling concept drift with ensembles.
...
Julkaisija
ElsevierISSN Hae Julkaisufoorumista
1566-2535Asiasanat
Julkaisu tutkimustietojärjestelmässä
https://converis.jyu.fi/converis/portal/detail/Publication/16379860
Metadata
Näytä kaikki kuvailutiedotKokoelmat
Lisätietoja rahoituksesta
This material is based upon work supported by the Science Foundation Ireland under Grant No. S.F.I.-02/N.1/111. This research was also partly supported by the Academy of Finland.Lisenssi
Samankaltainen aineisto
Näytetään aineistoja, joilla on samankaltainen nimeke tai asiasanat.
-
Predicting Children's Myopia Risk : A Monte Carlo Approach to Compare the Performance of Machine Learning Models
Artiemjew, Piotr; Cybulski, Radosław; Emamian, Mohammad; Grzybowski, Andrzej; Jankowski, Andrzej; Lanca, Carla; Mehravaran, Shiva; Młyński, Marcin; Morawski, Cezary; Nordhausen, Klaus; Pärssinen, Olavi; Ropiak, Krzysztof (SCITEPRESS Science and Technology Publications, 2024)This study presents the initial results of the Myopia Risk Calculator (MRC) Consortium, introducing an innovative approach to predict myopia risk by using trustworthy machine-learning models. The dataset included approximately ... -
Effect of variable selection strategy on the predictive models for adverse pregnancy outcomes of pre-eclampsia : A retrospective study
Zheng, Dongying; Hao, Xinyu; Khan, Muhanmmad; Kang, Fuli; Li, Fan; Hämäläinen, Timo; Wang, Lixia (Scholar Media Publishing Company, 2024)Objectives: The improvement of prediction for adverse pregnancy outcomes is quite essential to the women suffering from pre-eclampsia, while the collection of predictive indicators is the prerequisite. The traditional ... -
Artificial Intelligence and Computational Science
Neittaanmäki, Pekka; Repin, Sergey (Springer, 2022)In this note, we discuss the interaction between two ways of scientific analysis. The first (classical) way is known as Mathematical Modeling (MM). It is based on a model created by humans and presented in mathematical ... -
Generation of Error Indicators for Partial Differential Equations by Machine Learning Methods
Muzalevskiy, Alexey; Neittaanmäki, Pekka; Repin, Sergey (Springer, 2022)Computer simulation methods for models based on partial differential equations usually apply adaptive strategies that generate sequences of approximations for consequently refined meshes. In this process, error indicators ... -
The focus and timing of gaze matters : Investigating collaborative knowledge construction in a simulation-based environment by combined video and eye tracking
Lämsä, Joni; Kotkajuuri, Jimi; Lehtinen, Antti; Koskinen, Pekka; Mäntylä, Terhi; Kilpeläinen, Jasmin; Hämäläinen, Raija (Frontiers Media SA, 2022)Although eye tracking has been successfully used in science education research, exploiting its potential in collaborative knowledge construction has remained sporadic. This article presents a novel approach for studying ...
Ellei toisin mainittu, julkisesti saatavilla olevia JYX-metatietoja (poislukien tiivistelmät) saa vapaasti uudelleenkäyttää CC0-lisenssillä.