Analysing Complex Life Sequence Data with Hidden Markov Modelling
Helske, S., Helske, J., & Eerola, M. (2016). Analysing Complex Life Sequence Data with Hidden Markov Modelling. In G. Ritschard, & M. Studer (Eds.), LaCOSA II : Proceedings of the International Conference on Sequence Analysis and Related Methods (pp. 209-240). LIVES - Swiss National Centre of Competence in Research; Swiss National Science Foundation; Université de Genevè. https://lacosa.lives-nccr.ch/sites/lacosa.lives-nccr.ch/files/proc-lacosa2-helskehelskeeerola_paper_24.pdf
Date
2016Copyright
© the Authors & LIVES - Swiss National Centre of Competence in Research, Swiss National Science Foundation, Université de Genevè, 2016.
When analysing complex sequence data with multiple channels (dimensions)
and long observation sequences, describing and visualizing the data can be
a challenge. Hidden Markov models (HMMs) and their mixtures (MHMMs) offer
a probabilistic model-based framework where the information in such data can be
compressed into hidden states (general life stages) and clusters (general patterns in
life courses).
We studied two different approaches to analysing clustered life sequence data
with sequence analysis (SA) and hidden Markov modelling. In the first approach
we used SA clusters as fixed and estimated HMMs separately for each group. In the
second approach we treated SA clusters as suggestive and used them as a starting
point for the estimation of MHMMs.
Even though the MHMM approach has advantages, we found it to be unfeasible
in this type of complex setting. Instead, using separate HMMs for SA clusters was
useful for finding and describing patterns in life courses.
Publisher
LIVES - Swiss National Centre of Competence in Research; Swiss National Science Foundation; Université de GenevèConference
International Conference on Sequence Analysis and Related MethodsIs part of publication
LaCOSA II : Proceedings of the International Conference on Sequence Analysis and Related Methods
Original source
https://lacosa.lives-nccr.ch/sites/lacosa.lives-nccr.ch/files/proc-lacosa2-helskehelskeeerola_paper_24.pdfPublication in research information system
https://converis.jyu.fi/converis/portal/detail/Publication/26144449
Metadata
Show full item recordCollections
Related items
Showing items with similar title or keywords.
-
Statistical analysis of life sequence data
Helske, Satu (University of Jyväskylä, 2016) -
Minimum Description Length Based Hidden Markov Model Clustering for Life Sequence Analysis
Helske, Jouni; Eerola, Mervi; Tabus, Ioan (2010)In this article, a model-based method for clustering life sequences is suggested. In the social sciences, model-free clustering methods are often used in order to find typical life sequences. The suggested method, which ... -
Conditional particle filters with diffuse initial distributions
Karppinen, Santeri; Vihola, Matti (Springer, 2021)Conditional particle filters (CPFs) are powerful smoothing algorithms for general nonlinear/non-Gaussian hidden Markov models. However, CPFs can be inefficient or difficult to apply with diffuse initial distributions, which ... -
Conditional particle filters with bridge backward sampling
Karppinen, Santeri; Singh, Sumeetpal S.; Vihola, Matti (Taylor & Francis, 2024)Conditional particle filters (CPFs) with backward/ancestor sampling are powerful methods for sampling from the posterior distribution of the latent states of a dynamic model such as a hidden Markov model. However, the ... -
Part-of-speech tagging in written slang
Korolainen, Valtteri (2014)Erilaiset kieliteknologiasovellukset ovat olleet jo vuosikymmeniä arkipäiväises-sä käytössä. Esimerkiksi ennustava tekstinsyöttö ja automaattinen korjaus ovat olleet käytössä jo vuosikymmeniä. Puheen tunnistus ja kielen ...