Oppijankieliaineistojen annotointi – esimerkkinä ICLFI:n annotoinnin prosessit, ongelmat ja ratkaisut
Jantunen, J. H., Brunni, S., Lehto, L.-M., & Airaksinen, V. (2014). Oppijankieliaineistojen annotointi – esimerkkinä ICLFI:n annotoinnin prosessit, ongelmat ja ratkaisut. AFinLA-e : soveltavan kielitieteen tutkimuksia, 2014(7), 60-80. http://ojs.tsv.fi/index.php/afinla/article/view/48160/13961
Published in
AFinLA-e : soveltavan kielitieteen tutkimuksiaDate
2014This article illustrates the grammatical and error annotation of learner language with the help of
the International Corpus of Learner Finnish (ICLFI). In particular, we will focus on issues arising from
handling with at least semi-automatic methods a morphologically rich language. What makes
this corpus special compared to, for example, English-language material, is the frequent variation
in di" erent forms and related errors, both due to the rich morphology of the target language.
This article begins with a description of the design and implementation process of both the
grammatical and error annotation, followed by a brief introduction to the material for which the
annotations were designed. Finally, we outline some of the problems that have arisen during the
annotation process and their solutions.
Publisher
Suomen Soveltavan Kielitieteen Yhdistys AFinLA ryISSN Search the Publication Forum
1798-7822
Original source
http://ojs.tsv.fi/index.php/afinla/article/view/48160/13961Publication in research information system
https://converis.jyu.fi/converis/portal/detail/Publication/23967061
Metadata
Show full item recordCollections
Related items
Showing items with similar title or keywords.
-
Creating Corpora of Finland’s Sign Languages
Salonen, Juhana; Takkinen, Ritva; Puupponen, Anna; Nieminen, Henri; Pippuri, Outi (European Language Resources Association (ELRA), 2016)This paper discusses the process of creating corpora of the sign languages used in Finland, Finnish Sign Language (FinSL) and Finland-Swedish Sign Language (FinSSL). It describes the process of getting informants and data, ... -
The Corpus of Advanced Learner Finnish (LAS2): Database and toolkit to study academic learner Finnish
Ivaska, Ilmari (Centre for Applied Language Studies, University of Jyväskylä, 2014)This paper introduces the Corpus of Advanced Learner Finnish (LAS2), one of the existing corpora of learner Finnish. The corpus was started at the University of Turku in 2007, and the initial motivation for its collection ... -
Establishing a Standardised Procedure for Building Learner Corpora
Glaznieks, Aivars; Nicolas, Lionel; Stemle, Egon; Abel, Andrea; Lyding, Verena (Centre for Applied Language Studies, University of Jyväskylä, 2014)Decisions at the outset of preparing a learner corpus are of crucial importance for how the corpus can be built and how it can be analysed later on. This paper presents a generic workflow to build learner corpora while ... -
Using Automatic Morphological Tools to Process Data from a Learner Corpus of Hungarian
Durst, Péter; Szabó, Martina Katalin; Vincze, Veronica; Zsibrita, János (Centre for Applied Language Studies, University of Jyväskylä, 2014)The aim of this article is to show how automatic morphological tools originally used to analyze native speaker data can be applied to process data from a learner corpus of Hungarian. We collected written data from 35 ... -
Corpora, phraseology and dictionaries : How does corpus research intersect language teaching and learning?
Jantunen, Jarmo Harri (Uusfilologinen Yhdistys, 2016)This article discusses the role of corpus data in language learning and teaching as well as the benefits of using authentic language data in learner dictionary writing. It has been argued that acquiring and teaching ...