University of Jyväskylä | JYX Digital Repository

  • English  | Give feedback |
    • suomi
    • English
 
  • Login
JavaScript is disabled for your browser. Some features of this site may not work without it.
View Item 
  • JYX
  • Lehdet
  • Apples : Journal of Applied Language Studies
  • 2014, Volume 8
  • 2014, Volume 8, Issue 3
  • View Item
JYX > Lehdet > Apples : Journal of Applied Language Studies > 2014, Volume 8 > 2014, Volume 8, Issue 3 > View Item

Establishing a Standardised Procedure for Building Learner Corpora

ThumbnailPublisher's PDF
View/Open
334.8 Kb

Downloads:  
Show download detailsHide download details  
Glaznieks, A., Nicolas, L. Stemle, E. Abel & A. Lyding, V. (2014). Establishing a Standardised Procedure for Building Learner Corpora. Apples: journal of applied language studies, 8 (3), 5-20. Retrieved from http://apples.jyu.fi
Published in
Apples: journal of applied language studies
Authors
Glaznieks, Aivars |
Nicolas, Lionel |
Stemle, Egon |
Abel, Andrea |
Lyding, Verena
Date
2014
Copyright
© The Author(s)

 
Decisions at the outset of preparing a learner corpus are of crucial importance for how the corpus can be built and how it can be analysed later on. This paper presents a generic workflow to build learner corpora while taking into account the needs of the users. The workflow results from an extensive collaboration between linguists that annotate and use the corpus and computer linguists that are responsible for providing technical support. The paper addresses the linguists’ research needs as well as the availability and usability of language technology tools necessary to meet them. We demonstrate and illustrate the relevance of the workflow using results and examples from our L1 learner corpus of German (“KoKo”).
Publisher
Centre for Applied Language Studies, University of Jyväskylä
ISSN Search the Publication Forum
1457-9863
Keywords
L1 learner corpus corpus building workflow German as a first language

Original source
http://apples.jyu.fi

URI

http://urn.fi/URN:NBN:fi:jyu-201501071034

Metadata
Show full item record
Collections
  • 2014, Volume 8, Issue 3 [7]

Related items

Showing items with similar title or keywords.

  • The Corpus of Advanced Learner Finnish (LAS2): Database and toolkit to study academic learner Finnish 

    Ivaska, Ilmari (Centre for Applied Language Studies, University of Jyväskylä, 2014)
    This paper introduces the Corpus of Advanced Learner Finnish (LAS2), one of the existing corpora of learner Finnish. The corpus was started at the University of Turku in 2007, and the initial motivation for its collection ...
  • Creating Corpora of Finland’s Sign Languages 

    Salonen, Juhana; Takkinen, Ritva; Puupponen, Anna; Nieminen, Henri; Pippuri, Outi (European Language Resources Association (ELRA), 2016)
    This paper discusses the process of creating corpora of the sign languages used in Finland, Finnish Sign Language (FinSL) and Finland-Swedish Sign Language (FinSSL). It describes the process of getting informants and data, ...
  • Using Automatic Morphological Tools to Process Data from a Learner Corpus of Hungarian 

    Durst, Péter; Szabó, Martina Katalin; Vincze, Veronica; Zsibrita, János (Centre for Applied Language Studies, University of Jyväskylä, 2014)
    The aim of this article is to show how automatic morphological tools originally used to analyze native speaker data can be applied to process data from a learner corpus of Hungarian. We collected written data from 35 ...
  • The International Comparable Corpus : Challenges in building multilingual spoken and written comparable corpora 

    Čermáková, Ann; Jantunen, Jarmo; Jauhiainen, Tommi; Kirk, John; Křen, Michal; Kupietz, Marc; Uí Dhonnchadha, Elaine (Asociacion Espanola de Linguistica de Corpus, 2021)
    This paper reports on the efforts of twelve national teams in building the International Comparable Corpus (ICC; https://korpus.cz/icc) that will contain highly comparable datasets of spoken, written and electronic registers. ...
  • Compilation of language corpora : computer related issues and annotation 

    Marin, Eeva (1999)
  • Browse materials
  • Browse materials
  • Articles
  • Conferences and seminars
  • Electronic books
  • Historical maps
  • Journals
  • Tunes and musical notes
  • Photographs
  • Presentations and posters
  • Publication series
  • Research reports
  • Research data
  • Study materials
  • Theses

Browse

All of JYXCollection listBy Issue DateAuthorsSubjectsPublished inDepartmentDiscipline

My Account

Login

Statistics

View Usage Statistics
  • How to publish in JYX?
  • Self-archiving
  • Publish Your Thesis Online
  • Publishing Your Dissertation
  • Publication services

Open Science at the JYU
 
Data Protection Description

Accessibility Statement

Unless otherwise specified, publicly available JYX metadata (excluding abstracts) may be freely reused under the CC0 waiver.
Open Science Centre