dc.contributor.author: Robertson, Frankie
dc.contributor.author: Chang, Li-Hsin
dc.contributor.author: Söyrinki, Sini
dc.contributor.editor: Calzolari, Nicoletta
dc.contributor.editor: Béchet, Frédéric
dc.contributor.editor: Blache, Philippe
dc.contributor.editor: Choukri, Khalid
dc.contributor.editor: Cieri, Christopher
dc.contributor.editor: Declerck, Thierry
dc.contributor.editor: Goggi, Sara
dc.contributor.editor: Isahara, Hitoshi
dc.contributor.editor: Maegaard, Bente
dc.contributor.editor: Mariani, Joseph
dc.contributor.editor: Mazo, Hélène
dc.contributor.editor: Odijk, Jan
dc.contributor.editor: Piperidis, Stelios
dc.date.accessioned: 2023-01-03T06:09:16Z
dc.date.available: 2023-01-03T06:09:16Z
dc.date.issued: 2022
dc.identifier.citation: Robertson, F., Chang, L.-H., & Söyrinki, S. (2022). TallVocabL2Fi : A Tall Dataset of 15 Finnish L2 Learners’ Vocabulary. In N. Calzolari, F. Béchet, P. Blache, K. Choukri, C. Cieri, T. Declerck, S. Goggi, H. Isahara, B. Maegaard, J. Mariani, H. Mazo, J. Odijk, & S. Piperidis (Eds.), LREC 2022 : Proceedings of the 13th Conference on Language Resources and Evaluation. European Language Resources Association. LREC proceedings. https://aclanthology.org/2022.lrec-1.685/
dc.identifier.other: CONVID_164825969
dc.identifier.uri: https://jyx.jyu.fi/handle/123456789/84663
dc.description.abstract: Previous work concerning measurement of second language learners has tended to focus on the knowledge of small numbers of words, often geared towards measuring vocabulary size. This paper presents a “tall” dataset containing information about a few learners’ knowledge of many words, suitable for evaluating Vocabulary Inventory Prediction (VIP) techniques, including those based on Computerised Adaptive Testing (CAT). In comparison to previous comparable datasets, the learners are from varied backgrounds, so as to reduce the risk of overfitting when used for machine learning based VIP. The dataset contains both a self-rating test and a translation test, used to derive a measure of reliability for learner responses. The dataset creation process is documented, and the relationship between variables concerning the participants, such as their completion time, their language ability level, and the triangulated reliability of their self-assessment responses, are analysed. The word list is constructed by taking into account the extensive derivation morphology of Finnish, and infrequent words are included in order to account for explanatory variables beyond word frequency. [en]
dc.format.mimetype: application/pdf
dc.language.iso: eng
dc.publisher: European Language Resources Association
dc.relation.ispartof: LREC 2022 : Proceedings of the 13th Conference on Language Resources and Evaluation
dc.relation.ispartofseries: LREC proceedings
dc.relation.uri: https://aclanthology.org/2022.lrec-1.685/
dc.rights: CC BY-NC 4.0
dc.subject.other: word knowledge
dc.subject.other: word response data
dc.subject.other: mental lexicon
dc.subject.other: Finnish
dc.subject.other: learner data
dc.title: TallVocabL2Fi : A Tall Dataset of 15 Finnish L2 Learners’ Vocabulary
dc.type: conferenceObject
dc.identifier.urn: URN:NBN:fi:jyu-202301031023
dc.contributor.laitos: Koulutuksen tutkimuslaitos [fi]
dc.contributor.laitos: Informaatioteknologian tiedekunta [fi]
dc.contributor.laitos: Finnish Institute for Educational Research [en]
dc.contributor.laitos: Faculty of Information Technology [en]
dc.contributor.oppiaine: Collective Intelligence [fi]
dc.contributor.oppiaine: Koulutusteknologia ja kognitiotiede [fi]
dc.contributor.oppiaine: Collective Intelligence [en]
dc.contributor.oppiaine: Learning and Cognitive Sciences [en]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper
dc.relation.isbn: 979-10-95546-72-6
dc.type.coar: http://purl.org/coar/resource_type/c_5794
dc.description.reviewstatus: peerReviewed
dc.relation.issn: 2522-2686
dc.type.version: publishedVersion
dc.rights.copyright: © European Language Resources Association (ELRA)
dc.rights.accesslevel: openAccess [fi]
dc.relation.conference: International Conference on Language Resources and Evaluation
dc.subject.yso: toinen kieli (second language)
dc.subject.yso: oppiminen (learning)
dc.subject.yso: sanavarasto (vocabulary)
dc.subject.yso: mittausmenetelmät (measurement methods)
dc.subject.yso: mittaus (measurement)
dc.subject.yso: data
dc.subject.yso: sanat (words)
dc.subject.yso: arviointi (evaluation)
dc.subject.yso: koneoppiminen (machine learning)
dc.subject.yso: kielen oppiminen (language learning)
dc.format.content: fulltext
jyx.subject.uri: http://www.yso.fi/onto/yso/p17005
jyx.subject.uri: http://www.yso.fi/onto/yso/p2945
jyx.subject.uri: http://www.yso.fi/onto/yso/p21233
jyx.subject.uri: http://www.yso.fi/onto/yso/p20083
jyx.subject.uri: http://www.yso.fi/onto/yso/p4794
jyx.subject.uri: http://www.yso.fi/onto/yso/p27250
jyx.subject.uri: http://www.yso.fi/onto/yso/p3291
jyx.subject.uri: http://www.yso.fi/onto/yso/p7413
jyx.subject.uri: http://www.yso.fi/onto/yso/p21846
jyx.subject.uri: http://www.yso.fi/onto/yso/p24061
dc.rights.url: https://creativecommons.org/licenses/by-nc/4.0/
dc.type.okm: A4


Unless otherwise noted, this item is licensed under CC BY-NC 4.0.