Näytä suppeat kuvailutiedot

dc.contributor.authorKurimo, Mikko
dc.contributor.authorGetman, Yaroslav
dc.contributor.authorVoskoboinik, Ekaterina
dc.contributor.authorAl-Ghezi, Ragheb
dc.contributor.authorKallio, Heini
dc.contributor.authorKuronen, Mikko
dc.contributor.authorvon Zansen, Anna
dc.contributor.authorHilden, Raili
dc.contributor.authorKronholm, Sirkku
dc.contributor.authorHuhta, Ari
dc.contributor.authorLinden, Krister
dc.date.accessioned2023-09-01T10:57:05Z
dc.date.available2023-09-01T10:57:05Z
dc.date.issued2023
dc.identifier.citationKurimo, M., Getman, Y., Voskoboinik, E., Al-Ghezi, R., Kallio, H., Kuronen, M., von Zansen, A., Hilden, R., Kronholm, S., Huhta, A., & Linden, K. (2023). New data, benchmark and baseline for L2 speaking assessment for low-resoure languages. In <i>Proceedings of the 9th Workshop on Speech and Language Technology in Education (SLaTE) </i> (pp. 166-170). International Speech Communication Association. <a href="https://doi.org/10.21437/SLaTE.2023-32" target="_blank">https://doi.org/10.21437/SLaTE.2023-32</a>
dc.identifier.otherCONVID_184436350
dc.identifier.urihttps://jyx.jyu.fi/handle/123456789/88853
dc.description.abstractThe development of large multilingual speech models provides the possibility to construct high-quality speech technology even for low-resource languages. In this paper, we present the speech data of L2 learners of Finnish and Finland Swedish that we have recently collected for training and evaluation of automatic speech recognition (ASR) and speaking assessment (ASA). It includes over 4000 recordings by over 300 students per language in short read-aloud and free-form tasks. The recordings have been manually transcribed and assessed for pronunciation, fluency, range, accuracy, task achievement, and a holistic proficiency level. We present also an ASR and ASA benchmarking setup we have constructed using this data and include results from our baseline systems built by fine-tuning self-supervised multilingual model for the target language. In addition to benchmarking, our baseline system can be used by L2 students and teachers for online self-training and evaluation of oral proficiency.en
dc.format.extent186
dc.format.mimetypeapplication/pdf
dc.language.isoeng
dc.publisherInternational Speech Communication Association
dc.relation.ispartofProceedings of the 9th Workshop on Speech and Language Technology in Education (SLaTE)
dc.rightsIn Copyright
dc.subject.otherpuhemallit
dc.subject.otherASR
dc.subject.otherL2 speaking assessment
dc.subject.otherwav2vec2.0
dc.subject.otherlow-resource languages
dc.titleNew data, benchmark and baseline for L2 speaking assessment for low-resoure languages
dc.typeconferenceObject
dc.identifier.urnURN:NBN:fi:jyu-202309014887
dc.contributor.laitosSoveltavan kielentutkimuksen keskusfi
dc.contributor.laitosKieli- ja viestintätieteiden laitosfi
dc.contributor.laitosCentre for Applied Language Studiesen
dc.contributor.laitosDepartment of Language and Communication Studiesen
dc.contributor.oppiaineSoveltava kielentutkimusfi
dc.contributor.oppiaineRuotsin kielifi
dc.contributor.oppiaineHyvinvoinnin tutkimuksen yhteisöfi
dc.contributor.oppiaineSuomen kielifi
dc.contributor.oppiaineApplied language studiesen
dc.contributor.oppiaineSwedishen
dc.contributor.oppiaineSchool of Wellbeingen
dc.contributor.oppiaineFinnishen
dc.type.urihttp://purl.org/eprint/type/ConferencePaper
dc.type.coarhttp://purl.org/coar/resource_type/c_5794
dc.description.reviewstatusnonPeerReviewed
dc.format.pagerange166-170
dc.type.versionpublishedVersion
dc.rights.copyright© 2023 International Speech Communication Association
dc.rights.accesslevelopenAccessfi
dc.relation.conferenceWorkshop on Speech and Language Technology in Education
dc.relation.grantnumber322965
dc.subject.ysosuomi toisena kielenä
dc.subject.ysosuomenruotsi
dc.subject.ysoarviointi
dc.subject.ysopuheentunnistus
dc.subject.ysotoinen kieli
dc.subject.ysoruotsi toisena kielenä
dc.subject.ysomonikielisyys
dc.subject.ysokielen oppiminen
dc.subject.ysosuullinen kielitaito
dc.subject.ysopuhe (puhuminen)
dc.format.contentfulltext
jyx.subject.urihttp://www.yso.fi/onto/yso/p24613
jyx.subject.urihttp://www.yso.fi/onto/yso/p12864
jyx.subject.urihttp://www.yso.fi/onto/yso/p7413
jyx.subject.urihttp://www.yso.fi/onto/yso/p8264
jyx.subject.urihttp://www.yso.fi/onto/yso/p17005
jyx.subject.urihttp://www.yso.fi/onto/yso/p24614
jyx.subject.urihttp://www.yso.fi/onto/yso/p6720
jyx.subject.urihttp://www.yso.fi/onto/yso/p24061
jyx.subject.urihttp://www.yso.fi/onto/yso/p17782
jyx.subject.urihttp://www.yso.fi/onto/yso/p2492
dc.rights.urlhttp://rightsstatements.org/page/InC/1.0/?language=en
dc.relation.doi10.21437/SLaTE.2023-32
dc.relation.funderResearch Council of Finlanden
dc.relation.funderSuomen Akatemiafi
jyx.fundingprogramAcademy Project, AoFen
jyx.fundingprogramAkatemiahanke, SAfi
jyx.fundinginformationThis work was done and the data were collected as part of the Academy of Finland grants number 322619, 322625, 322965 and 337073.
dc.type.okmD3


Aineistoon kuuluvat tiedostot

Thumbnail

Aineisto kuuluu seuraaviin kokoelmiin

Näytä suppeat kuvailutiedot

In Copyright
Ellei muuten mainita, aineiston lisenssi on In Copyright