Extracting locations from sport and exercise-related social media messages using a neural network-based bilingual toponym recognition model

Abstract
Sport and exercise contribute to health and well-being in cities. While previous research has mainly focused on activities at specific locations such as sport facilities, "informal sport" that occurs at arbitrary locations across the city has been largely neglected. Such activities are more challenging to observe, but this challenge may be addressed using data collected from social media platforms, because social media users regularly generate content related to sports and exercise at given locations. This allows studying all sport, including "informal sport" at arbitrary locations, to better understand sports and exercise-related activities in cities. However, user-generated geographical information available on social media platforms is becoming scarcer and coarser. This places increased emphasis on extracting location information from free-form text content on social media, which is complicated by multilingualism and informal language. To support this effort, this article presents an end-to-end deep learning-based bilingual toponym recognition model for extracting location information from social media content related to sports and exercise. We show that our approach outperforms five state-of-the-art deep learning and machine learning models. We further demonstrate how our model can be deployed in a geoparsing framework to support city planners in promoting healthy and active lifestyles.
Format
Articles Research article
Published
2022
Publisher
National Center for Geographic Information and Analysis
Original source
http://204.48.17.207/index.php/josis/article/view/167
The permanent address of the publication
https://urn.fi/URN:NBN:fi:jyu-202208154079 (use this address for linking)
Review status
Peer reviewed
ISSN
1948-660X
DOI
https://doi.org/10.5311/JOSIS.2022.24.167
Language
English
Published in
Journal of Spatial Information Science
Citation
  • Liu, P., Koivisto, S., Hiippala, T., van der Lijn, C., Väisänen, T., Nurmi, M., Toivonen, T., Vehkakoski, K., Pyykönen, J., Virmasalo, I., Simula, M., Hasanen, E., Salmikangas, A.-K., & Muukkonen, P. (2022). Extracting locations from sport and exercise-related social media messages using a neural network-based bilingual toponym recognition model. Journal of Spatial Information Science, (24), 31-61. https://doi.org/10.5311/JOSIS.2022.24.167
License
CC BY 4.0, Open Access
Funder(s)
Ministry of the Environment
Funding program(s)
Others
Additional information about funding
This study is a part of the “Equality in suburban physical activity environments, YLLI” research project (in Finnish: Yhdenvertainen liikunnallinen lähiö, YLLI). The project is being financed by the research program on suburbs in Finland, “Lähiöohjelma 2020-2022”, coordinated by the Ministry of the Environment (grant recipient: Dr. Petteri Muukkonen).
Copyright © 2022 Pengyuan Liu, Sonja Koivisto, Tuomo Hiippala, Charlotte van der Lijn, Tuomas Väisänen, Marisofia Nurmi, Tuuli Toivonen, Kirsi Vehkakoski, Janne Pyykönen, Ilkka Virmasalo, Mikko Simula, Elina Hasanen, Anna-Katriina Salmikangas, Petteri Muukkonen
