Graph-based exploration and clustering analysis of semantic spaces
Veremyev, A., Semenov, A., Pasiliao, E. L., & Boginski, V. (2019). Graph-based exploration and clustering analysis of semantic spaces. Applied Network Science, 4, Article 104. https://doi.org/10.1007/s41109-019-0228-y
Published inApplied Network Science
© The Authors, 2019
The goal of this study is to demonstrate how network science and graph theory tools and concepts can be effectively used for exploring and comparing semantic spaces of word embeddings and lexical databases. Specifically, we construct semantic networks based on word2vec representation of words, which is “learnt” from large text corpora (Google news, Amazon reviews), and “human built” word networks derived from the well-known lexical databases: WordNet and Moby Thesaurus. We compare “global” (e.g., degrees, distances, clustering coefficients) and “local” (e.g., most central nodes and community-type dense clusters) characteristics of considered networks. Our observations suggest that human built networks possess more intuitive global connectivity patterns, whereas local characteristics (in particular, dense clusters) of the machine built networks provide much richer information on the contextual usage and perceived meanings of words, which reveals interesting structural differences between human built and machine built semantic networks. To our knowledge, this is the first study that uses graph theory and network science in the considered context; therefore, we also provide interesting examples and discuss potential research directions that may motivate further research on the synthesis of lexicographic and machine learning based tools and lead to new insights in this area. ...
Publication in research information system
MetadataShow full item record
Additional information about fundingThe work of V. Boginski and A. Veremyev was supported in part by the U.S. Air Force Research Laboratory (AFRL) award FA8651-16-2-0009. The work of A. Semenov was supported in part by the U.S. Air Force Research Laboratory (AFRL) European Office of Aerospace Research and Development under Grant FA9550-17-1-0030.
Showing items with similar title or keywords.
Chen, Jiawen (2015)Nowadays, quite many different media channels are used popoluarly. For instance, phone call, text message, email, website, and various mobile applications. Web technique plays a significant role in today’s society, no ...
Suopellonmäki, Pekka (2017)Sovelluskehys käyttöliittymän personointiin käyttäen semanttista käyttäjäprofiilia. Internetin kehittyessä maailma verkostoituu yhä enemmän. Käytämme päivittäin monia laitteita ja erilaisia käyttöliittymiä, mutta vaikka ...
Nguyen Kim, Chinh (2018)The coming of the Big Data era has posed great challenges to the traditional de- cision support systems, which are unable to effectively leverage unstructured data, necessi- tating more flexible and adaptable approaches. ...
Zafar, Uzair Ahmed (2017)Any physical area (like schools, home, hospitals etc.) that uses either mobile devices, sensors, embedded systems or computers to gather information from the users and the environment and eventually, adapt according to the ...
Semenov, Alexander; Mantzaris, Alexander V.; Nikolaev, Alexander; Veremyev, Alexander; Veijalainen, Jari; Pasiliao, Eduardo L.; Boginski, Vladimir (Institute of Electrical and Electronics Engineers, 2019)The “post-Soviet space" consists of countries with a substantial fraction of the world’s population; however, unlike many other regions, its social media network landscape is still somewhat underexplored. This study aims ...