Biased GraphWalks for RDF Graph Embeddings

Abstract
Knowledge Graphs have been recognized as a valuable source for background information in many data mining, information retrieval, natural language processing, and knowledge extraction tasks. However, obtaining a suitable feature vector representation from RDF graphs is a challenging task. In this paper, we extend the RDF2Vec approach, which leverages language modeling techniques for unsupervised feature extraction from sequences of entities. We generate sequences by exploiting local information from graph substructures, harvested by graph walks, and learn latent numerical representations of entities in RDF graphs. We extend the way we compute feature vector representations by comparing twelve di erent edge weighting functions for performing biased walks on the RDF graph, in order to generate higher quality graph embeddings. We evaluate our approach using di erent machine learning, as well as entity and document modeling benchmark data sets, and show that the naive RDF2Vec approach can be improved by exploiting Biased Graph Walks.
Main Authors
Format
Conferences Conference paper
Published
2017
Subjects
Publication in research information system
Publisher
ACM
The permanent address of the publication
https://urn.fi/URN:NBN:fi:jyu-201712114609Käytä tätä linkitykseen.
Parent publication ISBN
978-1-4503-5225-3
Review status
Peer reviewed
DOI
https://doi.org/10.1145/3102254.3102279
Conference
International Conference on Web Intelligence, Mining and Semantics
Language
English
Is part of publication
WIMS '17 : Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics
Citation
  • Cochez, M., Ristoski, P., Ponzetto, S. P., & Paulheim, H. (2017). Biased GraphWalks for RDF Graph Embeddings. In WIMS '17 : Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics (Article 21). ACM. https://doi.org/10.1145/3102254.3102279
License
Open Access
Copyright© 2017 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Share