Advanced search
2 files | 2.35 MB Add to list

Benchmarking whole knowledge graph embedding techniques

Author
Organization
Project
Abstract
Knowledge Graphs (KGs) are gaining popularity and are being widely used in a plethora of applications. They owe their popularity to the fact that KGs are an ideal form to integrate and retrieve data originating from various sources. Using KGs as input for Machine Learning (ML) tasks allows to perform predictions on these popular graph structures. However, KGs cannot directly be used as ML input in their graph representation, they first require to be transformed to a vector space representation through an embedding technique. As ML techniques are data-driven, they can generalize over unseen input data that deviates to some extent from the data they were trained upon. To fully exploit the generalization capabilities of ML algorithms when using embedded KGs as input, small changes in the KGs should also result in small changes in the embedding space. Various embedding techniques for graphs in general exist, however, they have not been tailored towards embedding whole KGs, while KGs can be considered a special kind of graph that adheres to a certain KG schema. This paper evaluates if these existing embedding techniques that embed the whole graphs can represent the similarity between KGs in their embedding space, allowing ML algorithms to generalize over their input. We compare the similarities between KGs in terms of changes in size, entity labels, and KG schema. We found that most techniques were able to represent the similarities in terms of size and entity labels in their embedding space, however, none of the techniques were able to capture the similarities in KG schema.
Keywords
Knowledge graphs, embeddings, graph comparison, benchmark

Downloads

  • (...).pdf
    • full text (Published version)
    • |
    • UGent only
    • |
    • PDF
    • |
    • 1.42 MB
  • 8435 acc.pdf
    • full text (Accepted manuscript)
    • |
    • open access
    • |
    • PDF
    • |
    • 933.18 KB

Citation

Please use this url to cite or link to this publication:

MLA
Bonte, Pieter, et al. “Benchmarking Whole Knowledge Graph Embedding Techniques.” INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, vol. 34, no. 1, 2024, pp. 163–84, doi:10.1142/S0218194023500481.
APA
Bonte, P., Vanden Hautte, S., De Turck, F., Van Hoecke, S., & Ongenae, F. (2024). Benchmarking whole knowledge graph embedding techniques. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 34(1), 163–184. https://doi.org/10.1142/S0218194023500481
Chicago author-date
Bonte, Pieter, Sander Vanden Hautte, Filip De Turck, Sofie Van Hoecke, and Femke Ongenae. 2024. “Benchmarking Whole Knowledge Graph Embedding Techniques.” INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING 34 (1): 163–84. https://doi.org/10.1142/S0218194023500481.
Chicago author-date (all authors)
Bonte, Pieter, Sander Vanden Hautte, Filip De Turck, Sofie Van Hoecke, and Femke Ongenae. 2024. “Benchmarking Whole Knowledge Graph Embedding Techniques.” INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING 34 (1): 163–184. doi:10.1142/S0218194023500481.
Vancouver
1.
Bonte P, Vanden Hautte S, De Turck F, Van Hoecke S, Ongenae F. Benchmarking whole knowledge graph embedding techniques. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING. 2024;34(1):163–84.
IEEE
[1]
P. Bonte, S. Vanden Hautte, F. De Turck, S. Van Hoecke, and F. Ongenae, “Benchmarking whole knowledge graph embedding techniques,” INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, vol. 34, no. 1, pp. 163–184, 2024.
@article{01HKPXSM40CFWE37E0J8N84PDQ,
  abstract     = {{Knowledge Graphs (KGs) are gaining popularity and are being widely used in a plethora of applications. They owe their popularity to the fact that KGs are an ideal form to integrate and retrieve data originating from various sources. Using KGs as input for Machine Learning (ML) tasks allows to perform predictions on these popular graph structures. However, KGs cannot directly be used as ML input in their graph representation, they first require to be transformed to a vector space representation through an embedding technique. As ML techniques are data-driven, they can generalize over unseen input data that deviates to some extent from the data they were trained upon. To fully exploit the generalization capabilities of ML algorithms when using embedded KGs as input, small changes in the KGs should also result in small changes in the embedding space. Various embedding techniques for graphs in general exist, however, they have not been tailored towards embedding whole KGs, while KGs can be considered a special kind of graph that adheres to a certain KG schema. This paper evaluates if these existing embedding techniques that embed the whole graphs can represent the similarity between KGs in their embedding space, allowing ML algorithms to generalize over their input. We compare the similarities between KGs in terms of changes in size, entity labels, and KG schema. We found that most techniques were able to represent the similarities in terms of size and entity labels in their embedding space, however, none of the techniques were able to capture the similarities in KG schema.}},
  author       = {{Bonte, Pieter and Vanden Hautte, Sander and De Turck, Filip and Van Hoecke, Sofie and Ongenae, Femke}},
  issn         = {{0218-1940}},
  journal      = {{INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING}},
  keywords     = {{Knowledge graphs,embeddings,graph comparison,benchmark}},
  language     = {{eng}},
  number       = {{1}},
  pages        = {{163--184}},
  title        = {{Benchmarking whole knowledge graph embedding techniques}},
  url          = {{http://doi.org/10.1142/S0218194023500481}},
  volume       = {{34}},
  year         = {{2024}},
}

Altmetric
View in Altmetric
Web of Science
Times cited: