Advanced search
2 files | 2.00 MB Add to list

Using an existing website as a queryable low-cost LOD publishing interface

Brecht Van de Vyvere (UGent) , Ruben Taelman (UGent) , Pieter Colpaert (UGent) and Ruben Verborgh (UGent)
Author
Organization
Abstract
Maintaining an Open Dataset comes at an extra recurring cost when it is published in a dedicated Web interface. As there is not often a direct financial return from publishing a dataset publicly, these extra costs need to be minimized. Therefore we want to explore reusing existing infrastructure by enriching existing websites with Linked Data. In this demonstrator, we advised the data owner to annotate a digital heritage website with JSON-LD snippets, resulting in a dataset of more than three million triples that is now available and officially maintained. The website itself is paged, and thus hydra partial collection view controls were added in the snippets. We then extended the modular query engine Comunica to support following page controls and extracting data from HTML documents while querying. This way, a SPARQL or GraphQL query over multiple heterogeneous data sources can power automated data reuse. While the query performance on such an interface is visibly poor, it becomes easy to create composite data dumps. As a result of implementing these building blocks in Comunica, any paged collection and enriched HTML page now becomes queryable by the query engine. This enables heterogenous data interfaces to share functionality and become technically interoperable.
Keywords
JSON-LD data snippets, Hypermedia web APIs, Intelligent agents, Digital humanities, Linked Open Data, Semantic web

Downloads

  • DS262 i.pdf
    • colophon/title page
    • |
    • open access
    • |
    • PDF
    • |
    • 60.70 KB
  • (...).pdf
    • full text (Published version)
    • |
    • UGent only
    • |
    • PDF
    • |
    • 1.94 MB

Citation

Please use this url to cite or link to this publication:

MLA
Van de Vyvere, Brecht, et al. “Using an Existing Website as a Queryable Low-Cost LOD Publishing Interface.” SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS, edited by Pascal Hitzler et al., vol. 11762, Springer, 2019, pp. 176–80, doi:10.1007/978-3-030-32327-1_35.
APA
Van de Vyvere, B., Taelman, R., Colpaert, P., & Verborgh, R. (2019). Using an existing website as a queryable low-cost LOD publishing interface. In P. Hitzler, S. Kirrane, O. Hartig, V. de Boer, M.-E. Vidal, M. Maleshkova, … R. Verborgh (Eds.), SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS (Vol. 11762, pp. 176–180). Portorož, Slovenia: Springer. https://doi.org/10.1007/978-3-030-32327-1_35
Chicago author-date
Van de Vyvere, Brecht, Ruben Taelman, Pieter Colpaert, and Ruben Verborgh. 2019. “Using an Existing Website as a Queryable Low-Cost LOD Publishing Interface.” In SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS, edited by Pascal Hitzler, Sabrina Kirrane, Olaf Hartig, Victor de Boer, Maria-Esther Vidal, Maria Maleshkova, Stefan Schlobach, et al., 11762:176–80. Springer. https://doi.org/10.1007/978-3-030-32327-1_35.
Chicago author-date (all authors)
Van de Vyvere, Brecht, Ruben Taelman, Pieter Colpaert, and Ruben Verborgh. 2019. “Using an Existing Website as a Queryable Low-Cost LOD Publishing Interface.” In SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS, ed by. Pascal Hitzler, Sabrina Kirrane, Olaf Hartig, Victor de Boer, Maria-Esther Vidal, Maria Maleshkova, Stefan Schlobach, Karl Hammar, Nelia Lasierra, Steffen Stadtmüller, Katja Hose, and Ruben Verborgh, 11762:176–180. Springer. doi:10.1007/978-3-030-32327-1_35.
Vancouver
1.
Van de Vyvere B, Taelman R, Colpaert P, Verborgh R. Using an existing website as a queryable low-cost LOD publishing interface. In: Hitzler P, Kirrane S, Hartig O, de Boer V, Vidal M-E, Maleshkova M, et al., editors. SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS. Springer; 2019. p. 176–80.
IEEE
[1]
B. Van de Vyvere, R. Taelman, P. Colpaert, and R. Verborgh, “Using an existing website as a queryable low-cost LOD publishing interface,” in SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS, Portorož, Slovenia, 2019, vol. 11762, pp. 176–180.
@inproceedings{8629598,
  abstract     = {Maintaining an Open Dataset comes at an extra recurring cost when it is published in a dedicated Web interface. As there is not often a direct financial return from publishing a dataset publicly, these extra costs need to be minimized. Therefore we want to explore reusing existing infrastructure by enriching existing websites with Linked Data. In this demonstrator, we advised the data owner to annotate a digital heritage website with JSON-LD snippets, resulting in a dataset of more than three million triples that is now available and officially maintained. The website itself is paged, and thus hydra partial collection view controls were added in the snippets. We then extended the modular query engine Comunica to support following page controls and extracting data from HTML documents while querying. This way, a SPARQL or GraphQL query over multiple heterogeneous data sources can power automated data reuse. While the query performance on such an interface is visibly poor, it becomes easy to create composite data dumps. As a result of implementing these building blocks in Comunica, any paged collection and enriched HTML page now becomes queryable by the query engine. This enables heterogenous data interfaces to share functionality and become technically interoperable.},
  author       = {Van de Vyvere, Brecht and Taelman, Ruben and Colpaert, Pieter and Verborgh, Ruben},
  booktitle    = {SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS},
  editor       = {Hitzler, Pascal and Kirrane, Sabrina and Hartig, Olaf and de Boer, Victor and Vidal, Maria-Esther and Maleshkova, Maria and Schlobach, Stefan and Hammar, Karl and Lasierra, Nelia and Stadtmüller, Steffen and Hose, Katja and Verborgh, Ruben},
  isbn         = {9783030323264},
  issn         = {0302-9743},
  keywords     = {JSON-LD data snippets,Hypermedia web APIs,Intelligent agents,Digital humanities,Linked Open Data,Semantic web},
  language     = {eng},
  location     = {Portorož, Slovenia},
  pages        = {176--180},
  publisher    = {Springer},
  title        = {Using an existing website as a queryable low-cost LOD publishing interface},
  url          = {http://dx.doi.org/10.1007/978-3-030-32327-1_35},
  volume       = {11762},
  year         = {2019},
}

Altmetric
View in Altmetric
Web of Science
Times cited: