Advanced search
2 files | 2.00 MB Add to list

Using an existing website as a queryable low-cost LOD publishing interface

Brecht Van de Vyvere (UGent) , Ruben Taelman (UGent) , Pieter Colpaert (UGent) and Ruben Verborgh (UGent)
Author
Organization
Abstract
Maintaining an Open Dataset comes at an extra recurring cost when it is published in a dedicated Web interface. As there is not often a direct financial return from publishing a dataset publicly, these extra costs need to be minimized. Therefore we want to explore reusing existing infrastructure by enriching existing websites with Linked Data. In this demonstrator, we advised the data owner to annotate a digital heritage website with JSON-LD snippets, resulting in a dataset of more than three million triples that is now available and officially maintained. The website itself is paged, and thus hydra partial collection view controls were added in the snippets. We then extended the modular query engine Comunica to support following page controls and extracting data from HTML documents while querying. This way, a SPARQL or GraphQL query over multiple heterogeneous data sources can power automated data reuse. While the query performance on such an interface is visibly poor, it becomes easy to create composite data dumps. As a result of implementing these building blocks in Comunica, any paged collection and enriched HTML page now becomes queryable by the query engine. This enables heterogenous data interfaces to share functionality and become technically interoperable.
Keywords
JSON-LD data snippets, Hypermedia web APIs, Intelligent agents, Digital humanities, Linked Open Data, Semantic web

Downloads

  • DS262 i.pdf
    • colophon/title page
    • |
    • open access
    • |
    • PDF
    • |
    • 60.70 KB
  • (...).pdf
    • full text (Published version)
    • |
    • UGent only
    • |
    • PDF
    • |
    • 1.94 MB

Citation

Please use this url to cite or link to this publication:

MLA
Van de Vyvere, Brecht, et al. “Using an Existing Website as a Queryable Low-Cost LOD Publishing Interface.” The Semantic Web : ESWC 2019 Satellite Events, vol. 11762, Springer, 2019, pp. 176–80.
APA
Van de Vyvere, B., Taelman, R., Colpaert, P., & Verborgh, R. (2019). Using an existing website as a queryable low-cost LOD publishing interface. In The semantic web : ESWC 2019 satellite events (Vol. 11762, pp. 176–180). Portoroz, Slovenia: Springer.
Chicago author-date
Van de Vyvere, Brecht, Ruben Taelman, Pieter Colpaert, and Ruben Verborgh. 2019. “Using an Existing Website as a Queryable Low-Cost LOD Publishing Interface.” In The Semantic Web : ESWC 2019 Satellite Events, 11762:176–80. Springer.
Chicago author-date (all authors)
Van de Vyvere, Brecht, Ruben Taelman, Pieter Colpaert, and Ruben Verborgh. 2019. “Using an Existing Website as a Queryable Low-Cost LOD Publishing Interface.” In The Semantic Web : ESWC 2019 Satellite Events, 11762:176–180. Springer.
Vancouver
1.
Van de Vyvere B, Taelman R, Colpaert P, Verborgh R. Using an existing website as a queryable low-cost LOD publishing interface. In: The semantic web : ESWC 2019 satellite events. Springer; 2019. p. 176–80.
IEEE
[1]
B. Van de Vyvere, R. Taelman, P. Colpaert, and R. Verborgh, “Using an existing website as a queryable low-cost LOD publishing interface,” in The semantic web : ESWC 2019 satellite events, Portoroz, Slovenia, 2019, vol. 11762, pp. 176–180.
@inproceedings{8629598,
  abstract     = {Maintaining an Open Dataset comes at an extra recurring cost when it is published in a dedicated Web interface. As there is not often a direct financial return from publishing a dataset publicly, these extra costs need to be minimized. Therefore we want to explore reusing existing infrastructure by enriching existing websites with Linked Data. In this demonstrator, we advised the data owner to annotate a digital heritage website with JSON-LD snippets, resulting in a dataset of more than three million triples that is now available and officially maintained. The website itself is paged, and thus hydra partial collection view controls were added in the snippets. We then extended the modular query engine Comunica to support following page controls and extracting data from HTML documents while querying. This way, a SPARQL or GraphQL query over multiple heterogeneous data sources can power automated data reuse. While the query performance on such an interface is visibly poor, it becomes easy to create composite data dumps. As a result of implementing these building blocks in Comunica, any paged collection and enriched HTML page now becomes queryable by the query engine. This enables heterogenous data interfaces to share functionality and become technically interoperable.},
  author       = {Van de Vyvere, Brecht and Taelman, Ruben and Colpaert, Pieter and Verborgh, Ruben},
  booktitle    = {The semantic web : ESWC 2019 satellite events},
  isbn         = {9783030323264},
  issn         = {0302-9743},
  keywords     = {JSON-LD data snippets,Hypermedia web APIs,Intelligent agents,Digital humanities,Linked Open Data,Semantic web},
  language     = {eng},
  location     = {Portoroz, Slovenia},
  pages        = {176--180},
  publisher    = {Springer},
  title        = {Using an existing website as a queryable low-cost LOD publishing interface},
  url          = {http://dx.doi.org/10.1007/978-3-030-32327-1_35},
  volume       = {11762},
  year         = {2019},
}

Altmetric
View in Altmetric