Advanced search
1 file | 377.55 KB Add to list

BESOCIAL : a sustainable knowledge graph-based workflow for social media archiving

Author
Organization
Abstract
Social media as infrastructure for public discourse provide valuable information that needs to be preserved. Several tools for social media harvesting exist, but still only fragmented workflows may be formed with different combinations of such tools. On top of that, social media data but also preservation-related metadata standards are heterogeneous, resulting in a costly manual process. In the framework of BESOCIAL at the Royal Library of Belgium (KBR), we develop a sustainable social media archiving workflow that integrates heterogeneous data sources in a Europeana and PREMIS-based data model to describe data preserved by open source tools. This allows data stewardship on a uniform representation and we generate metadata records automatically via queries. In this paper, we present a comparison of social media harvesting tools and our Knowledge Graph-based solution which reuses off-the-shelf open source tools to harvest social media and automatically generate preservation-related metadata records. We validate our solution by generating Encoded Archival Description (EAD) and bibliographic MARC records for preservation of harvested social media collections from Twitter collected at KBR. Other archiving institutions can build upon our solution and customize it to their own social media archiving policies.

Downloads

  • DS453.pdf
    • full text (Published version)
    • |
    • open access
    • |
    • PDF
    • |
    • 377.55 KB

Citation

Please use this url to cite or link to this publication:

MLA
Lieber, Sven, et al. “BESOCIAL : A Sustainable Knowledge Graph-Based Workflow for Social Media Archiving.” Further with Knowledge Graphs : Proceedings of the 17th International Conference on Semantic Systems, edited by Mehwish Alam et al., vol. 53, IOS, 2021, pp. 198–212, doi:10.3233/ssw210045.
APA
Lieber, S., Van Assche, D., Chambers, S., Messens, F., Geeraert, F., Birkholz, J. M., & Dimou, A. (2021). BESOCIAL : a sustainable knowledge graph-based workflow for social media archiving. In M. Alam, P. Groth, V. de Boer, T. Pellegrini, H. J. Pandit, E. Montiel, … A. Meroño-Peñuela (Eds.), Further with knowledge graphs : proceedings of the 17th international Conference on Semantic Systems (Vol. 53, pp. 198–212). Amsterdam, the Netherlands: IOS. https://doi.org/10.3233/ssw210045
Chicago author-date
Lieber, Sven, Dylan Van Assche, Sally Chambers, Fien Messens, Friedel Geeraert, Julie M. Birkholz, and Anastasia Dimou. 2021. “BESOCIAL : A Sustainable Knowledge Graph-Based Workflow for Social Media Archiving.” In Further with Knowledge Graphs : Proceedings of the 17th International Conference on Semantic Systems, edited by Mehwish Alam, Paul Groth, Victor de Boer, Tassilo Pellegrini, Harshvardhan J. Pandit, Elena Montiel, Víctor Rodríguez Doncel, Barbara McGillivray, and Albert Meroño-Peñuela, 53:198–212. IOS. https://doi.org/10.3233/ssw210045.
Chicago author-date (all authors)
Lieber, Sven, Dylan Van Assche, Sally Chambers, Fien Messens, Friedel Geeraert, Julie M. Birkholz, and Anastasia Dimou. 2021. “BESOCIAL : A Sustainable Knowledge Graph-Based Workflow for Social Media Archiving.” In Further with Knowledge Graphs : Proceedings of the 17th International Conference on Semantic Systems, ed by. Mehwish Alam, Paul Groth, Victor de Boer, Tassilo Pellegrini, Harshvardhan J. Pandit, Elena Montiel, Víctor Rodríguez Doncel, Barbara McGillivray, and Albert Meroño-Peñuela, 53:198–212. IOS. doi:10.3233/ssw210045.
Vancouver
1.
Lieber S, Van Assche D, Chambers S, Messens F, Geeraert F, Birkholz JM, et al. BESOCIAL : a sustainable knowledge graph-based workflow for social media archiving. In: Alam M, Groth P, de Boer V, Pellegrini T, Pandit HJ, Montiel E, et al., editors. Further with knowledge graphs : proceedings of the 17th international Conference on Semantic Systems. IOS; 2021. p. 198–212.
IEEE
[1]
S. Lieber et al., “BESOCIAL : a sustainable knowledge graph-based workflow for social media archiving,” in Further with knowledge graphs : proceedings of the 17th international Conference on Semantic Systems, Amsterdam, the Netherlands, 2021, vol. 53, pp. 198–212.
@inproceedings{8720643,
  abstract     = {{Social media as infrastructure for public discourse provide valuable information that needs to be preserved. Several tools for social media harvesting exist, but still only fragmented workflows may be formed with different combinations of such tools. On top of that, social media data but also preservation-related metadata standards are heterogeneous, resulting in a costly manual process. In the framework of BESOCIAL at the Royal Library of Belgium (KBR), we develop a sustainable social media archiving workflow that integrates heterogeneous data sources in a Europeana and PREMIS-based data model to describe data preserved by open source tools. This allows data stewardship on a uniform representation and we generate metadata records automatically via queries. In this paper, we present a comparison of social media harvesting tools and our Knowledge Graph-based solution which reuses off-the-shelf open source tools to harvest social media and automatically generate preservation-related metadata records. We validate our solution by generating Encoded Archival Description (EAD) and bibliographic MARC records for preservation of harvested social media collections from Twitter collected at KBR. Other archiving institutions can build upon our solution and customize it to their own social media archiving policies.}},
  author       = {{Lieber, Sven and Van Assche, Dylan and Chambers, Sally and Messens, Fien and Geeraert, Friedel and Birkholz, Julie M. and Dimou, Anastasia}},
  booktitle    = {{Further with knowledge graphs : proceedings of the 17th international Conference on Semantic Systems}},
  editor       = {{Alam, Mehwish and Groth, Paul and de Boer, Victor and Pellegrini, Tassilo and Pandit, Harshvardhan J. and Montiel, Elena and Doncel, Víctor Rodríguez and McGillivray, Barbara and Meroño-Peñuela, Albert}},
  isbn         = {{9781643682006}},
  issn         = {{1868-1158}},
  language     = {{eng}},
  location     = {{Amsterdam, the Netherlands}},
  pages        = {{198--212}},
  publisher    = {{IOS}},
  title        = {{BESOCIAL : a sustainable knowledge graph-based workflow for social media archiving}},
  url          = {{http://dx.doi.org/10.3233/ssw210045}},
  volume       = {{53}},
  year         = {{2021}},
}

Altmetric
View in Altmetric