Advanced search
1 file | 1.57 MB Add to list

Automatic detection of cyberbullying in social media text

(2018) PLOS ONE. 13(10).
Author
Organization
Abstract
While social media offer great communication opportunities, they also increase the vulnerability of young people to threatening situations online. Recent studies report that cyberbullying constitutes a growing problem among youngsters. Successful prevention depends on the adequate detection of potentially harmful messages and the information overload on the Web requires intelligent systems to identify potential risks automatically. The focus of this paper is on automatic cyberbullying detection in social media text by modelling posts written by bullies, victims, and bystanders of online bullying. We describe the collection and fine-grained annotation of a cyberbullying corpus for English and Dutch and perform a series of binary classification experiments to determine the feasibility of automatic cyberbullying detection. We make use of linear support vector machines exploiting a rich feature set and investigate which information sources contribute the most for the task. Experiments on a hold-out test set reveal promising results for the detection of cyberbullying-related posts. After optimisation of the hyperparameters, the classifier yields an F1 score of 64% and 61% for English and Dutch respectively, and considerably outperforms baseline systems.
Keywords
cyberbullying detection, machine learning, social media, text mining, lt3

Downloads

  • journal.pone.0203794.pdf
    • full text
    • |
    • open access
    • |
    • PDF
    • |
    • 1.57 MB

Citation

Please use this url to cite or link to this publication:

MLA
Van Hee, Cynthia, et al. “Automatic Detection of Cyberbullying in Social Media Text.” PLOS ONE, edited by Hussein Suleman, vol. 13, no. 10, Public Library of Science, 2018, doi:10.1371/journal.pone.0203794.
APA
Van Hee, C., Jacobs, G., Emmery, C., Desmet, B., Lefever, E., Verhoeven, B., … Hoste, V. (2018). Automatic detection of cyberbullying in social media text. PLOS ONE, 13(10). https://doi.org/10.1371/journal.pone.0203794
Chicago author-date
Van Hee, Cynthia, Gilles Jacobs, Chris Emmery, Bart Desmet, Els Lefever, Ben Verhoeven, Guy De Pauw, Walter Daelemans, and Veronique Hoste. 2018. “Automatic Detection of Cyberbullying in Social Media Text.” Edited by Hussein Suleman. PLOS ONE 13 (10). https://doi.org/10.1371/journal.pone.0203794.
Chicago author-date (all authors)
Van Hee, Cynthia, Gilles Jacobs, Chris Emmery, Bart Desmet, Els Lefever, Ben Verhoeven, Guy De Pauw, Walter Daelemans, and Veronique Hoste. 2018. “Automatic Detection of Cyberbullying in Social Media Text.” Ed by. Hussein Suleman. PLOS ONE 13 (10). doi:10.1371/journal.pone.0203794.
Vancouver
1.
Van Hee C, Jacobs G, Emmery C, Desmet B, Lefever E, Verhoeven B, et al. Automatic detection of cyberbullying in social media text. Suleman H, editor. PLOS ONE. 2018;13(10).
IEEE
[1]
C. Van Hee et al., “Automatic detection of cyberbullying in social media text,” PLOS ONE, vol. 13, no. 10, 2018.
@article{8573574,
  abstract     = {{While social media offer great communication opportunities, they also increase the vulnerability of young people to threatening situations online. Recent studies report that cyberbullying constitutes a growing problem among youngsters. Successful prevention depends on the adequate detection of potentially harmful messages and the information overload on the Web requires intelligent systems to identify potential risks automatically. The focus of this paper is on automatic cyberbullying detection in social media text by modelling posts written by bullies, victims, and bystanders of online bullying. We describe the collection and fine-grained annotation of a cyberbullying corpus for English and Dutch and perform a series of binary classification experiments to determine the feasibility of automatic cyberbullying detection. We make use of linear support vector machines exploiting a rich feature set and investigate which information sources contribute the most for the task. Experiments on a hold-out test set reveal promising results for the detection of cyberbullying-related posts. After optimisation of the hyperparameters, the classifier yields an F1 score of 64% and 61% for English and Dutch respectively, and considerably outperforms baseline systems.}},
  articleno    = {{e0203794}},
  author       = {{Van Hee, Cynthia and Jacobs, Gilles and Emmery, Chris and Desmet, Bart and Lefever, Els and Verhoeven, Ben and De Pauw, Guy and Daelemans, Walter and Hoste, Veronique}},
  editor       = {{Suleman, Hussein}},
  issn         = {{1932-6203}},
  journal      = {{PLOS ONE}},
  keywords     = {{cyberbullying detection,machine learning,social media,text mining,lt3}},
  language     = {{eng}},
  number       = {{10}},
  pages        = {{22}},
  publisher    = {{Public Library of Science}},
  title        = {{Automatic detection of cyberbullying in social media text}},
  url          = {{http://doi.org/10.1371/journal.pone.0203794}},
  volume       = {{13}},
  year         = {{2018}},
}

Altmetric
View in Altmetric
Web of Science
Times cited: