Advanced search
2 files | 787.34 KB Add to list

Predicting suicide risk from online postings in Reddit : the UGent-IDLab submission to the CLPysch 2019 Shared Task A

Author
Organization
Abstract
This paper describes IDLab’s text classification systems submitted to Task A as part of the CLPsych 2019 shared task. The aim of this shared task was to develop automated systems that predict the degree of suicide risk of people based on their posts on Reddit. Bag-of-words features, emotion features and post level predictions are used to derive user-level predictions. Linear models and ensembles of these models are used to predict final scores. We find that predicting fine-grained risk levels is much more difficult than flagging potentially at-risk users. Furthermore, we do not find clear added value from building richer ensembles compared to simple baselines, given the available training data and the nature of the prediction task.

Downloads

  • 7493.pdf
    • full text (Published version)
    • |
    • open access
    • |
    • PDF
    • |
    • 454.01 KB
  • 7493 i.pdf
    • full text
    • |
    • open access
    • |
    • PDF
    • |
    • 333.32 KB

Citation

Please use this url to cite or link to this publication:

MLA
Bitew, Semere Kiros, et al. “Predicting Suicide Risk from Online Postings in Reddit : The UGent-IDLab Submission to the CLPysch 2019 Shared Task A.” Computationallinguistics and Clinical Psychology : From Keyboard to Clinic : Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology, Association for Computational Linguistics (ACL), 2019, pp. 158–61, doi:10.18653/v1/W19-3019.
APA
Bitew, S. K., Bekoulis, I., Deleu, J., Sterckx, L., Zaporojets, K., Demeester, T., & Develder, C. (2019). Predicting suicide risk from online postings in Reddit : the UGent-IDLab submission to the CLPysch 2019 Shared Task A. Computationallinguistics and Clinical Psychology : From Keyboard to Clinic : Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology, 158–161. https://doi.org/10.18653/v1/W19-3019
Chicago author-date
Bitew, Semere Kiros, Ioannis Bekoulis, Johannes Deleu, Lucas Sterckx, Klim Zaporojets, Thomas Demeester, and Chris Develder. 2019. “Predicting Suicide Risk from Online Postings in Reddit : The UGent-IDLab Submission to the CLPysch 2019 Shared Task A.” In Computationallinguistics and Clinical Psychology : From Keyboard to Clinic : Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology, 158–61. Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/W19-3019.
Chicago author-date (all authors)
Bitew, Semere Kiros, Ioannis Bekoulis, Johannes Deleu, Lucas Sterckx, Klim Zaporojets, Thomas Demeester, and Chris Develder. 2019. “Predicting Suicide Risk from Online Postings in Reddit : The UGent-IDLab Submission to the CLPysch 2019 Shared Task A.” In Computationallinguistics and Clinical Psychology : From Keyboard to Clinic : Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology, 158–161. Association for Computational Linguistics (ACL). doi:10.18653/v1/W19-3019.
Vancouver
1.
Bitew SK, Bekoulis I, Deleu J, Sterckx L, Zaporojets K, Demeester T, et al. Predicting suicide risk from online postings in Reddit : the UGent-IDLab submission to the CLPysch 2019 Shared Task A. In: Computationallinguistics and clinical psychology : from keyboard to clinic : Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology. Association for Computational Linguistics (ACL); 2019. p. 158–61.
IEEE
[1]
S. K. Bitew et al., “Predicting suicide risk from online postings in Reddit : the UGent-IDLab submission to the CLPysch 2019 Shared Task A,” in Computationallinguistics and clinical psychology : from keyboard to clinic : Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology, Minneapolis, USA, 2019, pp. 158–161.
@inproceedings{8621471,
  abstract     = {{This paper describes IDLab’s text classification systems submitted to Task A as part of the CLPsych 2019 shared task. The aim of this shared task was to develop automated systems that predict the degree of suicide risk of people based on their posts on Reddit. Bag-of-words features, emotion features and post level predictions are used to derive user-level predictions. Linear models and ensembles of these models are used to predict final scores. We find that predicting fine-grained risk levels is much more difficult than flagging potentially at-risk users. Furthermore, we do not find clear added value from building richer ensembles compared to simple baselines, given the available training data and the nature of the prediction task.}},
  author       = {{Bitew, Semere Kiros and Bekoulis, Ioannis and Deleu, Johannes and Sterckx, Lucas and Zaporojets, Klim and Demeester, Thomas and Develder, Chris}},
  booktitle    = {{Computationallinguistics and clinical psychology : from keyboard to clinic : Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology}},
  isbn         = {{9781948087957}},
  language     = {{eng}},
  location     = {{Minneapolis, USA}},
  pages        = {{158--161}},
  publisher    = {{Association for Computational Linguistics (ACL)}},
  title        = {{Predicting suicide risk from online postings in Reddit : the UGent-IDLab submission to the CLPysch 2019 Shared Task A}},
  url          = {{http://dx.doi.org/10.18653/v1/W19-3019}},
  year         = {{2019}},
}

Altmetric
View in Altmetric