SMHD : a large-scale resource for exploring online language usage for multiple mental health conditions

Author
Arman Cohan, Bart Desmet, Andrew Yates, Luca Soldaini, Sean MacAvaney, and Nazli Goharian
Abstract
Mental health is a significant and growing public health concern. As language usage can be leveraged to obtain crucial insights into mental health conditions, there is a need for large-scale, labeled, mental health-related datasets of users who have been diagnosed with one or more of such conditions. In this paper, we investigate the creation of high-precision patterns to identify self-reported diagnoses of nine different mental health conditions, and obtain high-quality labeled data without the need for manual labelling. We introduce the SMHD (Self-reported Mental Health Diagnoses) dataset and make it available. SMHD is a novel large dataset of social media posts from users with one or multiple mental health conditions along with matched control users. We examine distinctions in users’ language, as measured by linguistic and psychological variables. We further explore text classification methods to identify individuals with mental conditions through their language.
Keywords
LT3

Downloads

  • SMHD-COLING18.pdf (full text | open access | PDF | 823.75 KB)

Citation

Please use this url to cite or link to this publication:

MLA
Cohan, Arman, et al. “SMHD : A Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions.” Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018), Association for Computational Linguistics (ACL), 2018, pp. 1485–97.
APA
Cohan, A., Desmet, B., Yates, A., Soldaini, L., MacAvaney, S., & Goharian, N. (2018). SMHD : a large-scale resource for exploring online language usage for multiple mental health conditions. Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018), 1485–1497. Santa Fe, USA: Association for Computational Linguistics (ACL).
Chicago author-date
Cohan, Arman, Bart Desmet, Andrew Yates, Luca Soldaini, Sean MacAvaney, and Nazli Goharian. 2018. “SMHD : A Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions.” In Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018), 1485–97. Santa Fe, USA: Association for Computational Linguistics (ACL).
Chicago author-date (all authors)
Cohan, Arman, Bart Desmet, Andrew Yates, Luca Soldaini, Sean MacAvaney, and Nazli Goharian. 2018. “SMHD : A Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions.” In Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018), 1485–1497. Santa Fe, USA: Association for Computational Linguistics (ACL).
Vancouver
1. Cohan A, Desmet B, Yates A, Soldaini L, MacAvaney S, Goharian N. SMHD : a large-scale resource for exploring online language usage for multiple mental health conditions. In: Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018). Santa Fe, USA: Association for Computational Linguistics (ACL); 2018. p. 1485–97.
IEEE
[1] A. Cohan, B. Desmet, A. Yates, L. Soldaini, S. MacAvaney, and N. Goharian, “SMHD : a large-scale resource for exploring online language usage for multiple mental health conditions,” in Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018), Santa Fe, USA, 2018, pp. 1485–1497.
BibTeX
@inproceedings{8573298,
  abstract     = {{Mental health is a significant and growing public health concern. As language usage can be leveraged to obtain crucial insights into mental health conditions, there is a need for large-scale, labeled, mental health-related datasets of users who have been diagnosed with one or more of such conditions. In this paper, we investigate the creation of high-precision patterns to identify self-reported diagnoses of nine different mental health conditions, and obtain high-quality labeled data without the need for manual labelling. We introduce the SMHD (Self-reported Mental Health Diagnoses) dataset and make it available. SMHD is a novel large dataset of social media posts from users with one or multiple mental health conditions along with matched control users. We examine distinctions in users’ language, as measured by linguistic and psychological variables. We further explore text classification methods to identify individuals with mental conditions through their language.}},
  articleno    = {{C18-1126}},
  author       = {{Cohan, Arman and Desmet, Bart and Yates, Andrew and Soldaini, Luca and MacAvaney, Sean and Goharian, Nazli}},
  booktitle    = {{Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018)}},
  isbn         = {{9781948087506}},
  keywords     = {{LT3}},
  language     = {{eng}},
  location     = {{Santa Fe, USA}},
  pages        = {{1485--1497}},
  publisher    = {{Association for Computational Linguistics (ACL)}},
  title        = {{SMHD : a large-scale resource for exploring online language usage for multiple mental health conditions}},
  url          = {{https://aclanthology.info/events/coling-2018#C18-1}},
  year         = {{2018}},
}