Advanced search
1 file | 2.61 MB Add to list

Prospects for Dutch emotion detection : insights from the new EmotioNL dataset

Luna De Bruyne (UGent) , Orphée De Clercq (UGent) and Veronique Hoste (UGent)
Author
Organization
Project
Abstract
Although emotion detection has become a crucial research direction in NLP, the main focus is on English resources and data. The main obstacles for more specialized emotion detection are the lack of annotated data in smaller languages and the limited emotion taxonomy. In a first step towards improving emotion detection for Dutch, we present EmotioNL, an emotion dataset consisting of 1,000 Dutch tweets and 1,000 captions from TV-shows, annotated with emotion categories (anger, fear, joy, love, sadness and neutral) and dimensions (valence, arousal and dominance). We evaluate the state-of-the-art Dutch transformer models BERTje and RobBERT on this new dataset, investigate model generalizability across domains and perform a thorough error analysis based on the Component Process Model of emotions.
Keywords
lt3

Downloads

  • 14 Prospects DeBruyne.pdf
    • full text (Published version)
    • |
    • open access
    • |
    • PDF
    • |
    • 2.61 MB

Citation

Please use this url to cite or link to this publication:

MLA
De Bruyne, Luna, et al. “Prospects for Dutch Emotion Detection : Insights from the New EmotioNL Dataset.” COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL, vol. 11, 2021, pp. 231–55.
APA
De Bruyne, L., De Clercq, O., & Hoste, V. (2021). Prospects for Dutch emotion detection : insights from the new EmotioNL dataset. COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL, 11, 231–255.
Chicago author-date
De Bruyne, Luna, Orphée De Clercq, and Veronique Hoste. 2021. “Prospects for Dutch Emotion Detection : Insights from the New EmotioNL Dataset.” COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL 11: 231–55.
Chicago author-date (all authors)
De Bruyne, Luna, Orphée De Clercq, and Veronique Hoste. 2021. “Prospects for Dutch Emotion Detection : Insights from the New EmotioNL Dataset.” COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL 11: 231–255.
Vancouver
1.
De Bruyne L, De Clercq O, Hoste V. Prospects for Dutch emotion detection : insights from the new EmotioNL dataset. COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL. 2021;11:231–55.
IEEE
[1]
L. De Bruyne, O. De Clercq, and V. Hoste, “Prospects for Dutch emotion detection : insights from the new EmotioNL dataset,” COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL, vol. 11, pp. 231–255, 2021.
@article{8743302,
  abstract     = {{Although emotion detection has become a crucial research direction in NLP, the main focus is on English resources and data. The main obstacles for more specialized emotion detection are the lack of annotated data in smaller languages and the limited emotion taxonomy. In a first step towards improving emotion detection for Dutch, we present EmotioNL, an emotion dataset consisting of 1,000 Dutch tweets and 1,000 captions from TV-shows, annotated with emotion categories (anger, fear, joy, love, sadness and neutral) and dimensions (valence, arousal and dominance). We evaluate the state-of-the-art Dutch transformer models BERTje and RobBERT on this new dataset, investigate model generalizability across domains and perform a thorough error analysis based on the Component Process Model of emotions.}},
  author       = {{De Bruyne, Luna and De Clercq, Orphée and Hoste, Veronique}},
  issn         = {{2211-4009}},
  journal      = {{COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL}},
  keywords     = {{lt3}},
  language     = {{eng}},
  pages        = {{231--255}},
  title        = {{Prospects for Dutch emotion detection : insights from the new EmotioNL dataset}},
  url          = {{https://www.clinjournal.org/clinj/article/view/138}},
  volume       = {{11}},
  year         = {{2021}},
}