
Prospects for Dutch emotion detection : insights from the new EmotioNL dataset
- Author
- Luna De Bruyne (UGent) , Orphée De Clercq (UGent) and Veronique Hoste (UGent)
- Organization
- Project
- Abstract
- Although emotion detection has become a crucial research direction in NLP, the main focus is on English resources and data. The main obstacles for more specialized emotion detection are the lack of annotated data in smaller languages and the limited emotion taxonomy. In a first step towards improving emotion detection for Dutch, we present EmotioNL, an emotion dataset consisting of 1,000 Dutch tweets and 1,000 captions from TV-shows, annotated with emotion categories (anger, fear, joy, love, sadness and neutral) and dimensions (valence, arousal and dominance). We evaluate the state-of-the-art Dutch transformer models BERTje and RobBERT on this new dataset, investigate model generalizability across domains and perform a thorough error analysis based on the Component Process Model of emotions.
- Keywords
- lt3
Downloads
-
14 Prospects DeBruyne.pdf
- full text (Published version)
- |
- open access
- |
- |
- 2.61 MB
Citation
Please use this url to cite or link to this publication: http://hdl.handle.net/1854/LU-8743302
- MLA
- De Bruyne, Luna, et al. “Prospects for Dutch Emotion Detection : Insights from the New EmotioNL Dataset.” COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL, vol. 11, 2021, pp. 231–55.
- APA
- De Bruyne, L., De Clercq, O., & Hoste, V. (2021). Prospects for Dutch emotion detection : insights from the new EmotioNL dataset. COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL, 11, 231–255.
- Chicago author-date
- De Bruyne, Luna, Orphée De Clercq, and Veronique Hoste. 2021. “Prospects for Dutch Emotion Detection : Insights from the New EmotioNL Dataset.” COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL 11: 231–55.
- Chicago author-date (all authors)
- De Bruyne, Luna, Orphée De Clercq, and Veronique Hoste. 2021. “Prospects for Dutch Emotion Detection : Insights from the New EmotioNL Dataset.” COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL 11: 231–255.
- Vancouver
- 1.De Bruyne L, De Clercq O, Hoste V. Prospects for Dutch emotion detection : insights from the new EmotioNL dataset. COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL. 2021;11:231–55.
- IEEE
- [1]L. De Bruyne, O. De Clercq, and V. Hoste, “Prospects for Dutch emotion detection : insights from the new EmotioNL dataset,” COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL, vol. 11, pp. 231–255, 2021.
@article{8743302, abstract = {{Although emotion detection has become a crucial research direction in NLP, the main focus is on English resources and data. The main obstacles for more specialized emotion detection are the lack of annotated data in smaller languages and the limited emotion taxonomy. In a first step towards improving emotion detection for Dutch, we present EmotioNL, an emotion dataset consisting of 1,000 Dutch tweets and 1,000 captions from TV-shows, annotated with emotion categories (anger, fear, joy, love, sadness and neutral) and dimensions (valence, arousal and dominance). We evaluate the state-of-the-art Dutch transformer models BERTje and RobBERT on this new dataset, investigate model generalizability across domains and perform a thorough error analysis based on the Component Process Model of emotions.}}, author = {{De Bruyne, Luna and De Clercq, Orphée and Hoste, Veronique}}, issn = {{2211-4009}}, journal = {{COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS JOURNAL}}, keywords = {{lt3}}, language = {{eng}}, pages = {{231--255}}, title = {{Prospects for Dutch emotion detection : insights from the new EmotioNL dataset}}, url = {{https://www.clinjournal.org/clinj/article/view/138}}, volume = {{11}}, year = {{2021}}, }