Investigating cross-document event coreference for Dutch
- Author
- Loic De Langhe (UGent) , Orphée De Clercq (UGent) and Veronique Hoste (UGent)
- Organization
- Abstract
- In this paper we present baseline results for Event Coreference Resolution (ECR) in Dutch using gold-standard (i.e non-predicted) event mentions. A newly developed benchmark dataset allows us to properly investigate the possibility of creating ECR systems for both within and cross-document coreference. We give an overview of the state of the art for ECR in other languages, as well as a detailed overview of existing ECR resources. Afterwards, we provide a comparative report on our own dataset. We apply a significant number of approaches that have been shown to attain good results for English ECR including feature-based models, monolingual transformer language models and multilingual language models. The best results were obtained using the monolingual BERTje model. Finally, results for all models are thoroughly analysed and visualised, as to provide insight into the inner workings of ECR and long-distance semantic NLP tasks in general.
Downloads
-
2022.crac-1.9.pdf
- full text (Published version)
- |
- open access
- |
- |
- 839.08 KB
Citation
Please use this url to cite or link to this publication: http://hdl.handle.net/1854/LU-01GPE00TMXQC2VHY0QYQ4ZWJNR
- MLA
- De Langhe, Loic, et al. “Investigating Cross-Document Event Coreference for Dutch.” Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2022), Association for Computational Linguistics, 2022, pp. 88–98.
- APA
- De Langhe, L., De Clercq, O., & Hoste, V. (2022). Investigating cross-document event coreference for Dutch. Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2022), 88–98. Gyeongju, Republic of Korea: Association for Computational Linguistics.
- Chicago author-date
- De Langhe, Loic, Orphée De Clercq, and Veronique Hoste. 2022. “Investigating Cross-Document Event Coreference for Dutch.” In Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2022), 88–98. Gyeongju, Republic of Korea: Association for Computational Linguistics.
- Chicago author-date (all authors)
- De Langhe, Loic, Orphée De Clercq, and Veronique Hoste. 2022. “Investigating Cross-Document Event Coreference for Dutch.” In Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2022), 88–98. Gyeongju, Republic of Korea: Association for Computational Linguistics.
- Vancouver
- 1.De Langhe L, De Clercq O, Hoste V. Investigating cross-document event coreference for Dutch. In: Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2022). Gyeongju, Republic of Korea: Association for Computational Linguistics; 2022. p. 88–98.
- IEEE
- [1]L. De Langhe, O. De Clercq, and V. Hoste, “Investigating cross-document event coreference for Dutch,” in Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2022), Gyeongju, Republic of Korea, 2022, pp. 88–98.
@inproceedings{01GPE00TMXQC2VHY0QYQ4ZWJNR, abstract = {{In this paper we present baseline results for Event Coreference Resolution (ECR) in Dutch using gold-standard (i.e non-predicted) event mentions. A newly developed benchmark dataset allows us to properly investigate the possibility of creating ECR systems for both within and cross-document coreference. We give an overview of the state of the art for ECR in other languages, as well as a detailed overview of existing ECR resources. Afterwards, we provide a comparative report on our own dataset. We apply a significant number of approaches that have been shown to attain good results for English ECR including feature-based models, monolingual transformer language models and multilingual language models. The best results were obtained using the monolingual BERTje model. Finally, results for all models are thoroughly analysed and visualised, as to provide insight into the inner workings of ECR and long-distance semantic NLP tasks in general.}}, author = {{De Langhe, Loic and De Clercq, Orphée and Hoste, Veronique}}, booktitle = {{Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2022)}}, issn = {{2951-2093}}, language = {{eng}}, location = {{Gyeongju, Republic of Korea}}, pages = {{88--98}}, publisher = {{Association for Computational Linguistics}}, title = {{Investigating cross-document event coreference for Dutch}}, url = {{https://aclanthology.org/2022.crac-1.9.pdf}}, year = {{2022}}, }