D-terminer : online demo for monolingual and bilingual automatic term extraction

Author
Ayla Rigouts Terryn, Veronique Hoste, and Els Lefever
Organization
Abstract
This contribution presents D-Terminer: an open access, online demo for monolingual and multilingual automatic term extraction from parallel corpora. The monolingual term extraction is based on a recurrent neural network, with a supervised methodology that relies on pretrained embeddings. Candidate terms can be tagged in their original context and there is no need for a large corpus, as the methodology will work even for single sentences. With the bilingual term extraction from parallel corpora, potentially equivalent candidate term pairs are extracted from translation memories and manual annotation of the results shows that good equivalents are found for most candidate terms. Accompanying the release of the demo is an updated version of the ACTER Annotated Corpora for Term Extraction Research (version 1.5).
Keywords
automatic term extraction, multilingual term extraction, terminology

Downloads

  • LREC DTerminer published.pdf
    • full text (Published version) | open access | PDF | 1.34 MB

Citation

MLA
Rigouts Terryn, Ayla, et al. “D-Terminer : Online Demo for Monolingual and Bilingual Automatic Term Extraction.” Proceedings of the Workshop on Terminology in the 21st Century : Many Faces, Many Places, edited by Rute Costa et al., European Language Resources Association (ELRA), 2022, pp. 33–40.
APA
Rigouts Terryn, A., Hoste, V., & Lefever, E. (2022). D-terminer : online demo for monolingual and bilingual automatic term extraction. In R. Costa, S. Carvalho, A. Ostroski Anic, & A. F. Khan (Eds.), Proceedings of the Workshop on Terminology in the 21st century : many faces, many places (pp. 33–40). Marseille, France: European Language Resources Association (ELRA).
Chicago author-date
Rigouts Terryn, Ayla, Veronique Hoste, and Els Lefever. 2022. “D-Terminer : Online Demo for Monolingual and Bilingual Automatic Term Extraction.” In Proceedings of the Workshop on Terminology in the 21st Century : Many Faces, Many Places, edited by Rute Costa, Sara Carvalho, Ana Ostroski Anic, and Anas Fahad Khan, 33–40. Marseille, France: European Language Resources Association (ELRA).
Chicago author-date (all authors)
Rigouts Terryn, Ayla, Veronique Hoste, and Els Lefever. 2022. “D-Terminer : Online Demo for Monolingual and Bilingual Automatic Term Extraction.” In Proceedings of the Workshop on Terminology in the 21st Century : Many Faces, Many Places, edited by Rute Costa, Sara Carvalho, Ana Ostroski Anic, and Anas Fahad Khan, 33–40. Marseille, France: European Language Resources Association (ELRA).
Vancouver
1. Rigouts Terryn A, Hoste V, Lefever E. D-terminer : online demo for monolingual and bilingual automatic term extraction. In: Costa R, Carvalho S, Ostroski Anic A, Khan AF, editors. Proceedings of the Workshop on Terminology in the 21st century : many faces, many places. Marseille, France: European Language Resources Association (ELRA); 2022. p. 33–40.
IEEE
[1] A. Rigouts Terryn, V. Hoste, and E. Lefever, “D-terminer : online demo for monolingual and bilingual automatic term extraction,” in Proceedings of the Workshop on Terminology in the 21st century : many faces, many places, Marseille, France, 2022, pp. 33–40.
BibTeX
@inproceedings{8757222,
  abstract     = {{This contribution presents D-Terminer: an open access, online demo for monolingual and multilingual automatic term extraction from parallel corpora. The monolingual term extraction is based on a recurrent neural network, with a supervised methodology that relies on pretrained embeddings. Candidate terms can be tagged in their original context and there is no need for a large corpus, as the methodology will work even for single sentences. With the bilingual term extraction from parallel corpora, potentially equivalent candidate term pairs are extracted from translation memories and manual annotation of the results shows that good equivalents are found for most candidate terms. Accompanying the release of the demo is an updated version of the ACTER Annotated Corpora for Term Extraction Research (version 1.5).}},
  author       = {{Rigouts Terryn, Ayla and Hoste, Veronique and Lefever, Els}},
  booktitle    = {{Proceedings of the Workshop on Terminology in the 21st century : many faces, many places}},
  editor       = {{Costa, Rute and Carvalho, Sara and Ostroski Anic, Ana and Khan, Anas Fahad}},
  isbn         = {{9791095546955}},
  keywords     = {{automatic term extraction,multilingual term extraction,terminology}},
  language     = {{eng}},
  location     = {{Marseille, France}},
  pages        = {{33--40}},
  publisher    = {{European Language Resources Association (ELRA)}},
  title        = {{D-terminer : online demo for monolingual and bilingual automatic term extraction}},
  url          = {{https://lt3.ugent.be/dterminer/}},
  year         = {{2022}},
}