Project: ArisToCAT - Assessing The Comprehensibility of Automatic Translations
2017-01-01 – 2020-12-31
- Abstract
Machine translation systems cannot guarantee that the text they produce will be fluent and coherent in both syntax and semantics. Erroneous words and syntax occur frequently in machinetranslated text, leaving the reader to guess parts of the intended message. This project (i) analyzes eye movement data to investigate to what extent the lack of predictability in texts that were created by MT impairs comprehension, and (ii) tries to automatically estimate the comprehensibility of machine-translated text. To tackle the first research objective, we will collect and analyze eye movements of participants reading Dutch machine-translated text. In a first experiment we investigate the impact of different categories of MT errors (syntactic versus semantic, function words versus content words, shortdistance versus long-distance triggers of errors) on comprehension. In a second experiment, the participants read six short machine-translated texts of approximately 300-400 words for comprehension. To tackle the second research objective, an MT comprehensibility estimation system for Dutch will be built. The system takes as input a machine-translated sentence and tries to detect the MT errors that seriously hamper comprehension. We start off with a basic system incorporating baseline features such as sentence length and word frequency and gradually add features derived from language models with increasing complexity, namely n-gram, dependency and neural language models.
-
- Conference Paper
- P1
- open access
GECO-MT : the Ghent Eye-tracking Corpus of Machine Translation
-
It's all in the eyes : using eye-tracking to assess the readability of machine translated literature
-
It’s all in the eyes: an eye tracking experiment to assess the readability of machine translated literature
-
- Journal Article
- A2
- open access
Klassiekers door de vertaalmachine gehaald : Frankensteinmonsters of Treasure Islands?
-
- Conference Paper
- C3
- open access
Literary MT under the magnifying glass : assessing the quality of an NMT-translated Agatha Christie novel
-
Post-editing human translations and revising machine translations : impact on efficiency and quality
-
- Journal Article
- A2
- open access
Gutenberg goes neural : comparing features of Dutch human translations with raw neural machine translation outputs in a corpus of English literary classics
-
- Conference Paper
- C1
- open access
Assessing the Comprehensibility of Automatic Translations (ArisToCAT)
-
- Conference Paper
- P1
- open access
Literary machine translation under the magnifying glass : assessing the quality of an NMT-translated detective novel on document level