Advanced search
Add to list

Lost in post-editing: Identifying the MT error types that are most problematic for student post-editors

Joke Daems (UGent) , Lieve Macken (UGent) , Sonia Vandepitte (UGent) and Robert Hartsuiker (UGent)
(2014)
Author
Organization
Abstract
Post-editing machine translation (MT) is an important step towards high quality translations. In order to better understand the post-editing process, a better understanding of the relationship between MT output and the post-editing of this output is necessary. While automatic evaluation metrics such as the widely used BLEU can be used to compare the overall quality of different MT systems, a more detailed error analysis is necessary to identify the typical errors that appear in MT output and the subsequent post-editing. By using a fine-grained Translation Quality Assessment approach and grouping translation errors into source text-related error sets, we link MT errors to errors after post-editing to examine their relationship. We are mainly interested in answering the following questions: What (and how many) MT errors are solved by post-editors, what (and how many) problems occur in post-edited MT and which (and how many) of these originate from MT? We present the results of a pilot study in which student translators post-edited newspaper articles and user documentation from English into Dutch. We found that the MT errors that student post-editors most easily correct are grammatical errors, whereas e.g. wrong collocations, word sense disambiguation errors and the misspelling of compounds prove to be more challenging. As such, we can identify the types of errors that post-editor training should pay more attention to.

Citation

Please use this url to cite or link to this publication:

MLA
Daems, Joke, et al. Lost in Post-Editing: Identifying the MT Error Types That Are Most Problematic for Student Post-Editors. 2014.
APA
Daems, J., Macken, L., Vandepitte, S., & Hartsuiker, R. (2014). Lost in post-editing: Identifying the MT error types that are most problematic for student post-editors.
Chicago author-date
Daems, Joke, Lieve Macken, Sonia Vandepitte, and Robert Hartsuiker. 2014. “Lost in Post-Editing: Identifying the MT Error Types That Are Most Problematic for Student Post-Editors.”
Chicago author-date (all authors)
Daems, Joke, Lieve Macken, Sonia Vandepitte, and Robert Hartsuiker. 2014. “Lost in Post-Editing: Identifying the MT Error Types That Are Most Problematic for Student Post-Editors.”
Vancouver
1.
Daems J, Macken L, Vandepitte S, Hartsuiker R. Lost in post-editing: Identifying the MT error types that are most problematic for student post-editors. 2014.
IEEE
[1]
J. Daems, L. Macken, S. Vandepitte, and R. Hartsuiker, “Lost in post-editing: Identifying the MT error types that are most problematic for student post-editors.” 2014.
@misc{7197248,
  abstract     = {{Post-editing machine translation (MT) is an important step towards high quality translations. In order to better understand the post-editing process, a better understanding of the relationship between MT output and the post-editing of this output is necessary. While automatic evaluation metrics such as the widely used BLEU can be used to compare the overall quality of different MT systems, a more detailed error analysis is necessary to identify the typical errors that appear in MT output and the subsequent post-editing.  

By using a fine-grained Translation Quality Assessment approach and grouping translation errors into source text-related error sets, we link MT errors to errors after post-editing to examine their relationship. We are mainly interested in answering the following questions: What (and how many) MT errors are solved by post-editors, what (and how many) problems occur in post-edited MT and which (and how many) of these originate from MT?

We present the results of a pilot study in which student translators post-edited newspaper articles and user documentation from English into Dutch. We found that the MT errors that student post-editors most easily correct are grammatical errors, whereas e.g. wrong collocations, word sense disambiguation errors and the misspelling of compounds prove to be more challenging. As such, we can identify the types of errors that post-editor training should pay more attention to.}},
  author       = {{Daems, Joke and Macken, Lieve and Vandepitte, Sonia and Hartsuiker, Robert}},
  language     = {{eng}},
  title        = {{Lost in post-editing: Identifying the MT error types that are most problematic for student post-editors}},
  year         = {{2014}},
}