Estimating post-editing time using a gold-standard set of machine translation errors
- Author
- Arda Tezcan (UGent), Veronique Hoste (UGent) and Lieve Macken (UGent)
- Abstract
- With the improved quality of Machine Translation (MT) systems in the last decades, post-editing (the correction of MT errors) has gained importance in Computer-Assisted Translation (CAT) workflows. Depending on the number and the severity of the errors in the MT output, the effort required to post-edit varies from sentence to sentence. The existing Quality Estimation (QE) systems provide quality scores that reflect the quality of an MT output at sentence level or word level. However, they fail to explain the relationship between different types of MT errors and the required post-editing effort to correct them. We suggest a more informative approach to QE in which different types of MT errors are detected in a first step, which are then used to estimate post-editing effort in a second step. In this paper we define the upper boundary of such a system. We use different machine learning methods to estimate Post-Editing Time (PET) by using a gold-standard set of MT errors as features. We show that post-editing time can be estimated with high accuracy when all the translation errors in the MT output are known. Furthermore, we apply feature selection methods and investigate the predictive power of different MT error types on PET. Our results show that the same prediction performance can be achieved by only using a small subset of MT error types, indicating that successful two-step QE systems can be built with less effort in the future, by detecting only the error types with highest predictive power.
- Keywords
- Machine translation, Quality estimation, Post-editing, Machine learning, Feature selection, lt3
Citation
Please use this url to cite or link to this publication: http://hdl.handle.net/1854/LU-8580359
- MLA
- Tezcan, Arda, et al. “Estimating Post-Editing Time Using a Gold-Standard Set of Machine Translation Errors.” COMPUTER SPEECH AND LANGUAGE, vol. 55, Elsevier, 2019, pp. 120–44, doi:10.1016/j.csl.2018.10.005.
- APA
- Tezcan, A., Hoste, V., & Macken, L. (2019). Estimating post-editing time using a gold-standard set of machine translation errors. COMPUTER SPEECH AND LANGUAGE, 55, 120–144. https://doi.org/10.1016/j.csl.2018.10.005
- Chicago author-date
- Tezcan, Arda, Veronique Hoste, and Lieve Macken. 2019. “Estimating Post-Editing Time Using a Gold-Standard Set of Machine Translation Errors.” COMPUTER SPEECH AND LANGUAGE 55: 120–44. https://doi.org/10.1016/j.csl.2018.10.005.
- Chicago author-date (all authors)
- Tezcan, Arda, Veronique Hoste, and Lieve Macken. 2019. “Estimating Post-Editing Time Using a Gold-Standard Set of Machine Translation Errors.” COMPUTER SPEECH AND LANGUAGE 55: 120–144. doi:10.1016/j.csl.2018.10.005.
- Vancouver
- 1. Tezcan A, Hoste V, Macken L. Estimating post-editing time using a gold-standard set of machine translation errors. COMPUTER SPEECH AND LANGUAGE. 2019;55:120–44.
- IEEE
- [1] A. Tezcan, V. Hoste, and L. Macken, “Estimating post-editing time using a gold-standard set of machine translation errors,” COMPUTER SPEECH AND LANGUAGE, vol. 55, pp. 120–144, 2019.
@article{8580359,
  abstract  = {{With the improved quality of Machine Translation (MT) systems in the last decades, post-editing (the correction of MT errors) has gained importance in Computer-Assisted Translation (CAT) workflows. Depending on the number and the severity of the errors in the MT output, the effort required to post-edit varies from sentence to sentence. The existing Quality Estimation (QE) systems provide quality scores that reflect the quality of an MT output at sentence level or word level. However, they fail to explain the relationship between different types of MT errors and the required post-editing effort to correct them. We suggest a more informative approach to QE in which different types of MT errors are detected in a first step, which are then used to estimate post-editing effort in a second step. In this paper we define the upper boundary of such a system. We use different machine learning methods to estimate Post-Editing Time (PET) by using a gold-standard set of MT errors as features. We show that post-editing time can be estimated with high accuracy when all the translation errors in the MT output are known. Furthermore, we apply feature selection methods and investigate the predictive power of different MT error types on PET. Our results show that the same prediction performance can be achieved by only using a small subset of MT error types, indicating that successful two-step QE systems can be built with less effort in the future, by detecting only the error types with highest predictive power.}},
  author    = {{Tezcan, Arda and Hoste, Veronique and Macken, Lieve}},
  issn      = {{0885-2308}},
  journal   = {{COMPUTER SPEECH AND LANGUAGE}},
  keywords  = {{Machine translation,Quality estimation,Post-editing,Machine learning,Feature selection,lt3}},
  language  = {{eng}},
  pages     = {{120--144}},
  publisher = {{Elsevier}},
  title     = {{Estimating post-editing time using a gold-standard set of machine translation errors}},
  url       = {{http://doi.org/10.1016/j.csl.2018.10.005}},
  volume    = {{55}},
  year      = {{2019}},
}