Advanced search
1 file | 915.51 KB Add to list

Detecting grammatical errors in machine translation output using dependency parsing and treebank querying

Arda Tezcan (UGent) , Veronique Hoste (UGent) and Lieve Macken (UGent)
Author
Organization
Abstract
Despite the recent advances in the field of machine translation (MT), MT systems cannot guarantee that the sentences they produce will be fluent and coherent in both syntax and semantics. Detecting and highlighting errors in machine-translated sentences can help post-editors to focus on the erroneous fragments that need to be corrected. This paper presents two methods for detecting grammatical errors in Dutch machine-translated text, using dependency parsing and treebank querying. We test our approach on the output of a statistical and a rule-based MT system for English-Dutch and evaluate the performance on sentence and word-level. The results show that our method can be used to detect grammatical errors with high accuracy on sentence-level in both types of MT output.
Keywords
Dependency Parsing, Treebanks, Quality Estimation, Machine Translation

Downloads

  • 4 2 12 Tezcan-3.pdf
    • full text (Published version)
    • |
    • open access
    • |
    • PDF
    • |
    • 915.51 KB

Citation

Please use this url to cite or link to this publication:

MLA
Tezcan, Arda, et al. “Detecting Grammatical Errors in Machine Translation Output Using Dependency Parsing and Treebank Querying.” BALTIC JOURNAL OF MODERN COMPUTING, vol. 4, no. 2, 2016, pp. 203–17.
APA
Tezcan, A., Hoste, V., & Macken, L. (2016). Detecting grammatical errors in machine translation output using dependency parsing and treebank querying. BALTIC JOURNAL OF MODERN COMPUTING, 4(2), 203–217.
Chicago author-date
Tezcan, Arda, Veronique Hoste, and Lieve Macken. 2016. “Detecting Grammatical Errors in Machine Translation Output Using Dependency Parsing and Treebank Querying.” BALTIC JOURNAL OF MODERN COMPUTING 4 (2): 203–17.
Chicago author-date (all authors)
Tezcan, Arda, Veronique Hoste, and Lieve Macken. 2016. “Detecting Grammatical Errors in Machine Translation Output Using Dependency Parsing and Treebank Querying.” BALTIC JOURNAL OF MODERN COMPUTING 4 (2): 203–217.
Vancouver
1.
Tezcan A, Hoste V, Macken L. Detecting grammatical errors in machine translation output using dependency parsing and treebank querying. BALTIC JOURNAL OF MODERN COMPUTING. 2016;4(2):203–17.
IEEE
[1]
A. Tezcan, V. Hoste, and L. Macken, “Detecting grammatical errors in machine translation output using dependency parsing and treebank querying,” BALTIC JOURNAL OF MODERN COMPUTING, vol. 4, no. 2, pp. 203–217, 2016.
@article{7241707,
  abstract     = {{Despite the recent advances in the field of machine translation (MT), MT systems cannot guarantee that the sentences they produce will be fluent and coherent in both syntax and semantics. Detecting and highlighting errors in machine-translated sentences can help post-editors to focus on the erroneous fragments that need to be corrected. This paper presents two methods for detecting grammatical errors in Dutch machine-translated text, using dependency parsing and treebank querying. We test our approach on the output of a statistical and a rule-based MT system for English-Dutch and evaluate the performance on sentence and word-level. The results show that our method can be used to detect grammatical errors with high accuracy on sentence-level in both types of MT output.}},
  author       = {{Tezcan, Arda and Hoste, Veronique and Macken, Lieve}},
  issn         = {{2255-8942}},
  journal      = {{BALTIC JOURNAL OF MODERN COMPUTING}},
  keywords     = {{Dependency Parsing,Treebanks,Quality Estimation,Machine Translation}},
  language     = {{eng}},
  number       = {{2}},
  pages        = {{203--217}},
  title        = {{Detecting grammatical errors in machine translation output using dependency parsing and treebank querying}},
  url          = {{http://www.bjmc.lu.lv/fileadmin/user_upload/lu_portal/projekti/bjmc/Contents/4_2_12_Tezcan.pdf}},
  volume       = {{4}},
  year         = {{2016}},
}