Advanced search
2 files | 3.91 MB

Treebank querying with GrETEL 3: bigger, faster, stronger

Author
Organization
Abstract
We describe the new version of GrETEL (http://gretel.ccl.kuleuven.be/gretel3), an online tool which allows users to query treebanks by means of a natural language example (example-based search) or via a formal query (XPath search). The new release comprises an update to the interface and considerable improvements in the back-end search mechanism. The update of the front-end is based on user suggestions. In addition to an overall design update, major changes include a more intuitive query builder in the example-based search mode and a visualizer for syntax trees that is compatible with all modern browsers. Moreover, the results are presented to the user as soon as they are found, so users can browse the matching sentences before the treebank search is completed. We will demonstrate that those changes considerably improve the query procedure. The update of the back-end mainly includes optimizing the search algorithm for querying the (very) large SoNaR treebank. Querying this 500-million word treebank was already made possible in the previous version of GrETEL, but due to the complex search mechanism this often resulted in long query times or even a timeout before the search completed. The improved version of the search algorithm results in faster query times and more accurate search results, which greatly enhances the usability of the SoNaR treebank for linguistic research.
Keywords
user interface, ux, design, user experience, nlp, natural language processing, treebank querying, corpus linguistics, search tool, lt3

Downloads

  • Treebank querying wth GrETEL.pdf
    • full text
    • |
    • open access
    • |
    • PDF
    • |
    • 243.16 KB
  • GrETEL3-poster-clin.pdf
    • full text
    • |
    • open access
    • |
    • PDF
    • |
    • 3.66 MB

Citation

Please use this url to cite or link to this publication:

Chicago
Augustinus, Liesbeth, Bram Vanroy, and Vincent Vandeghinste. 2017. “Treebank Querying with GrETEL 3: Bigger, Faster, Stronger.” In Computational Linguistics in the Netherlands: Abstracts, ed. Vincent Vandeghinste and Frank Van Eynde.
APA
Augustinus, L., Vanroy, B., & Vandeghinste, V. (2017). Treebank querying with GrETEL 3: bigger, faster, stronger. In V. Vandeghinste & F. Van Eynde (Eds.), Computational Linguistics in the Netherlands: Abstracts. Presented at the 27th Conference of Computational Linguistics in the Netherlands.
Vancouver
1.
Augustinus L, Vanroy B, Vandeghinste V. Treebank querying with GrETEL 3: bigger, faster, stronger. In: Vandeghinste V, Van Eynde F, editors. Computational Linguistics in the Netherlands: Abstracts. 2017.
MLA
Augustinus, Liesbeth, Bram Vanroy, and Vincent Vandeghinste. “Treebank Querying with GrETEL 3: Bigger, Faster, Stronger.” Computational Linguistics in the Netherlands: Abstracts. Ed. Vincent Vandeghinste & Frank Van Eynde. 2017. Print.
@inproceedings{8537246,
  abstract     = {We describe the new version of GrETEL (http://gretel.ccl.kuleuven.be/gretel3), an online tool which allows users to query treebanks by means of a natural language example (example-based search) or via a formal query (XPath search).
The new release comprises an update to the interface and considerable improvements in the back-end search mechanism.
The update of the front-end is based on user suggestions. In addition to an overall design update, major changes include a more intuitive query builder in the example-based search mode and a visualizer for syntax trees that is compatible with all modern browsers. Moreover, the results are presented to the user as soon as they are found, so users can browse the matching sentences before the treebank search is completed. We will demonstrate that those changes considerably improve the query procedure.
The update of the back-end mainly includes optimizing the search algorithm for querying the (very) large SoNaR treebank. Querying this 500-million word treebank was already made possible in the previous version of GrETEL, but due to the complex search mechanism this often resulted in long query times or even a timeout before the search completed. The improved version of the search algorithm results in faster query times and more accurate search results, which greatly enhances the usability of the SoNaR treebank for linguistic research.},
  author       = {Augustinus, Liesbeth and Vanroy, Bram and Vandeghinste, Vincent},
  booktitle    = {Computational Linguistics in the Netherlands: Abstracts},
  editor       = {Vandeghinste, Vincent and Van Eynde, Frank},
  keyword      = {user interface,ux,design,user experience,nlp,natural language processing,treebank querying,corpus linguistics,search tool,lt3},
  language     = {eng},
  location     = {Leuven, Belgium},
  title        = {Treebank querying with GrETEL 3: bigger, faster, stronger},
  year         = {2017},
}